ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
Revision: 1.45
Committed: Wed Jul 9 21:16:14 2008 UTC (15 years, 10 months ago) by root
Branch: MAIN
Changes since 1.44: +5 -5 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.2 BDB - Asynchronous Berkeley DB access
4 root 1.1
5     =head1 SYNOPSIS
6    
7 root 1.2 use BDB;
8 root 1.1
9 root 1.19 my $env = db_env_create;
10    
11     mkdir "bdtest", 0700;
12     db_env_open
13     $env,
14     "bdtest",
15     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16     | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17     0600;
18    
19     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20    
21     my $db = db_create $env;
22     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23     | BDB::READ_UNCOMMITTED, 0600;
24     db_put $db, undef, "key", "data", 0, sub {
25     db_del $db, undef, "key";
26     };
27     db_sync $db;
28    
29 root 1.28 # when you also use Coro, management is easy:
30     use Coro::BDB;
31    
32 root 1.36 # automatic event loop intergration with AnyEvent:
33     use AnyEvent::BDB;
34 root 1.19
35     # automatic result processing with EV:
36     my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
37    
38     # with Glib:
39     add_watch Glib::IO BDB::poll_fileno,
40     in => sub { BDB::poll_cb; 1 };
41    
42     # or simply flush manually
43     BDB::flush;
44    
45    
46 root 1.1 =head1 DESCRIPTION
47    
48 root 1.10 See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
49 root 1.12 The BDB API is very similar to the C API (the translation has been very faithful).
50 root 1.10
51     See also the example sections in the document below and possibly the eg/
52     subdirectory of the BDB distribution. Last not least see the IO::AIO
53     documentation, as that module uses almost the same asynchronous request
54     model as this module.
55    
56     I know this is woefully inadequate documentation. Send a patch!
57 root 1.7
58 root 1.1
59     =head1 REQUEST ANATOMY AND LIFETIME
60    
61     Every request method creates a request. which is a C data structure not
62     directly visible to Perl.
63    
64     During their existance, bdb requests travel through the following states,
65     in order:
66    
67     =over 4
68    
69     =item ready
70    
71     Immediately after a request is created it is put into the ready state,
72     waiting for a thread to execute it.
73    
74     =item execute
75    
76     A thread has accepted the request for processing and is currently
77     executing it (e.g. blocking in read).
78    
79     =item pending
80    
81     The request has been executed and is waiting for result processing.
82    
83     While request submission and execution is fully asynchronous, result
84     processing is not and relies on the perl interpreter calling C<poll_cb>
85     (or another function with the same effect).
86    
87     =item result
88    
89     The request results are processed synchronously by C<poll_cb>.
90    
91     The C<poll_cb> function will process all outstanding aio requests by
92     calling their callbacks, freeing memory associated with them and managing
93     any groups they are contained in.
94    
95     =item done
96    
97     Request has reached the end of its lifetime and holds no resources anymore
98     (except possibly for the Perl object, but its connection to the actual
99     aio request is severed and calling its methods will either do nothing or
100     result in a runtime error).
101    
102     =back
103    
104     =cut
105    
106 root 1.2 package BDB;
107 root 1.1
108     no warnings;
109     use strict 'vars';
110    
111     use base 'Exporter';
112    
113     BEGIN {
114 root 1.42 our $VERSION = '1.7';
115 root 1.1
116 root 1.3 our @BDB_REQ = qw(
117 root 1.6 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
118 root 1.38 db_env_memp_sync db_env_memp_trickle db_env_dbrename db_env_dbremove
119 root 1.25 db_open db_close db_compact db_sync db_upgrade
120 root 1.44 db_put db_exists db_get db_pget db_del db_key_range
121 root 1.15 db_txn_commit db_txn_abort db_txn_finish
122 root 1.5 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
123 root 1.6 db_sequence_open db_sequence_close
124     db_sequence_get db_sequence_remove
125     );
126     our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
127     our @EXPORT_OK = qw(
128     poll_fileno poll_cb poll_wait flush
129     min_parallel max_parallel max_idle
130     nreqs nready npending nthreads
131     max_poll_time max_poll_reqs
132 root 1.3 );
133 root 1.1
134     require XSLoader;
135 root 1.2 XSLoader::load ("BDB", $VERSION);
136 root 1.1 }
137    
138 root 1.35 =head2 WIN32 FILENAMES/DATABASE NAME MESS
139 root 1.34
140 root 1.35 Perl on Win32 supports only ASCII filenames (the reason is that it abuses
141     an internal flag to store wether a filename is Unicode or ANSI, but that
142     flag is used for somethign else in the perl core, so there is no way to
143     detect wether a filename is ANSI or Unicode-encoded). The BDB module
144     tries to work around this issue by assuming that the filename is an ANSI
145     filename and BDB was built for unicode support.
146 root 1.34
147 root 1.10 =head2 BERKELEYDB FUNCTIONS
148    
149     All of these are functions. The create functions simply return a new
150 root 1.34 object and never block. All the remaining functions take an optional
151 root 1.24 callback as last argument. If it is missing, then the function will be
152     executed synchronously. In both cases, C<$!> will reflect the return value
153     of the function.
154 root 1.10
155     BDB functions that cannot block (mostly functions that manipulate
156     settings) are method calls on the relevant objects, so the rule of thumb
157 root 1.34 is: if it's a method, it's not blocking, if it's a function, it takes a
158 root 1.10 callback as last argument.
159    
160     In the following, C<$int> signifies an integer return value,
161 root 1.38 C<bdb_filename> is a "filename" (octets on unix, madness on windows),
162     C<U32> is an unsigned 32 bit integer, C<int> is some integer, C<NV> is a
163     floating point value.
164 root 1.10
165 root 1.41 Most C<SV *> types are generic perl scalars (for input and output of data
166     values).
167 root 1.10
168 root 1.11 The various C<DB_ENV> etc. arguments are handles return by
169     C<db_env_create>, C<db_create>, C<txn_begin> and so on. If they have an
170     appended C<_ornull> this means they are optional and you can pass C<undef>
171     for them, resulting a NULL pointer on the C level.
172 root 1.10
173 root 1.41 The C<SV *callback> is the optional callback function to call when the
174     request is completed. This last callback argument is special: the callback
175     is simply the last argument passed. If there are "optional" arguments
176     before the callback they can be left out. The callback itself can be left
177     out or specified as C<undef>, in which case the function will be executed
178     synchronously.
179    
180     For example, C<db_env_txn_checkpoint> usually is called with all integer
181     arguments zero. These can be left out, so all of these specify a call
182     to C<< DB_ENV->txn_checkpoint >>, to be executed asynchronously with a
183     callback to be called:
184    
185     db_env_txn_checkpoint $db_env, 0, 0, 0, sub { };
186     db_env_txn_checkpoint $db_env, 0, 0, sub { };
187     db_env_txn_checkpoint $db_env, sub { };
188    
189     While these all specify a call to C<< DB_ENV->txn_checkpoint >> to be
190     executed synchronously:
191    
192     db_env_txn_checkpoint $db_env, 0, 0, 0, undef;
193     db_env_txn_checkpoint $db_env, 0, 0, 0;
194     db_env_txn_checkpoint $db_env, 0;
195    
196 root 1.10 =head3 BDB functions
197    
198     Functions in the BDB namespace, exported by default:
199    
200     $env = db_env_create (U32 env_flags = 0)
201 root 1.14 flags: RPCCLIENT
202 root 1.10
203 root 1.38 db_env_open (DB_ENV *env, bdb_filename db_home, U32 open_flags, int mode, SV *callback = &PL_sv_undef)
204 root 1.14 open_flags: INIT_CDB INIT_LOCK INIT_LOG INIT_MPOOL INIT_REP INIT_TXN RECOVER RECOVER_FATAL USE_ENVIRON USE_ENVIRON_ROOT CREATE LOCKDOWN PRIVATE REGISTER SYSTEM_MEM
205 root 1.10 db_env_close (DB_ENV *env, U32 flags = 0, SV *callback = &PL_sv_undef)
206     db_env_txn_checkpoint (DB_ENV *env, U32 kbyte = 0, U32 min = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
207 root 1.14 flags: FORCE
208 root 1.10 db_env_lock_detect (DB_ENV *env, U32 flags = 0, U32 atype = DB_LOCK_DEFAULT, SV *dummy = 0, SV *callback = &PL_sv_undef)
209 root 1.14 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST
210 root 1.10 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = &PL_sv_undef)
211     db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = &PL_sv_undef)
212 root 1.38 db_env_dbremove (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, U32 flags = 0, SV *callback = &PL_sv_undef)
213     db_env_dbrename (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, bdb_filename newname, U32 flags = 0, SV *callback = &PL_sv_undef)
214 root 1.10
215     $db = db_create (DB_ENV *env = 0, U32 flags = 0)
216 root 1.14 flags: XA_CREATE
217 root 1.10
218 root 1.38 db_open (DB *db, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
219 root 1.14 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
220 root 1.10 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
221 root 1.14 flags: DB_NOSYNC
222 root 1.38 db_upgrade (DB *db, bdb_filename file, U32 flags = 0, SV *callback = &PL_sv_undef)
223 root 1.12 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef)
224 root 1.14 flags: FREELIST_ONLY FREE_SPACE
225 root 1.10 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
226     db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef)
227     db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
228 root 1.14 flags: APPEND NODUPDATA NOOVERWRITE
229 root 1.45 db_exists (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = 0) (v4.6)
230 root 1.10 db_get (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
231 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
232 root 1.10 db_pget (DB *db, DB_TXN_ornull *txn, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
233 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
234 root 1.10 db_del (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
235     db_txn_commit (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
236 root 1.14 flags: TXN_NOSYNC TXN_SYNC
237 root 1.10 db_txn_abort (DB_TXN *txn, SV *callback = &PL_sv_undef)
238 root 1.14
239 root 1.10 db_c_close (DBC *dbc, SV *callback = &PL_sv_undef)
240     db_c_count (DBC *dbc, SV *count, U32 flags = 0, SV *callback = &PL_sv_undef)
241     db_c_put (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
242 root 1.14 flags: AFTER BEFORE CURRENT KEYFIRST KEYLAST NODUPDATA
243 root 1.10 db_c_get (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
244 root 1.14 flags: CURRENT FIRST GET_BOTH GET_BOTH_RANGE GET_RECNO JOIN_ITEM LAST NEXT NEXT_DUP NEXT_NODUP PREV PREV_DUP PREV_NODUP SET SET_RANGE SET_RECNO READ_UNCOMMITTED MULTIPLE MULTIPLE_KEY RMW
245 root 1.10 db_c_pget (DBC *dbc, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
246     db_c_del (DBC *dbc, U32 flags = 0, SV *callback = &PL_sv_undef)
247    
248     db_sequence_open (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
249 root 1.14 flags: CREATE EXCL
250 root 1.10 db_sequence_close (DB_SEQUENCE *seq, U32 flags = 0, SV *callback = &PL_sv_undef)
251     db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
252 root 1.14 flags: TXN_NOSYNC
253 root 1.10 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
254 root 1.14 flags: TXN_NOSYNC
255 root 1.10
256 root 1.15 =head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
257    
258 root 1.16 This is not actually a Berkeley DB function but a BDB module
259     extension. The background for this exytension is: It is very annoying to
260     have to check every single BDB function for error returns and provide a
261     codepath out of your transaction. While the BDB module still makes this
262     possible, it contains the following extensions:
263 root 1.15
264     When a transaction-protected function returns any operating system
265     error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
266 root 1.16 transaction. This flag is also set by Berkeley DB functions themselves
267 root 1.15 when an operation fails with LOCK_DEADLOCK, and it causes all further
268     operations on that transaction (including C<db_txn_commit>) to fail.
269    
270     The C<db_txn_finish> request will look at this flag, and, if it is set,
271     will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
272 root 1.16 if it isn't set to something else yet). If it isn't set, it will call
273     C<db_txn_commit> and return the error normally.
274 root 1.15
275     How to use this? Easy: just write your transaction normally:
276    
277     my $txn = $db_env->txn_begin;
278     db_get $db, $txn, "key", my $data;
279     db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
280     db_txn_finish $txn;
281     die "transaction failed" if $!;
282    
283     That is, handle only the expected errors. If something unexpected happens
284     (EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
285     requests (db_put in this case) will simply be skipped (they will fail with
286     LOCK_DEADLOCK) and the transaction will be aborted.
287    
288 root 1.16 You can use the C<< $txn->failed >> method to check wether a transaction
289 root 1.15 has failed in this way and abort further processing (excluding
290     C<db_txn_finish>).
291    
292 root 1.10 =head3 DB_ENV/database environment methods
293    
294     Methods available on DB_ENV/$env handles:
295    
296     DESTROY (DB_ENV_ornull *env)
297     CODE:
298     if (env)
299     env->close (env, 0);
300    
301     $int = $env->set_data_dir (const char *dir)
302     $int = $env->set_tmp_dir (const char *dir)
303     $int = $env->set_lg_dir (const char *dir)
304     $int = $env->set_shm_key (long shm_key)
305     $int = $env->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
306 root 1.39 $int = $env->set_flags (U32 flags, int onoff = 1)
307 root 1.45 $int = $env->log_set_config (U32 flags, int onoff = 1) (v4.7)
308     $int = $env->set_intermediate_dir_mode (const char *modestring) (v4.7)
309 root 1.10 $env->set_errfile (FILE *errfile = 0)
310     $env->set_msgfile (FILE *msgfile = 0)
311     $int = $env->set_verbose (U32 which, int onoff = 1)
312     $int = $env->set_encrypt (const char *password, U32 flags = 0)
313 root 1.15 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
314 root 1.10 $int = $env->set_mp_max_openfd (int maxopenfd);
315     $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
316     $int = $env->set_mp_mmapsize (int mmapsize_mb)
317     $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
318     $int = $env->set_lk_max_lockers (U32 max)
319     $int = $env->set_lk_max_locks (U32 max)
320     $int = $env->set_lk_max_objects (U32 max)
321     $int = $env->set_lg_bsize (U32 max)
322     $int = $env->set_lg_max (U32 max)
323 root 1.20 $int = $env->mutex_set_increment (U32 increment)
324     $int = $env->mutex_set_tas_spins (U32 tas_spins)
325     $int = $env->mutex_set_max (U32 max)
326     $int = $env->mutex_set_align (U32 align)
327 root 1.10
328     $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
329 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
330 root 1.45 $txn = $env->cdsgroup_begin; (v4.5)
331 root 1.10
332 root 1.12 =head4 Example:
333 root 1.10
334     use AnyEvent;
335     use BDB;
336    
337     our $FH; open $FH, "<&=" . BDB::poll_fileno;
338     our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
339    
340     BDB::min_parallel 8;
341    
342     my $env = db_env_create;
343    
344     mkdir "bdtest", 0700;
345     db_env_open
346     $env,
347     "bdtest",
348     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
349     0600;
350    
351     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
352    
353    
354     =head3 DB/database methods
355    
356     Methods available on DB/$db handles:
357    
358     DESTROY (DB_ornull *db)
359     CODE:
360     if (db)
361     {
362     SV *env = (SV *)db->app_private;
363     db->close (db, 0);
364     SvREFCNT_dec (env);
365     }
366    
367     $int = $db->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
368     $int = $db->set_flags (U32 flags)
369 root 1.14 flags: CHKSUM ENCRYPT TXN_NOT_DURABLE
370     Btree: DUP DUPSORT RECNUM REVSPLITOFF
371     Hash: DUP DUPSORT
372     Queue: INORDER
373     Recno: RENUMBER SNAPSHOT
374    
375 root 1.10 $int = $db->set_encrypt (const char *password, U32 flags)
376     $int = $db->set_lorder (int lorder)
377     $int = $db->set_bt_minkey (U32 minkey)
378     $int = $db->set_re_delim (int delim)
379     $int = $db->set_re_pad (int re_pad)
380     $int = $db->set_re_source (char *source)
381     $int = $db->set_re_len (U32 re_len)
382     $int = $db->set_h_ffactor (U32 h_ffactor)
383     $int = $db->set_h_nelem (U32 h_nelem)
384     $int = $db->set_q_extentsize (U32 extentsize)
385    
386     $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0)
387 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT
388 root 1.10 $seq = $db->sequence (U32 flags = 0)
389    
390 root 1.12 =head4 Example:
391 root 1.10
392     my $db = db_create $env;
393     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600;
394    
395     for (1..1000) {
396     db_put $db, undef, "key $_", "data $_";
397    
398     db_key_range $db, undef, "key $_", my $keyrange;
399     my ($lt, $eq, $gt) = @$keyrange;
400     }
401    
402     db_del $db, undef, "key $_" for 1..1000;
403    
404     db_sync $db;
405    
406    
407     =head3 DB_TXN/transaction methods
408    
409     Methods available on DB_TXN/$txn handles:
410    
411     DESTROY (DB_TXN_ornull *txn)
412     CODE:
413     if (txn)
414     txn->abort (txn);
415    
416 root 1.15 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
417 root 1.14 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
418 root 1.10
419 root 1.15 $bool = $txn->failed
420     # see db_txn_finish documentation, above
421    
422 root 1.10
423     =head3 DBC/cursor methods
424    
425     Methods available on DBC/$dbc handles:
426    
427     DESTROY (DBC_ornull *dbc)
428     CODE:
429     if (dbc)
430     dbc->c_close (dbc);
431    
432 root 1.45 $int = $cursor->set_priority ($priority = PRIORITY_*) (v4.6)
433 root 1.28
434 root 1.12 =head4 Example:
435 root 1.10
436     my $c = $db->cursor;
437    
438     for (;;) {
439     db_c_get $c, my $key, my $data, BDB::NEXT;
440     warn "<$!,$key,$data>";
441     last if $!;
442     }
443    
444     db_c_close $c;
445    
446 root 1.12
447 root 1.10 =head3 DB_SEQUENCE/sequence methods
448    
449     Methods available on DB_SEQUENCE/$seq handles:
450    
451     DESTROY (DB_SEQUENCE_ornull *seq)
452     CODE:
453     if (seq)
454     seq->close (seq, 0);
455    
456     $int = $seq->initial_value (db_seq_t value)
457     $int = $seq->set_cachesize (U32 size)
458     $int = $seq->set_flags (U32 flags)
459 root 1.14 flags: SEQ_DEC SEQ_INC SEQ_WRAP
460 root 1.10 $int = $seq->set_range (db_seq_t min, db_seq_t max)
461    
462 root 1.12 =head4 Example:
463 root 1.10
464     my $seq = $db->sequence;
465    
466     db_sequence_open $seq, undef, "seq", BDB::CREATE;
467     db_sequence_get $seq, undef, 1, my $value;
468    
469    
470 root 1.1 =head2 SUPPORT FUNCTIONS
471    
472     =head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION
473    
474     =over 4
475    
476 root 1.23 =item $msg = BDB::strerror [$errno]
477    
478     Returns the string corresponding to the given errno value. If no argument
479     is given, use C<$!>.
480    
481 root 1.33 Note that the BDB module also patches the C<$!> variable directly, so you
482     should be able to get a bdb error string by simply stringifying C<$!>.
483    
484 root 1.2 =item $fileno = BDB::poll_fileno
485 root 1.1
486     Return the I<request result pipe file descriptor>. This filehandle must be
487     polled for reading by some mechanism outside this module (e.g. Event or
488     select, see below or the SYNOPSIS). If the pipe becomes readable you have
489     to call C<poll_cb> to check the results.
490    
491     See C<poll_cb> for an example.
492    
493 root 1.2 =item BDB::poll_cb
494 root 1.1
495     Process some outstanding events on the result pipe. You have to call this
496     regularly. Returns the number of events processed. Returns immediately
497     when no events are outstanding. The amount of events processed depends on
498 root 1.2 the settings of C<BDB::max_poll_req> and C<BDB::max_poll_time>.
499 root 1.1
500     If not all requests were processed for whatever reason, the filehandle
501     will still be ready when C<poll_cb> returns.
502    
503     Example: Install an Event watcher that automatically calls
504 root 1.2 BDB::poll_cb with high priority:
505 root 1.1
506 root 1.2 Event->io (fd => BDB::poll_fileno,
507 root 1.1 poll => 'r', async => 1,
508 root 1.2 cb => \&BDB::poll_cb);
509 root 1.1
510 root 1.2 =item BDB::max_poll_reqs $nreqs
511 root 1.1
512 root 1.2 =item BDB::max_poll_time $seconds
513 root 1.1
514     These set the maximum number of requests (default C<0>, meaning infinity)
515 root 1.2 that are being processed by C<BDB::poll_cb> in one call, respectively
516 root 1.1 the maximum amount of time (default C<0>, meaning infinity) spent in
517 root 1.2 C<BDB::poll_cb> to process requests (more correctly the mininum amount
518 root 1.1 of time C<poll_cb> is allowed to use).
519    
520     Setting C<max_poll_time> to a non-zero value creates an overhead of one
521     syscall per request processed, which is not normally a problem unless your
522     callbacks are really really fast or your OS is really really slow (I am
523     not mentioning Solaris here). Using C<max_poll_reqs> incurs no overhead.
524    
525     Setting these is useful if you want to ensure some level of
526     interactiveness when perl is not fast enough to process all requests in
527     time.
528    
529     For interactive programs, values such as C<0.01> to C<0.1> should be fine.
530    
531 root 1.18 Example: Install an EV watcher that automatically calls
532 root 1.2 BDB::poll_cb with low priority, to ensure that other parts of the
533 root 1.18 program get the CPU sometimes even under high load.
534 root 1.1
535     # try not to spend much more than 0.1s in poll_cb
536 root 1.2 BDB::max_poll_time 0.1;
537 root 1.1
538 root 1.18 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
539 root 1.1
540 root 1.2 =item BDB::poll_wait
541 root 1.1
542     If there are any outstanding requests and none of them in the result
543     phase, wait till the result filehandle becomes ready for reading (simply
544     does a C<select> on the filehandle. This is useful if you want to
545     synchronously wait for some requests to finish).
546    
547     See C<nreqs> for an example.
548    
549 root 1.2 =item BDB::poll
550 root 1.1
551     Waits until some requests have been handled.
552    
553     Returns the number of requests processed, but is otherwise strictly
554     equivalent to:
555    
556 root 1.2 BDB::poll_wait, BDB::poll_cb
557 root 1.1
558 root 1.2 =item BDB::flush
559 root 1.1
560 root 1.18 Wait till all outstanding BDB requests have been handled.
561 root 1.1
562     Strictly equivalent to:
563    
564 root 1.2 BDB::poll_wait, BDB::poll_cb
565     while BDB::nreqs;
566 root 1.1
567 root 1.8 =back
568    
569 root 1.39 =head3 VERSION CHECKING
570    
571     BerkeleyDB comes in various versions, many of them have minor
572     incompatibilities. This means that traditional "at least version x.x"
573     checks are often not sufficient.
574    
575 root 1.40 Example: set the log_autoremove option in a way compatible with <v.47 and
576     v4.7. Note the use of & on the constants to avoid triggering a compiletime
577     bug when the symbol isn't available.
578    
579     $DB_ENV->set_flags (&BDB::LOG_AUTOREMOVE ) if BDB::VERSION v0, v4.7;
580     $DB_ENV->log_set_config (&BDB::LOG_AUTO_REMOVE) if BDB::VERSION v4.7;
581    
582 root 1.39 =over 4
583    
584     =item BDB::VERSION
585    
586     The C<BDB::VERSION> function, when called without arguments, returns the
587     Berkeley DB version as a v-string (usually with 3 components). You should
588     use C<lt> and C<ge> operators exclusively to make comparisons.
589    
590     Example: check for at least version 4.7.
591    
592     BDB::VERSION ge v4.7 or die;
593    
594     =item BDB::VERSION min-version
595    
596     Returns true if the BDB version is at least the given version (specified
597     as a v-string), false otherwise.
598    
599     Example: check for at least version 4.5.
600    
601     BDB::VERSION v4.7 or die;
602    
603     =item BDB::VERSION min-version, max-version
604    
605     Returns true of the BDB version is at least version C<min-version> (specify C<undef> or C<v0> for any minimum version)
606     and less then C<max-version>.
607    
608     Example: check wether version is strictly less then v4.7.
609    
610     BDB::VERSION v0, v4.7
611     or die "version 4.7 is not yet supported";
612    
613     =back
614    
615     =cut
616    
617     sub VERSION {
618     if (@_ > 0) {
619     return undef if VERSION_v lt $_[0];
620     if (@_ > 1) {
621     return undef if VERSION_v ge $_[1];
622     }
623     }
624    
625     VERSION_v
626     }
627    
628 root 1.1 =head3 CONTROLLING THE NUMBER OF THREADS
629    
630 root 1.8 =over 4
631    
632 root 1.2 =item BDB::min_parallel $nthreads
633 root 1.1
634 root 1.18 Set the minimum number of BDB threads to C<$nthreads>. The current
635 root 1.1 default is C<8>, which means eight asynchronous operations can execute
636     concurrently at any one time (the number of outstanding requests,
637     however, is unlimited).
638    
639 root 1.18 BDB starts threads only on demand, when an BDB request is queued and
640 root 1.1 no free thread exists. Please note that queueing up a hundred requests can
641     create demand for a hundred threads, even if it turns out that everything
642     is in the cache and could have been processed faster by a single thread.
643    
644     It is recommended to keep the number of threads relatively low, as some
645     Linux kernel versions will scale negatively with the number of threads
646     (higher parallelity => MUCH higher latency). With current Linux 2.6
647     versions, 4-32 threads should be fine.
648    
649     Under most circumstances you don't need to call this function, as the
650     module selects a default that is suitable for low to moderate load.
651    
652 root 1.2 =item BDB::max_parallel $nthreads
653 root 1.1
654 root 1.18 Sets the maximum number of BDB threads to C<$nthreads>. If more than the
655 root 1.1 specified number of threads are currently running, this function kills
656     them. This function blocks until the limit is reached.
657    
658     While C<$nthreads> are zero, aio requests get queued but not executed
659     until the number of threads has been increased again.
660    
661     This module automatically runs C<max_parallel 0> at program end, to ensure
662     that all threads are killed and that there are no outstanding requests.
663    
664     Under normal circumstances you don't need to call this function.
665    
666 root 1.2 =item BDB::max_idle $nthreads
667 root 1.1
668     Limit the number of threads (default: 4) that are allowed to idle (i.e.,
669     threads that did not get a request to process within 10 seconds). That
670     means if a thread becomes idle while C<$nthreads> other threads are also
671     idle, it will free its resources and exit.
672    
673     This is useful when you allow a large number of threads (e.g. 100 or 1000)
674     to allow for extremely high load situations, but want to free resources
675     under normal circumstances (1000 threads can easily consume 30MB of RAM).
676    
677     The default is probably ok in most situations, especially if thread
678     creation is fast. If thread creation is very slow on your system you might
679     want to use larger values.
680    
681 root 1.2 =item $oldmaxreqs = BDB::max_outstanding $maxreqs
682 root 1.1
683     This is a very bad function to use in interactive programs because it
684     blocks, and a bad way to reduce concurrency because it is inexact: Better
685     use an C<aio_group> together with a feed callback.
686    
687     Sets the maximum number of outstanding requests to C<$nreqs>. If you
688     to queue up more than this number of requests, the next call to the
689     C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
690     function will block until the limit is no longer exceeded.
691    
692     The default value is very large, so there is no practical limit on the
693     number of outstanding requests.
694    
695     You can still queue as many requests as you want. Therefore,
696     C<max_oustsanding> is mainly useful in simple scripts (with low values) or
697     as a stop gap to shield against fatal memory overflow (with large values).
698    
699 root 1.3 =item BDB::set_sync_prepare $cb
700    
701     Sets a callback that is called whenever a request is created without an
702     explicit callback. It has to return two code references. The first is used
703 root 1.37 as the request callback (it should save the return status), and the second
704     is called to wait until the first callback has been called (it must set
705     C<$!> to the return status).
706    
707     This mechanism can be used to include BDB into other event mechanisms,
708     such as L<AnyEvent::BDB> or L<Coro::BDB>.
709    
710     The default implementation works like this:
711 root 1.3
712     sub {
713     my $status;
714     (
715     sub { $status = $! },
716     sub { BDB::poll while !defined $status; $! = $status },
717     )
718     }
719    
720 root 1.37 It simply blocks the process till the request has finished and then sets
721     C<$!> to the return value. This means that if you don't use a callback,
722     BDB will simply fall back to synchronous operations.
723    
724 root 1.3 =back
725    
726 root 1.1 =head3 STATISTICAL INFORMATION
727    
728 root 1.3 =over 4
729    
730 root 1.2 =item BDB::nreqs
731 root 1.1
732     Returns the number of requests currently in the ready, execute or pending
733     states (i.e. for which their callback has not been invoked yet).
734    
735     Example: wait till there are no outstanding requests anymore:
736    
737 root 1.2 BDB::poll_wait, BDB::poll_cb
738     while BDB::nreqs;
739 root 1.1
740 root 1.2 =item BDB::nready
741 root 1.1
742     Returns the number of requests currently in the ready state (not yet
743     executed).
744    
745 root 1.2 =item BDB::npending
746 root 1.1
747     Returns the number of requests currently in the pending state (executed,
748     but not yet processed by poll_cb).
749    
750     =back
751    
752     =cut
753    
754 root 1.3 set_sync_prepare {
755     my $status;
756     (
757     sub {
758     $status = $!;
759     },
760     sub {
761     BDB::poll while !defined $status;
762     $! = $status;
763     },
764     )
765     };
766    
767 root 1.1 min_parallel 8;
768    
769     END { flush }
770    
771     1;
772    
773     =head2 FORK BEHAVIOUR
774    
775     This module should do "the right thing" when the process using it forks:
776    
777 root 1.18 Before the fork, BDB enters a quiescent state where no requests
778 root 1.1 can be added in other threads and no results will be processed. After
779     the fork the parent simply leaves the quiescent state and continues
780     request/result processing, while the child frees the request/result queue
781     (so that the requests started before the fork will only be handled in the
782     parent). Threads will be started on demand until the limit set in the
783     parent process has been reached again.
784    
785     In short: the parent will, after a short pause, continue as if fork had
786 root 1.18 not been called, while the child will act as if BDB has not been used
787 root 1.1 yet.
788    
789 root 1.26 Win32 note: there is no fork on win32, and perls emulation of it is too
790     broken to be supported, so do not use BDB in a windows pseudo-fork, better
791     yet, switch to a more capable platform.
792    
793 root 1.1 =head2 MEMORY USAGE
794    
795     Per-request usage:
796    
797     Each aio request uses - depending on your architecture - around 100-200
798     bytes of memory. In addition, stat requests need a stat buffer (possibly
799     a few hundred bytes), readdir requires a result buffer and so on. Perl
800     scalars and other data passed into aio requests will also be locked and
801     will consume memory till the request has entered the done state.
802    
803 root 1.13 This is not awfully much, so queuing lots of requests is not usually a
804 root 1.1 problem.
805    
806     Per-thread usage:
807    
808     In the execution phase, some aio requests require more memory for
809     temporary buffers, and each thread requires a stack and other data
810     structures (usually around 16k-128k, depending on the OS).
811    
812     =head1 KNOWN BUGS
813    
814 root 1.15 Known bugs will be fixed in the next release, except:
815    
816     If you use a transaction in any request, and the request returns
817     with an operating system error or DB_LOCK_NOTGRANTED, the internal
818     TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
819     above.
820 root 1.1
821     =head1 SEE ALSO
822    
823 root 1.36 L<AnyEvent::BDB> (event loop integration), L<Coro::BDB> (more natural
824     syntax), L<IO::AIO> (nice to have).
825 root 1.1
826     =head1 AUTHOR
827    
828     Marc Lehmann <schmorp@schmorp.de>
829     http://home.schmorp.de/
830    
831     =cut
832