ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
Revision: 1.33
Committed: Sun Mar 30 04:57:55 2008 UTC (16 years, 1 month ago) by root
Branch: MAIN
Changes since 1.32: +3 -0 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.2 BDB - Asynchronous Berkeley DB access
4 root 1.1
5     =head1 SYNOPSIS
6    
7 root 1.2 use BDB;
8 root 1.1
9 root 1.19 my $env = db_env_create;
10    
11     mkdir "bdtest", 0700;
12     db_env_open
13     $env,
14     "bdtest",
15     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16     | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17     0600;
18    
19     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20    
21     my $db = db_create $env;
22     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23     | BDB::READ_UNCOMMITTED, 0600;
24     db_put $db, undef, "key", "data", 0, sub {
25     db_del $db, undef, "key";
26     };
27     db_sync $db;
28    
29 root 1.28 # when you also use Coro, management is easy:
30     use Coro::BDB;
31    
32 root 1.19 # automatic result processing with AnyEvent:
33     our $FH; open $FH, "<&=" . BDB::poll_fileno;
34     our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
35    
36     # automatic result processing with EV:
37     my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
38    
39     # with Glib:
40     add_watch Glib::IO BDB::poll_fileno,
41     in => sub { BDB::poll_cb; 1 };
42    
43     # or simply flush manually
44     BDB::flush;
45    
46    
47 root 1.1 =head1 DESCRIPTION
48    
49 root 1.10 See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
50 root 1.12 The BDB API is very similar to the C API (the translation has been very faithful).
51 root 1.10
52     See also the example sections in the document below and possibly the eg/
53     subdirectory of the BDB distribution. Last not least see the IO::AIO
54     documentation, as that module uses almost the same asynchronous request
55     model as this module.
56    
57     I know this is woefully inadequate documentation. Send a patch!
58 root 1.7
59 root 1.1
60     =head1 REQUEST ANATOMY AND LIFETIME
61    
62     Every request method creates a request. which is a C data structure not
63     directly visible to Perl.
64    
65     During their existance, bdb requests travel through the following states,
66     in order:
67    
68     =over 4
69    
70     =item ready
71    
72     Immediately after a request is created it is put into the ready state,
73     waiting for a thread to execute it.
74    
75     =item execute
76    
77     A thread has accepted the request for processing and is currently
78     executing it (e.g. blocking in read).
79    
80     =item pending
81    
82     The request has been executed and is waiting for result processing.
83    
84     While request submission and execution is fully asynchronous, result
85     processing is not and relies on the perl interpreter calling C<poll_cb>
86     (or another function with the same effect).
87    
88     =item result
89    
90     The request results are processed synchronously by C<poll_cb>.
91    
92     The C<poll_cb> function will process all outstanding aio requests by
93     calling their callbacks, freeing memory associated with them and managing
94     any groups they are contained in.
95    
96     =item done
97    
98     Request has reached the end of its lifetime and holds no resources anymore
99     (except possibly for the Perl object, but its connection to the actual
100     aio request is severed and calling its methods will either do nothing or
101     result in a runtime error).
102    
103     =back
104    
105     =cut
106    
107 root 1.2 package BDB;
108 root 1.1
109     no warnings;
110     use strict 'vars';
111    
112     use base 'Exporter';
113    
114     BEGIN {
115 root 1.32 our $VERSION = '1.44';
116 root 1.1
117 root 1.3 our @BDB_REQ = qw(
118 root 1.6 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
119     db_env_memp_sync db_env_memp_trickle
120 root 1.25 db_open db_close db_compact db_sync db_upgrade
121     db_put db_get db_pget db_del db_key_range
122 root 1.15 db_txn_commit db_txn_abort db_txn_finish
123 root 1.5 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
124 root 1.6 db_sequence_open db_sequence_close
125     db_sequence_get db_sequence_remove
126     );
127     our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
128     our @EXPORT_OK = qw(
129     poll_fileno poll_cb poll_wait flush
130     min_parallel max_parallel max_idle
131     nreqs nready npending nthreads
132     max_poll_time max_poll_reqs
133 root 1.3 );
134 root 1.1
135     require XSLoader;
136 root 1.2 XSLoader::load ("BDB", $VERSION);
137 root 1.1 }
138    
139 root 1.10 =head2 BERKELEYDB FUNCTIONS
140    
141     All of these are functions. The create functions simply return a new
142     object and never block. All the remaining functions all take an optional
143 root 1.24 callback as last argument. If it is missing, then the function will be
144     executed synchronously. In both cases, C<$!> will reflect the return value
145     of the function.
146 root 1.10
147     BDB functions that cannot block (mostly functions that manipulate
148     settings) are method calls on the relevant objects, so the rule of thumb
149     is: if its a method, its not blocking, if its a function, it takes a
150     callback as last argument.
151    
152     In the following, C<$int> signifies an integer return value,
153     C<octetstring> is a "binary string" (i.e. a perl string with no character
154     indices >255), C<U32> is an unsigned 32 bit integer, C<int> is some
155     integer, C<NV> is a floating point value.
156    
157     The C<SV *> types are generic perl scalars (for input and output of data
158     values), and the C<SV *callback> is the optional callback function to call
159     when the request is completed.
160    
161 root 1.11 The various C<DB_ENV> etc. arguments are handles return by
162     C<db_env_create>, C<db_create>, C<txn_begin> and so on. If they have an
163     appended C<_ornull> this means they are optional and you can pass C<undef>
164     for them, resulting a NULL pointer on the C level.
165 root 1.10
166     =head3 BDB functions
167    
168     Functions in the BDB namespace, exported by default:
169    
170     $env = db_env_create (U32 env_flags = 0)
171 root 1.14 flags: RPCCLIENT
172 root 1.10
173     db_env_open (DB_ENV *env, octetstring db_home, U32 open_flags, int mode, SV *callback = &PL_sv_undef)
174 root 1.14 open_flags: INIT_CDB INIT_LOCK INIT_LOG INIT_MPOOL INIT_REP INIT_TXN RECOVER RECOVER_FATAL USE_ENVIRON USE_ENVIRON_ROOT CREATE LOCKDOWN PRIVATE REGISTER SYSTEM_MEM
175 root 1.10 db_env_close (DB_ENV *env, U32 flags = 0, SV *callback = &PL_sv_undef)
176     db_env_txn_checkpoint (DB_ENV *env, U32 kbyte = 0, U32 min = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
177 root 1.14 flags: FORCE
178 root 1.10 db_env_lock_detect (DB_ENV *env, U32 flags = 0, U32 atype = DB_LOCK_DEFAULT, SV *dummy = 0, SV *callback = &PL_sv_undef)
179 root 1.14 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST
180 root 1.10 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = &PL_sv_undef)
181     db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = &PL_sv_undef)
182    
183     $db = db_create (DB_ENV *env = 0, U32 flags = 0)
184 root 1.14 flags: XA_CREATE
185 root 1.10
186     db_open (DB *db, DB_TXN_ornull *txnid, octetstring file, octetstring database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
187 root 1.14 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
188 root 1.10 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
189 root 1.14 flags: DB_NOSYNC
190 root 1.25 db_upgrade (DB *db, octetstring file, U32 flags = 0, SV *callback = &PL_sv_undef)
191 root 1.12 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef)
192 root 1.14 flags: FREELIST_ONLY FREE_SPACE
193 root 1.10 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
194     db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef)
195     db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
196 root 1.14 flags: APPEND NODUPDATA NOOVERWRITE
197 root 1.10 db_get (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
198 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
199 root 1.10 db_pget (DB *db, DB_TXN_ornull *txn, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
200 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
201 root 1.10 db_del (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
202     db_txn_commit (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
203 root 1.14 flags: TXN_NOSYNC TXN_SYNC
204 root 1.10 db_txn_abort (DB_TXN *txn, SV *callback = &PL_sv_undef)
205 root 1.14
206 root 1.10 db_c_close (DBC *dbc, SV *callback = &PL_sv_undef)
207     db_c_count (DBC *dbc, SV *count, U32 flags = 0, SV *callback = &PL_sv_undef)
208     db_c_put (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
209 root 1.14 flags: AFTER BEFORE CURRENT KEYFIRST KEYLAST NODUPDATA
210 root 1.10 db_c_get (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
211 root 1.14 flags: CURRENT FIRST GET_BOTH GET_BOTH_RANGE GET_RECNO JOIN_ITEM LAST NEXT NEXT_DUP NEXT_NODUP PREV PREV_DUP PREV_NODUP SET SET_RANGE SET_RECNO READ_UNCOMMITTED MULTIPLE MULTIPLE_KEY RMW
212 root 1.10 db_c_pget (DBC *dbc, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
213     db_c_del (DBC *dbc, U32 flags = 0, SV *callback = &PL_sv_undef)
214    
215     db_sequence_open (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
216 root 1.14 flags: CREATE EXCL
217 root 1.10 db_sequence_close (DB_SEQUENCE *seq, U32 flags = 0, SV *callback = &PL_sv_undef)
218     db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
219 root 1.14 flags: TXN_NOSYNC
220 root 1.10 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
221 root 1.14 flags: TXN_NOSYNC
222 root 1.10
223 root 1.15 =head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
224    
225 root 1.16 This is not actually a Berkeley DB function but a BDB module
226     extension. The background for this exytension is: It is very annoying to
227     have to check every single BDB function for error returns and provide a
228     codepath out of your transaction. While the BDB module still makes this
229     possible, it contains the following extensions:
230 root 1.15
231     When a transaction-protected function returns any operating system
232     error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
233 root 1.16 transaction. This flag is also set by Berkeley DB functions themselves
234 root 1.15 when an operation fails with LOCK_DEADLOCK, and it causes all further
235     operations on that transaction (including C<db_txn_commit>) to fail.
236    
237     The C<db_txn_finish> request will look at this flag, and, if it is set,
238     will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
239 root 1.16 if it isn't set to something else yet). If it isn't set, it will call
240     C<db_txn_commit> and return the error normally.
241 root 1.15
242     How to use this? Easy: just write your transaction normally:
243    
244     my $txn = $db_env->txn_begin;
245     db_get $db, $txn, "key", my $data;
246     db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
247     db_txn_finish $txn;
248     die "transaction failed" if $!;
249    
250     That is, handle only the expected errors. If something unexpected happens
251     (EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
252     requests (db_put in this case) will simply be skipped (they will fail with
253     LOCK_DEADLOCK) and the transaction will be aborted.
254    
255 root 1.16 You can use the C<< $txn->failed >> method to check wether a transaction
256 root 1.15 has failed in this way and abort further processing (excluding
257     C<db_txn_finish>).
258    
259 root 1.10 =head3 DB_ENV/database environment methods
260    
261     Methods available on DB_ENV/$env handles:
262    
263     DESTROY (DB_ENV_ornull *env)
264     CODE:
265     if (env)
266     env->close (env, 0);
267    
268     $int = $env->set_data_dir (const char *dir)
269     $int = $env->set_tmp_dir (const char *dir)
270     $int = $env->set_lg_dir (const char *dir)
271     $int = $env->set_shm_key (long shm_key)
272     $int = $env->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
273     $int = $env->set_flags (U32 flags, int onoff)
274     $env->set_errfile (FILE *errfile = 0)
275     $env->set_msgfile (FILE *msgfile = 0)
276     $int = $env->set_verbose (U32 which, int onoff = 1)
277     $int = $env->set_encrypt (const char *password, U32 flags = 0)
278 root 1.15 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
279 root 1.10 $int = $env->set_mp_max_openfd (int maxopenfd);
280     $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
281     $int = $env->set_mp_mmapsize (int mmapsize_mb)
282     $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
283     $int = $env->set_lk_max_lockers (U32 max)
284     $int = $env->set_lk_max_locks (U32 max)
285     $int = $env->set_lk_max_objects (U32 max)
286     $int = $env->set_lg_bsize (U32 max)
287     $int = $env->set_lg_max (U32 max)
288 root 1.20 $int = $env->mutex_set_increment (U32 increment)
289     $int = $env->mutex_set_tas_spins (U32 tas_spins)
290     $int = $env->mutex_set_max (U32 max)
291     $int = $env->mutex_set_align (U32 align)
292 root 1.10
293     $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
294 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
295 root 1.10
296 root 1.12 =head4 Example:
297 root 1.10
298     use AnyEvent;
299     use BDB;
300    
301     our $FH; open $FH, "<&=" . BDB::poll_fileno;
302     our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
303    
304     BDB::min_parallel 8;
305    
306     my $env = db_env_create;
307    
308     mkdir "bdtest", 0700;
309     db_env_open
310     $env,
311     "bdtest",
312     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
313     0600;
314    
315     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
316    
317    
318     =head3 DB/database methods
319    
320     Methods available on DB/$db handles:
321    
322     DESTROY (DB_ornull *db)
323     CODE:
324     if (db)
325     {
326     SV *env = (SV *)db->app_private;
327     db->close (db, 0);
328     SvREFCNT_dec (env);
329     }
330    
331     $int = $db->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
332     $int = $db->set_flags (U32 flags)
333 root 1.14 flags: CHKSUM ENCRYPT TXN_NOT_DURABLE
334     Btree: DUP DUPSORT RECNUM REVSPLITOFF
335     Hash: DUP DUPSORT
336     Queue: INORDER
337     Recno: RENUMBER SNAPSHOT
338    
339 root 1.10 $int = $db->set_encrypt (const char *password, U32 flags)
340     $int = $db->set_lorder (int lorder)
341     $int = $db->set_bt_minkey (U32 minkey)
342     $int = $db->set_re_delim (int delim)
343     $int = $db->set_re_pad (int re_pad)
344     $int = $db->set_re_source (char *source)
345     $int = $db->set_re_len (U32 re_len)
346     $int = $db->set_h_ffactor (U32 h_ffactor)
347     $int = $db->set_h_nelem (U32 h_nelem)
348     $int = $db->set_q_extentsize (U32 extentsize)
349    
350     $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0)
351 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT
352 root 1.10 $seq = $db->sequence (U32 flags = 0)
353    
354 root 1.12 =head4 Example:
355 root 1.10
356     my $db = db_create $env;
357     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600;
358    
359     for (1..1000) {
360     db_put $db, undef, "key $_", "data $_";
361    
362     db_key_range $db, undef, "key $_", my $keyrange;
363     my ($lt, $eq, $gt) = @$keyrange;
364     }
365    
366     db_del $db, undef, "key $_" for 1..1000;
367    
368     db_sync $db;
369    
370    
371     =head3 DB_TXN/transaction methods
372    
373     Methods available on DB_TXN/$txn handles:
374    
375     DESTROY (DB_TXN_ornull *txn)
376     CODE:
377     if (txn)
378     txn->abort (txn);
379    
380 root 1.15 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
381 root 1.14 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
382 root 1.10
383 root 1.15 $bool = $txn->failed
384     # see db_txn_finish documentation, above
385    
386 root 1.10
387     =head3 DBC/cursor methods
388    
389     Methods available on DBC/$dbc handles:
390    
391     DESTROY (DBC_ornull *dbc)
392     CODE:
393     if (dbc)
394     dbc->c_close (dbc);
395    
396 root 1.28 $int = $cursor->set_priority ($priority = PRIORITY_*)
397    
398 root 1.12 =head4 Example:
399 root 1.10
400     my $c = $db->cursor;
401    
402     for (;;) {
403     db_c_get $c, my $key, my $data, BDB::NEXT;
404     warn "<$!,$key,$data>";
405     last if $!;
406     }
407    
408     db_c_close $c;
409    
410 root 1.12
411 root 1.10 =head3 DB_SEQUENCE/sequence methods
412    
413     Methods available on DB_SEQUENCE/$seq handles:
414    
415     DESTROY (DB_SEQUENCE_ornull *seq)
416     CODE:
417     if (seq)
418     seq->close (seq, 0);
419    
420     $int = $seq->initial_value (db_seq_t value)
421     $int = $seq->set_cachesize (U32 size)
422     $int = $seq->set_flags (U32 flags)
423 root 1.14 flags: SEQ_DEC SEQ_INC SEQ_WRAP
424 root 1.10 $int = $seq->set_range (db_seq_t min, db_seq_t max)
425    
426 root 1.12 =head4 Example:
427 root 1.10
428     my $seq = $db->sequence;
429    
430     db_sequence_open $seq, undef, "seq", BDB::CREATE;
431     db_sequence_get $seq, undef, 1, my $value;
432    
433    
434 root 1.1 =head2 SUPPORT FUNCTIONS
435    
436     =head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION
437    
438     =over 4
439    
440 root 1.23 =item $msg = BDB::strerror [$errno]
441    
442     Returns the string corresponding to the given errno value. If no argument
443     is given, use C<$!>.
444    
445 root 1.33 Note that the BDB module also patches the C<$!> variable directly, so you
446     should be able to get a bdb error string by simply stringifying C<$!>.
447    
448 root 1.2 =item $fileno = BDB::poll_fileno
449 root 1.1
450     Return the I<request result pipe file descriptor>. This filehandle must be
451     polled for reading by some mechanism outside this module (e.g. Event or
452     select, see below or the SYNOPSIS). If the pipe becomes readable you have
453     to call C<poll_cb> to check the results.
454    
455     See C<poll_cb> for an example.
456    
457 root 1.2 =item BDB::poll_cb
458 root 1.1
459     Process some outstanding events on the result pipe. You have to call this
460     regularly. Returns the number of events processed. Returns immediately
461     when no events are outstanding. The amount of events processed depends on
462 root 1.2 the settings of C<BDB::max_poll_req> and C<BDB::max_poll_time>.
463 root 1.1
464     If not all requests were processed for whatever reason, the filehandle
465     will still be ready when C<poll_cb> returns.
466    
467     Example: Install an Event watcher that automatically calls
468 root 1.2 BDB::poll_cb with high priority:
469 root 1.1
470 root 1.2 Event->io (fd => BDB::poll_fileno,
471 root 1.1 poll => 'r', async => 1,
472 root 1.2 cb => \&BDB::poll_cb);
473 root 1.1
474 root 1.2 =item BDB::max_poll_reqs $nreqs
475 root 1.1
476 root 1.2 =item BDB::max_poll_time $seconds
477 root 1.1
478     These set the maximum number of requests (default C<0>, meaning infinity)
479 root 1.2 that are being processed by C<BDB::poll_cb> in one call, respectively
480 root 1.1 the maximum amount of time (default C<0>, meaning infinity) spent in
481 root 1.2 C<BDB::poll_cb> to process requests (more correctly the mininum amount
482 root 1.1 of time C<poll_cb> is allowed to use).
483    
484     Setting C<max_poll_time> to a non-zero value creates an overhead of one
485     syscall per request processed, which is not normally a problem unless your
486     callbacks are really really fast or your OS is really really slow (I am
487     not mentioning Solaris here). Using C<max_poll_reqs> incurs no overhead.
488    
489     Setting these is useful if you want to ensure some level of
490     interactiveness when perl is not fast enough to process all requests in
491     time.
492    
493     For interactive programs, values such as C<0.01> to C<0.1> should be fine.
494    
495 root 1.18 Example: Install an EV watcher that automatically calls
496 root 1.2 BDB::poll_cb with low priority, to ensure that other parts of the
497 root 1.18 program get the CPU sometimes even under high load.
498 root 1.1
499     # try not to spend much more than 0.1s in poll_cb
500 root 1.2 BDB::max_poll_time 0.1;
501 root 1.1
502 root 1.18 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
503 root 1.1
504 root 1.2 =item BDB::poll_wait
505 root 1.1
506     If there are any outstanding requests and none of them in the result
507     phase, wait till the result filehandle becomes ready for reading (simply
508     does a C<select> on the filehandle. This is useful if you want to
509     synchronously wait for some requests to finish).
510    
511     See C<nreqs> for an example.
512    
513 root 1.2 =item BDB::poll
514 root 1.1
515     Waits until some requests have been handled.
516    
517     Returns the number of requests processed, but is otherwise strictly
518     equivalent to:
519    
520 root 1.2 BDB::poll_wait, BDB::poll_cb
521 root 1.1
522 root 1.2 =item BDB::flush
523 root 1.1
524 root 1.18 Wait till all outstanding BDB requests have been handled.
525 root 1.1
526     Strictly equivalent to:
527    
528 root 1.2 BDB::poll_wait, BDB::poll_cb
529     while BDB::nreqs;
530 root 1.1
531 root 1.8 =back
532    
533 root 1.1 =head3 CONTROLLING THE NUMBER OF THREADS
534    
535 root 1.8 =over 4
536    
537 root 1.2 =item BDB::min_parallel $nthreads
538 root 1.1
539 root 1.18 Set the minimum number of BDB threads to C<$nthreads>. The current
540 root 1.1 default is C<8>, which means eight asynchronous operations can execute
541     concurrently at any one time (the number of outstanding requests,
542     however, is unlimited).
543    
544 root 1.18 BDB starts threads only on demand, when an BDB request is queued and
545 root 1.1 no free thread exists. Please note that queueing up a hundred requests can
546     create demand for a hundred threads, even if it turns out that everything
547     is in the cache and could have been processed faster by a single thread.
548    
549     It is recommended to keep the number of threads relatively low, as some
550     Linux kernel versions will scale negatively with the number of threads
551     (higher parallelity => MUCH higher latency). With current Linux 2.6
552     versions, 4-32 threads should be fine.
553    
554     Under most circumstances you don't need to call this function, as the
555     module selects a default that is suitable for low to moderate load.
556    
557 root 1.2 =item BDB::max_parallel $nthreads
558 root 1.1
559 root 1.18 Sets the maximum number of BDB threads to C<$nthreads>. If more than the
560 root 1.1 specified number of threads are currently running, this function kills
561     them. This function blocks until the limit is reached.
562    
563     While C<$nthreads> are zero, aio requests get queued but not executed
564     until the number of threads has been increased again.
565    
566     This module automatically runs C<max_parallel 0> at program end, to ensure
567     that all threads are killed and that there are no outstanding requests.
568    
569     Under normal circumstances you don't need to call this function.
570    
571 root 1.2 =item BDB::max_idle $nthreads
572 root 1.1
573     Limit the number of threads (default: 4) that are allowed to idle (i.e.,
574     threads that did not get a request to process within 10 seconds). That
575     means if a thread becomes idle while C<$nthreads> other threads are also
576     idle, it will free its resources and exit.
577    
578     This is useful when you allow a large number of threads (e.g. 100 or 1000)
579     to allow for extremely high load situations, but want to free resources
580     under normal circumstances (1000 threads can easily consume 30MB of RAM).
581    
582     The default is probably ok in most situations, especially if thread
583     creation is fast. If thread creation is very slow on your system you might
584     want to use larger values.
585    
586 root 1.2 =item $oldmaxreqs = BDB::max_outstanding $maxreqs
587 root 1.1
588     This is a very bad function to use in interactive programs because it
589     blocks, and a bad way to reduce concurrency because it is inexact: Better
590     use an C<aio_group> together with a feed callback.
591    
592     Sets the maximum number of outstanding requests to C<$nreqs>. If you
593     to queue up more than this number of requests, the next call to the
594     C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
595     function will block until the limit is no longer exceeded.
596    
597     The default value is very large, so there is no practical limit on the
598     number of outstanding requests.
599    
600     You can still queue as many requests as you want. Therefore,
601     C<max_oustsanding> is mainly useful in simple scripts (with low values) or
602     as a stop gap to shield against fatal memory overflow (with large values).
603    
604 root 1.3 =item BDB::set_sync_prepare $cb
605    
606     Sets a callback that is called whenever a request is created without an
607     explicit callback. It has to return two code references. The first is used
608     as the request callback, and the second is called to wait until the first
609     callback has been called. The default implementation works like this:
610    
611     sub {
612     my $status;
613     (
614     sub { $status = $! },
615     sub { BDB::poll while !defined $status; $! = $status },
616     )
617     }
618    
619     =back
620    
621 root 1.1 =head3 STATISTICAL INFORMATION
622    
623 root 1.3 =over 4
624    
625 root 1.2 =item BDB::nreqs
626 root 1.1
627     Returns the number of requests currently in the ready, execute or pending
628     states (i.e. for which their callback has not been invoked yet).
629    
630     Example: wait till there are no outstanding requests anymore:
631    
632 root 1.2 BDB::poll_wait, BDB::poll_cb
633     while BDB::nreqs;
634 root 1.1
635 root 1.2 =item BDB::nready
636 root 1.1
637     Returns the number of requests currently in the ready state (not yet
638     executed).
639    
640 root 1.2 =item BDB::npending
641 root 1.1
642     Returns the number of requests currently in the pending state (executed,
643     but not yet processed by poll_cb).
644    
645     =back
646    
647     =cut
648    
649 root 1.3 set_sync_prepare {
650     my $status;
651     (
652     sub {
653     $status = $!;
654     },
655     sub {
656     BDB::poll while !defined $status;
657     $! = $status;
658     },
659     )
660     };
661    
662 root 1.1 min_parallel 8;
663    
664     END { flush }
665    
666     1;
667    
668     =head2 FORK BEHAVIOUR
669    
670     This module should do "the right thing" when the process using it forks:
671    
672 root 1.18 Before the fork, BDB enters a quiescent state where no requests
673 root 1.1 can be added in other threads and no results will be processed. After
674     the fork the parent simply leaves the quiescent state and continues
675     request/result processing, while the child frees the request/result queue
676     (so that the requests started before the fork will only be handled in the
677     parent). Threads will be started on demand until the limit set in the
678     parent process has been reached again.
679    
680     In short: the parent will, after a short pause, continue as if fork had
681 root 1.18 not been called, while the child will act as if BDB has not been used
682 root 1.1 yet.
683    
684 root 1.26 Win32 note: there is no fork on win32, and perls emulation of it is too
685     broken to be supported, so do not use BDB in a windows pseudo-fork, better
686     yet, switch to a more capable platform.
687    
688 root 1.1 =head2 MEMORY USAGE
689    
690     Per-request usage:
691    
692     Each aio request uses - depending on your architecture - around 100-200
693     bytes of memory. In addition, stat requests need a stat buffer (possibly
694     a few hundred bytes), readdir requires a result buffer and so on. Perl
695     scalars and other data passed into aio requests will also be locked and
696     will consume memory till the request has entered the done state.
697    
698 root 1.13 This is not awfully much, so queuing lots of requests is not usually a
699 root 1.1 problem.
700    
701     Per-thread usage:
702    
703     In the execution phase, some aio requests require more memory for
704     temporary buffers, and each thread requires a stack and other data
705     structures (usually around 16k-128k, depending on the OS).
706    
707     =head1 KNOWN BUGS
708    
709 root 1.15 Known bugs will be fixed in the next release, except:
710    
711     If you use a transaction in any request, and the request returns
712     with an operating system error or DB_LOCK_NOTGRANTED, the internal
713     TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
714     above.
715 root 1.1
716     =head1 SEE ALSO
717    
718 root 1.18 L<Coro::BDB>, L<IO::AIO>.
719 root 1.1
720     =head1 AUTHOR
721    
722     Marc Lehmann <schmorp@schmorp.de>
723     http://home.schmorp.de/
724    
725     =cut
726