ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
Revision: 1.34
Committed: Sun Mar 30 06:17:31 2008 UTC (16 years, 1 month ago) by root
Branch: MAIN
CVS Tags: rel-1_44
Changes since 1.33: +8 -2 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.2 BDB - Asynchronous Berkeley DB access
4 root 1.1
5     =head1 SYNOPSIS
6    
7 root 1.2 use BDB;
8 root 1.1
9 root 1.19 my $env = db_env_create;
10    
11     mkdir "bdtest", 0700;
12     db_env_open
13     $env,
14     "bdtest",
15     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16     | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17     0600;
18    
19     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20    
21     my $db = db_create $env;
22     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23     | BDB::READ_UNCOMMITTED, 0600;
24     db_put $db, undef, "key", "data", 0, sub {
25     db_del $db, undef, "key";
26     };
27     db_sync $db;
28    
29 root 1.28 # when you also use Coro, management is easy:
30     use Coro::BDB;
31    
32 root 1.19 # automatic result processing with AnyEvent:
33     our $FH; open $FH, "<&=" . BDB::poll_fileno;
34     our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
35    
36     # automatic result processing with EV:
37     my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
38    
39     # with Glib:
40     add_watch Glib::IO BDB::poll_fileno,
41     in => sub { BDB::poll_cb; 1 };
42    
43     # or simply flush manually
44     BDB::flush;
45    
46    
47 root 1.1 =head1 DESCRIPTION
48    
49 root 1.10 See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
50 root 1.12 The BDB API is very similar to the C API (the translation has been very faithful).
51 root 1.10
52     See also the example sections in the document below and possibly the eg/
53     subdirectory of the BDB distribution. Last not least see the IO::AIO
54     documentation, as that module uses almost the same asynchronous request
55     model as this module.
56    
57     I know this is woefully inadequate documentation. Send a patch!
58 root 1.7
59 root 1.1
60     =head1 REQUEST ANATOMY AND LIFETIME
61    
62     Every request method creates a request. which is a C data structure not
63     directly visible to Perl.
64    
65     During their existance, bdb requests travel through the following states,
66     in order:
67    
68     =over 4
69    
70     =item ready
71    
72     Immediately after a request is created it is put into the ready state,
73     waiting for a thread to execute it.
74    
75     =item execute
76    
77     A thread has accepted the request for processing and is currently
78     executing it (e.g. blocking in read).
79    
80     =item pending
81    
82     The request has been executed and is waiting for result processing.
83    
84     While request submission and execution is fully asynchronous, result
85     processing is not and relies on the perl interpreter calling C<poll_cb>
86     (or another function with the same effect).
87    
88     =item result
89    
90     The request results are processed synchronously by C<poll_cb>.
91    
92     The C<poll_cb> function will process all outstanding aio requests by
93     calling their callbacks, freeing memory associated with them and managing
94     any groups they are contained in.
95    
96     =item done
97    
98     Request has reached the end of its lifetime and holds no resources anymore
99     (except possibly for the Perl object, but its connection to the actual
100     aio request is severed and calling its methods will either do nothing or
101     result in a runtime error).
102    
103     =back
104    
105     =cut
106    
107 root 1.2 package BDB;
108 root 1.1
109     no warnings;
110     use strict 'vars';
111    
112     use base 'Exporter';
113    
114     BEGIN {
115 root 1.32 our $VERSION = '1.44';
116 root 1.1
117 root 1.3 our @BDB_REQ = qw(
118 root 1.6 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
119     db_env_memp_sync db_env_memp_trickle
120 root 1.25 db_open db_close db_compact db_sync db_upgrade
121     db_put db_get db_pget db_del db_key_range
122 root 1.15 db_txn_commit db_txn_abort db_txn_finish
123 root 1.5 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
124 root 1.6 db_sequence_open db_sequence_close
125     db_sequence_get db_sequence_remove
126     );
127     our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
128     our @EXPORT_OK = qw(
129     poll_fileno poll_cb poll_wait flush
130     min_parallel max_parallel max_idle
131     nreqs nready npending nthreads
132     max_poll_time max_poll_reqs
133 root 1.3 );
134 root 1.1
135     require XSLoader;
136 root 1.2 XSLoader::load ("BDB", $VERSION);
137 root 1.1 }
138    
139 root 1.34 =head2 FILENAMES/DATABASE NAMES
140    
141     The BDB expects "binary" filenames (octet strings) for pathnames on POSIX
142     systems, and "unicode" filenames (strings with characters potentially
143     >255) on Win32 (expecting a Unicode win32 build - win32 is a total mess).
144    
145 root 1.10 =head2 BERKELEYDB FUNCTIONS
146    
147     All of these are functions. The create functions simply return a new
148 root 1.34 object and never block. All the remaining functions take an optional
149 root 1.24 callback as last argument. If it is missing, then the function will be
150     executed synchronously. In both cases, C<$!> will reflect the return value
151     of the function.
152 root 1.10
153     BDB functions that cannot block (mostly functions that manipulate
154     settings) are method calls on the relevant objects, so the rule of thumb
155 root 1.34 is: if it's a method, it's not blocking, if it's a function, it takes a
156 root 1.10 callback as last argument.
157    
158     In the following, C<$int> signifies an integer return value,
159     C<octetstring> is a "binary string" (i.e. a perl string with no character
160     indices >255), C<U32> is an unsigned 32 bit integer, C<int> is some
161     integer, C<NV> is a floating point value.
162    
163     The C<SV *> types are generic perl scalars (for input and output of data
164     values), and the C<SV *callback> is the optional callback function to call
165     when the request is completed.
166    
167 root 1.11 The various C<DB_ENV> etc. arguments are handles return by
168     C<db_env_create>, C<db_create>, C<txn_begin> and so on. If they have an
169     appended C<_ornull> this means they are optional and you can pass C<undef>
170     for them, resulting a NULL pointer on the C level.
171 root 1.10
172     =head3 BDB functions
173    
174     Functions in the BDB namespace, exported by default:
175    
176     $env = db_env_create (U32 env_flags = 0)
177 root 1.14 flags: RPCCLIENT
178 root 1.10
179     db_env_open (DB_ENV *env, octetstring db_home, U32 open_flags, int mode, SV *callback = &PL_sv_undef)
180 root 1.14 open_flags: INIT_CDB INIT_LOCK INIT_LOG INIT_MPOOL INIT_REP INIT_TXN RECOVER RECOVER_FATAL USE_ENVIRON USE_ENVIRON_ROOT CREATE LOCKDOWN PRIVATE REGISTER SYSTEM_MEM
181 root 1.10 db_env_close (DB_ENV *env, U32 flags = 0, SV *callback = &PL_sv_undef)
182     db_env_txn_checkpoint (DB_ENV *env, U32 kbyte = 0, U32 min = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
183 root 1.14 flags: FORCE
184 root 1.10 db_env_lock_detect (DB_ENV *env, U32 flags = 0, U32 atype = DB_LOCK_DEFAULT, SV *dummy = 0, SV *callback = &PL_sv_undef)
185 root 1.14 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST
186 root 1.10 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = &PL_sv_undef)
187     db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = &PL_sv_undef)
188    
189     $db = db_create (DB_ENV *env = 0, U32 flags = 0)
190 root 1.14 flags: XA_CREATE
191 root 1.10
192     db_open (DB *db, DB_TXN_ornull *txnid, octetstring file, octetstring database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
193 root 1.14 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
194 root 1.10 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
195 root 1.14 flags: DB_NOSYNC
196 root 1.25 db_upgrade (DB *db, octetstring file, U32 flags = 0, SV *callback = &PL_sv_undef)
197 root 1.12 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef)
198 root 1.14 flags: FREELIST_ONLY FREE_SPACE
199 root 1.10 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
200     db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef)
201     db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
202 root 1.14 flags: APPEND NODUPDATA NOOVERWRITE
203 root 1.10 db_get (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
204 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
205 root 1.10 db_pget (DB *db, DB_TXN_ornull *txn, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
206 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
207 root 1.10 db_del (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
208     db_txn_commit (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
209 root 1.14 flags: TXN_NOSYNC TXN_SYNC
210 root 1.10 db_txn_abort (DB_TXN *txn, SV *callback = &PL_sv_undef)
211 root 1.14
212 root 1.10 db_c_close (DBC *dbc, SV *callback = &PL_sv_undef)
213     db_c_count (DBC *dbc, SV *count, U32 flags = 0, SV *callback = &PL_sv_undef)
214     db_c_put (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
215 root 1.14 flags: AFTER BEFORE CURRENT KEYFIRST KEYLAST NODUPDATA
216 root 1.10 db_c_get (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
217 root 1.14 flags: CURRENT FIRST GET_BOTH GET_BOTH_RANGE GET_RECNO JOIN_ITEM LAST NEXT NEXT_DUP NEXT_NODUP PREV PREV_DUP PREV_NODUP SET SET_RANGE SET_RECNO READ_UNCOMMITTED MULTIPLE MULTIPLE_KEY RMW
218 root 1.10 db_c_pget (DBC *dbc, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
219     db_c_del (DBC *dbc, U32 flags = 0, SV *callback = &PL_sv_undef)
220    
221     db_sequence_open (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
222 root 1.14 flags: CREATE EXCL
223 root 1.10 db_sequence_close (DB_SEQUENCE *seq, U32 flags = 0, SV *callback = &PL_sv_undef)
224     db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
225 root 1.14 flags: TXN_NOSYNC
226 root 1.10 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
227 root 1.14 flags: TXN_NOSYNC
228 root 1.10
229 root 1.15 =head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
230    
231 root 1.16 This is not actually a Berkeley DB function but a BDB module
232     extension. The background for this exytension is: It is very annoying to
233     have to check every single BDB function for error returns and provide a
234     codepath out of your transaction. While the BDB module still makes this
235     possible, it contains the following extensions:
236 root 1.15
237     When a transaction-protected function returns any operating system
238     error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
239 root 1.16 transaction. This flag is also set by Berkeley DB functions themselves
240 root 1.15 when an operation fails with LOCK_DEADLOCK, and it causes all further
241     operations on that transaction (including C<db_txn_commit>) to fail.
242    
243     The C<db_txn_finish> request will look at this flag, and, if it is set,
244     will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
245 root 1.16 if it isn't set to something else yet). If it isn't set, it will call
246     C<db_txn_commit> and return the error normally.
247 root 1.15
248     How to use this? Easy: just write your transaction normally:
249    
250     my $txn = $db_env->txn_begin;
251     db_get $db, $txn, "key", my $data;
252     db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
253     db_txn_finish $txn;
254     die "transaction failed" if $!;
255    
256     That is, handle only the expected errors. If something unexpected happens
257     (EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
258     requests (db_put in this case) will simply be skipped (they will fail with
259     LOCK_DEADLOCK) and the transaction will be aborted.
260    
261 root 1.16 You can use the C<< $txn->failed >> method to check wether a transaction
262 root 1.15 has failed in this way and abort further processing (excluding
263     C<db_txn_finish>).
264    
265 root 1.10 =head3 DB_ENV/database environment methods
266    
267     Methods available on DB_ENV/$env handles:
268    
269     DESTROY (DB_ENV_ornull *env)
270     CODE:
271     if (env)
272     env->close (env, 0);
273    
274     $int = $env->set_data_dir (const char *dir)
275     $int = $env->set_tmp_dir (const char *dir)
276     $int = $env->set_lg_dir (const char *dir)
277     $int = $env->set_shm_key (long shm_key)
278     $int = $env->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
279     $int = $env->set_flags (U32 flags, int onoff)
280     $env->set_errfile (FILE *errfile = 0)
281     $env->set_msgfile (FILE *msgfile = 0)
282     $int = $env->set_verbose (U32 which, int onoff = 1)
283     $int = $env->set_encrypt (const char *password, U32 flags = 0)
284 root 1.15 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
285 root 1.10 $int = $env->set_mp_max_openfd (int maxopenfd);
286     $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
287     $int = $env->set_mp_mmapsize (int mmapsize_mb)
288     $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
289     $int = $env->set_lk_max_lockers (U32 max)
290     $int = $env->set_lk_max_locks (U32 max)
291     $int = $env->set_lk_max_objects (U32 max)
292     $int = $env->set_lg_bsize (U32 max)
293     $int = $env->set_lg_max (U32 max)
294 root 1.20 $int = $env->mutex_set_increment (U32 increment)
295     $int = $env->mutex_set_tas_spins (U32 tas_spins)
296     $int = $env->mutex_set_max (U32 max)
297     $int = $env->mutex_set_align (U32 align)
298 root 1.10
299     $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
300 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
301 root 1.10
302 root 1.12 =head4 Example:
303 root 1.10
304     use AnyEvent;
305     use BDB;
306    
307     our $FH; open $FH, "<&=" . BDB::poll_fileno;
308     our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
309    
310     BDB::min_parallel 8;
311    
312     my $env = db_env_create;
313    
314     mkdir "bdtest", 0700;
315     db_env_open
316     $env,
317     "bdtest",
318     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
319     0600;
320    
321     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
322    
323    
324     =head3 DB/database methods
325    
326     Methods available on DB/$db handles:
327    
328     DESTROY (DB_ornull *db)
329     CODE:
330     if (db)
331     {
332     SV *env = (SV *)db->app_private;
333     db->close (db, 0);
334     SvREFCNT_dec (env);
335     }
336    
337     $int = $db->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
338     $int = $db->set_flags (U32 flags)
339 root 1.14 flags: CHKSUM ENCRYPT TXN_NOT_DURABLE
340     Btree: DUP DUPSORT RECNUM REVSPLITOFF
341     Hash: DUP DUPSORT
342     Queue: INORDER
343     Recno: RENUMBER SNAPSHOT
344    
345 root 1.10 $int = $db->set_encrypt (const char *password, U32 flags)
346     $int = $db->set_lorder (int lorder)
347     $int = $db->set_bt_minkey (U32 minkey)
348     $int = $db->set_re_delim (int delim)
349     $int = $db->set_re_pad (int re_pad)
350     $int = $db->set_re_source (char *source)
351     $int = $db->set_re_len (U32 re_len)
352     $int = $db->set_h_ffactor (U32 h_ffactor)
353     $int = $db->set_h_nelem (U32 h_nelem)
354     $int = $db->set_q_extentsize (U32 extentsize)
355    
356     $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0)
357 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT
358 root 1.10 $seq = $db->sequence (U32 flags = 0)
359    
360 root 1.12 =head4 Example:
361 root 1.10
362     my $db = db_create $env;
363     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600;
364    
365     for (1..1000) {
366     db_put $db, undef, "key $_", "data $_";
367    
368     db_key_range $db, undef, "key $_", my $keyrange;
369     my ($lt, $eq, $gt) = @$keyrange;
370     }
371    
372     db_del $db, undef, "key $_" for 1..1000;
373    
374     db_sync $db;
375    
376    
377     =head3 DB_TXN/transaction methods
378    
379     Methods available on DB_TXN/$txn handles:
380    
381     DESTROY (DB_TXN_ornull *txn)
382     CODE:
383     if (txn)
384     txn->abort (txn);
385    
386 root 1.15 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
387 root 1.14 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
388 root 1.10
389 root 1.15 $bool = $txn->failed
390     # see db_txn_finish documentation, above
391    
392 root 1.10
393     =head3 DBC/cursor methods
394    
395     Methods available on DBC/$dbc handles:
396    
397     DESTROY (DBC_ornull *dbc)
398     CODE:
399     if (dbc)
400     dbc->c_close (dbc);
401    
402 root 1.28 $int = $cursor->set_priority ($priority = PRIORITY_*)
403    
404 root 1.12 =head4 Example:
405 root 1.10
406     my $c = $db->cursor;
407    
408     for (;;) {
409     db_c_get $c, my $key, my $data, BDB::NEXT;
410     warn "<$!,$key,$data>";
411     last if $!;
412     }
413    
414     db_c_close $c;
415    
416 root 1.12
417 root 1.10 =head3 DB_SEQUENCE/sequence methods
418    
419     Methods available on DB_SEQUENCE/$seq handles:
420    
421     DESTROY (DB_SEQUENCE_ornull *seq)
422     CODE:
423     if (seq)
424     seq->close (seq, 0);
425    
426     $int = $seq->initial_value (db_seq_t value)
427     $int = $seq->set_cachesize (U32 size)
428     $int = $seq->set_flags (U32 flags)
429 root 1.14 flags: SEQ_DEC SEQ_INC SEQ_WRAP
430 root 1.10 $int = $seq->set_range (db_seq_t min, db_seq_t max)
431    
432 root 1.12 =head4 Example:
433 root 1.10
434     my $seq = $db->sequence;
435    
436     db_sequence_open $seq, undef, "seq", BDB::CREATE;
437     db_sequence_get $seq, undef, 1, my $value;
438    
439    
440 root 1.1 =head2 SUPPORT FUNCTIONS
441    
442     =head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION
443    
444     =over 4
445    
446 root 1.23 =item $msg = BDB::strerror [$errno]
447    
448     Returns the string corresponding to the given errno value. If no argument
449     is given, use C<$!>.
450    
451 root 1.33 Note that the BDB module also patches the C<$!> variable directly, so you
452     should be able to get a bdb error string by simply stringifying C<$!>.
453    
454 root 1.2 =item $fileno = BDB::poll_fileno
455 root 1.1
456     Return the I<request result pipe file descriptor>. This filehandle must be
457     polled for reading by some mechanism outside this module (e.g. Event or
458     select, see below or the SYNOPSIS). If the pipe becomes readable you have
459     to call C<poll_cb> to check the results.
460    
461     See C<poll_cb> for an example.
462    
463 root 1.2 =item BDB::poll_cb
464 root 1.1
465     Process some outstanding events on the result pipe. You have to call this
466     regularly. Returns the number of events processed. Returns immediately
467     when no events are outstanding. The amount of events processed depends on
468 root 1.2 the settings of C<BDB::max_poll_req> and C<BDB::max_poll_time>.
469 root 1.1
470     If not all requests were processed for whatever reason, the filehandle
471     will still be ready when C<poll_cb> returns.
472    
473     Example: Install an Event watcher that automatically calls
474 root 1.2 BDB::poll_cb with high priority:
475 root 1.1
476 root 1.2 Event->io (fd => BDB::poll_fileno,
477 root 1.1 poll => 'r', async => 1,
478 root 1.2 cb => \&BDB::poll_cb);
479 root 1.1
480 root 1.2 =item BDB::max_poll_reqs $nreqs
481 root 1.1
482 root 1.2 =item BDB::max_poll_time $seconds
483 root 1.1
484     These set the maximum number of requests (default C<0>, meaning infinity)
485 root 1.2 that are being processed by C<BDB::poll_cb> in one call, respectively
486 root 1.1 the maximum amount of time (default C<0>, meaning infinity) spent in
487 root 1.2 C<BDB::poll_cb> to process requests (more correctly the mininum amount
488 root 1.1 of time C<poll_cb> is allowed to use).
489    
490     Setting C<max_poll_time> to a non-zero value creates an overhead of one
491     syscall per request processed, which is not normally a problem unless your
492     callbacks are really really fast or your OS is really really slow (I am
493     not mentioning Solaris here). Using C<max_poll_reqs> incurs no overhead.
494    
495     Setting these is useful if you want to ensure some level of
496     interactiveness when perl is not fast enough to process all requests in
497     time.
498    
499     For interactive programs, values such as C<0.01> to C<0.1> should be fine.
500    
501 root 1.18 Example: Install an EV watcher that automatically calls
502 root 1.2 BDB::poll_cb with low priority, to ensure that other parts of the
503 root 1.18 program get the CPU sometimes even under high load.
504 root 1.1
505     # try not to spend much more than 0.1s in poll_cb
506 root 1.2 BDB::max_poll_time 0.1;
507 root 1.1
508 root 1.18 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
509 root 1.1
510 root 1.2 =item BDB::poll_wait
511 root 1.1
512     If there are any outstanding requests and none of them in the result
513     phase, wait till the result filehandle becomes ready for reading (simply
514     does a C<select> on the filehandle. This is useful if you want to
515     synchronously wait for some requests to finish).
516    
517     See C<nreqs> for an example.
518    
519 root 1.2 =item BDB::poll
520 root 1.1
521     Waits until some requests have been handled.
522    
523     Returns the number of requests processed, but is otherwise strictly
524     equivalent to:
525    
526 root 1.2 BDB::poll_wait, BDB::poll_cb
527 root 1.1
528 root 1.2 =item BDB::flush
529 root 1.1
530 root 1.18 Wait till all outstanding BDB requests have been handled.
531 root 1.1
532     Strictly equivalent to:
533    
534 root 1.2 BDB::poll_wait, BDB::poll_cb
535     while BDB::nreqs;
536 root 1.1
537 root 1.8 =back
538    
539 root 1.1 =head3 CONTROLLING THE NUMBER OF THREADS
540    
541 root 1.8 =over 4
542    
543 root 1.2 =item BDB::min_parallel $nthreads
544 root 1.1
545 root 1.18 Set the minimum number of BDB threads to C<$nthreads>. The current
546 root 1.1 default is C<8>, which means eight asynchronous operations can execute
547     concurrently at any one time (the number of outstanding requests,
548     however, is unlimited).
549    
550 root 1.18 BDB starts threads only on demand, when an BDB request is queued and
551 root 1.1 no free thread exists. Please note that queueing up a hundred requests can
552     create demand for a hundred threads, even if it turns out that everything
553     is in the cache and could have been processed faster by a single thread.
554    
555     It is recommended to keep the number of threads relatively low, as some
556     Linux kernel versions will scale negatively with the number of threads
557     (higher parallelity => MUCH higher latency). With current Linux 2.6
558     versions, 4-32 threads should be fine.
559    
560     Under most circumstances you don't need to call this function, as the
561     module selects a default that is suitable for low to moderate load.
562    
563 root 1.2 =item BDB::max_parallel $nthreads
564 root 1.1
565 root 1.18 Sets the maximum number of BDB threads to C<$nthreads>. If more than the
566 root 1.1 specified number of threads are currently running, this function kills
567     them. This function blocks until the limit is reached.
568    
569     While C<$nthreads> are zero, aio requests get queued but not executed
570     until the number of threads has been increased again.
571    
572     This module automatically runs C<max_parallel 0> at program end, to ensure
573     that all threads are killed and that there are no outstanding requests.
574    
575     Under normal circumstances you don't need to call this function.
576    
577 root 1.2 =item BDB::max_idle $nthreads
578 root 1.1
579     Limit the number of threads (default: 4) that are allowed to idle (i.e.,
580     threads that did not get a request to process within 10 seconds). That
581     means if a thread becomes idle while C<$nthreads> other threads are also
582     idle, it will free its resources and exit.
583    
584     This is useful when you allow a large number of threads (e.g. 100 or 1000)
585     to allow for extremely high load situations, but want to free resources
586     under normal circumstances (1000 threads can easily consume 30MB of RAM).
587    
588     The default is probably ok in most situations, especially if thread
589     creation is fast. If thread creation is very slow on your system you might
590     want to use larger values.
591    
592 root 1.2 =item $oldmaxreqs = BDB::max_outstanding $maxreqs
593 root 1.1
594     This is a very bad function to use in interactive programs because it
595     blocks, and a bad way to reduce concurrency because it is inexact: Better
596     use an C<aio_group> together with a feed callback.
597    
598     Sets the maximum number of outstanding requests to C<$nreqs>. If you
599     to queue up more than this number of requests, the next call to the
600     C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
601     function will block until the limit is no longer exceeded.
602    
603     The default value is very large, so there is no practical limit on the
604     number of outstanding requests.
605    
606     You can still queue as many requests as you want. Therefore,
607     C<max_oustsanding> is mainly useful in simple scripts (with low values) or
608     as a stop gap to shield against fatal memory overflow (with large values).
609    
610 root 1.3 =item BDB::set_sync_prepare $cb
611    
612     Sets a callback that is called whenever a request is created without an
613     explicit callback. It has to return two code references. The first is used
614     as the request callback, and the second is called to wait until the first
615     callback has been called. The default implementation works like this:
616    
617     sub {
618     my $status;
619     (
620     sub { $status = $! },
621     sub { BDB::poll while !defined $status; $! = $status },
622     )
623     }
624    
625     =back
626    
627 root 1.1 =head3 STATISTICAL INFORMATION
628    
629 root 1.3 =over 4
630    
631 root 1.2 =item BDB::nreqs
632 root 1.1
633     Returns the number of requests currently in the ready, execute or pending
634     states (i.e. for which their callback has not been invoked yet).
635    
636     Example: wait till there are no outstanding requests anymore:
637    
638 root 1.2 BDB::poll_wait, BDB::poll_cb
639     while BDB::nreqs;
640 root 1.1
641 root 1.2 =item BDB::nready
642 root 1.1
643     Returns the number of requests currently in the ready state (not yet
644     executed).
645    
646 root 1.2 =item BDB::npending
647 root 1.1
648     Returns the number of requests currently in the pending state (executed,
649     but not yet processed by poll_cb).
650    
651     =back
652    
653     =cut
654    
655 root 1.3 set_sync_prepare {
656     my $status;
657     (
658     sub {
659     $status = $!;
660     },
661     sub {
662     BDB::poll while !defined $status;
663     $! = $status;
664     },
665     )
666     };
667    
668 root 1.1 min_parallel 8;
669    
670     END { flush }
671    
672     1;
673    
674     =head2 FORK BEHAVIOUR
675    
676     This module should do "the right thing" when the process using it forks:
677    
678 root 1.18 Before the fork, BDB enters a quiescent state where no requests
679 root 1.1 can be added in other threads and no results will be processed. After
680     the fork the parent simply leaves the quiescent state and continues
681     request/result processing, while the child frees the request/result queue
682     (so that the requests started before the fork will only be handled in the
683     parent). Threads will be started on demand until the limit set in the
684     parent process has been reached again.
685    
686     In short: the parent will, after a short pause, continue as if fork had
687 root 1.18 not been called, while the child will act as if BDB has not been used
688 root 1.1 yet.
689    
690 root 1.26 Win32 note: there is no fork on win32, and perls emulation of it is too
691     broken to be supported, so do not use BDB in a windows pseudo-fork, better
692     yet, switch to a more capable platform.
693    
694 root 1.1 =head2 MEMORY USAGE
695    
696     Per-request usage:
697    
698     Each aio request uses - depending on your architecture - around 100-200
699     bytes of memory. In addition, stat requests need a stat buffer (possibly
700     a few hundred bytes), readdir requires a result buffer and so on. Perl
701     scalars and other data passed into aio requests will also be locked and
702     will consume memory till the request has entered the done state.
703    
704 root 1.13 This is not awfully much, so queuing lots of requests is not usually a
705 root 1.1 problem.
706    
707     Per-thread usage:
708    
709     In the execution phase, some aio requests require more memory for
710     temporary buffers, and each thread requires a stack and other data
711     structures (usually around 16k-128k, depending on the OS).
712    
713     =head1 KNOWN BUGS
714    
715 root 1.15 Known bugs will be fixed in the next release, except:
716    
717     If you use a transaction in any request, and the request returns
718     with an operating system error or DB_LOCK_NOTGRANTED, the internal
719     TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
720     above.
721 root 1.1
722     =head1 SEE ALSO
723    
724 root 1.18 L<Coro::BDB>, L<IO::AIO>.
725 root 1.1
726     =head1 AUTHOR
727    
728     Marc Lehmann <schmorp@schmorp.de>
729     http://home.schmorp.de/
730    
731     =cut
732