ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
Revision: 1.40
Committed: Tue Jul 8 08:35:12 2008 UTC (15 years, 10 months ago) by root
Branch: MAIN
CVS Tags: rel-1_6
Changes since 1.39: +7 -0 lines
Log Message:
1.6

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.2 BDB - Asynchronous Berkeley DB access
4 root 1.1
5     =head1 SYNOPSIS
6    
7 root 1.2 use BDB;
8 root 1.1
9 root 1.19 my $env = db_env_create;
10    
11     mkdir "bdtest", 0700;
12     db_env_open
13     $env,
14     "bdtest",
15     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16     | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17     0600;
18    
19     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20    
21     my $db = db_create $env;
22     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23     | BDB::READ_UNCOMMITTED, 0600;
24     db_put $db, undef, "key", "data", 0, sub {
25     db_del $db, undef, "key";
26     };
27     db_sync $db;
28    
29 root 1.28 # when you also use Coro, management is easy:
30     use Coro::BDB;
31    
32 root 1.36 # automatic event loop intergration with AnyEvent:
33     use AnyEvent::BDB;
34 root 1.19
35     # automatic result processing with EV:
36     my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
37    
38     # with Glib:
39     add_watch Glib::IO BDB::poll_fileno,
40     in => sub { BDB::poll_cb; 1 };
41    
42     # or simply flush manually
43     BDB::flush;
44    
45    
46 root 1.1 =head1 DESCRIPTION
47    
48 root 1.10 See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
49 root 1.12 The BDB API is very similar to the C API (the translation has been very faithful).
50 root 1.10
51     See also the example sections in the document below and possibly the eg/
52     subdirectory of the BDB distribution. Last not least see the IO::AIO
53     documentation, as that module uses almost the same asynchronous request
54     model as this module.
55    
56     I know this is woefully inadequate documentation. Send a patch!
57 root 1.7
58 root 1.1
59     =head1 REQUEST ANATOMY AND LIFETIME
60    
61     Every request method creates a request. which is a C data structure not
62     directly visible to Perl.
63    
64     During their existance, bdb requests travel through the following states,
65     in order:
66    
67     =over 4
68    
69     =item ready
70    
71     Immediately after a request is created it is put into the ready state,
72     waiting for a thread to execute it.
73    
74     =item execute
75    
76     A thread has accepted the request for processing and is currently
77     executing it (e.g. blocking in read).
78    
79     =item pending
80    
81     The request has been executed and is waiting for result processing.
82    
83     While request submission and execution is fully asynchronous, result
84     processing is not and relies on the perl interpreter calling C<poll_cb>
85     (or another function with the same effect).
86    
87     =item result
88    
89     The request results are processed synchronously by C<poll_cb>.
90    
91     The C<poll_cb> function will process all outstanding aio requests by
92     calling their callbacks, freeing memory associated with them and managing
93     any groups they are contained in.
94    
95     =item done
96    
97     Request has reached the end of its lifetime and holds no resources anymore
98     (except possibly for the Perl object, but its connection to the actual
99     aio request is severed and calling its methods will either do nothing or
100     result in a runtime error).
101    
102     =back
103    
104     =cut
105    
106 root 1.2 package BDB;
107 root 1.1
108     no warnings;
109     use strict 'vars';
110    
111     use base 'Exporter';
112    
113     BEGIN {
114 root 1.39 our $VERSION = '1.6';
115 root 1.1
116 root 1.3 our @BDB_REQ = qw(
117 root 1.6 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
118 root 1.38 db_env_memp_sync db_env_memp_trickle db_env_dbrename db_env_dbremove
119 root 1.25 db_open db_close db_compact db_sync db_upgrade
120     db_put db_get db_pget db_del db_key_range
121 root 1.15 db_txn_commit db_txn_abort db_txn_finish
122 root 1.5 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
123 root 1.6 db_sequence_open db_sequence_close
124     db_sequence_get db_sequence_remove
125     );
126     our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
127     our @EXPORT_OK = qw(
128     poll_fileno poll_cb poll_wait flush
129     min_parallel max_parallel max_idle
130     nreqs nready npending nthreads
131     max_poll_time max_poll_reqs
132 root 1.3 );
133 root 1.1
134     require XSLoader;
135 root 1.2 XSLoader::load ("BDB", $VERSION);
136 root 1.1 }
137    
138 root 1.35 =head2 WIN32 FILENAMES/DATABASE NAME MESS
139 root 1.34
140 root 1.35 Perl on Win32 supports only ASCII filenames (the reason is that it abuses
141     an internal flag to store wether a filename is Unicode or ANSI, but that
142     flag is used for somethign else in the perl core, so there is no way to
143     detect wether a filename is ANSI or Unicode-encoded). The BDB module
144     tries to work around this issue by assuming that the filename is an ANSI
145     filename and BDB was built for unicode support.
146 root 1.34
147 root 1.10 =head2 BERKELEYDB FUNCTIONS
148    
149     All of these are functions. The create functions simply return a new
150 root 1.34 object and never block. All the remaining functions take an optional
151 root 1.24 callback as last argument. If it is missing, then the function will be
152     executed synchronously. In both cases, C<$!> will reflect the return value
153     of the function.
154 root 1.10
155     BDB functions that cannot block (mostly functions that manipulate
156     settings) are method calls on the relevant objects, so the rule of thumb
157 root 1.34 is: if it's a method, it's not blocking, if it's a function, it takes a
158 root 1.10 callback as last argument.
159    
160     In the following, C<$int> signifies an integer return value,
161 root 1.38 C<bdb_filename> is a "filename" (octets on unix, madness on windows),
162     C<U32> is an unsigned 32 bit integer, C<int> is some integer, C<NV> is a
163     floating point value.
164 root 1.10
165     The C<SV *> types are generic perl scalars (for input and output of data
166     values), and the C<SV *callback> is the optional callback function to call
167     when the request is completed.
168    
169 root 1.11 The various C<DB_ENV> etc. arguments are handles return by
170     C<db_env_create>, C<db_create>, C<txn_begin> and so on. If they have an
171     appended C<_ornull> this means they are optional and you can pass C<undef>
172     for them, resulting a NULL pointer on the C level.
173 root 1.10
174     =head3 BDB functions
175    
176     Functions in the BDB namespace, exported by default:
177    
178     $env = db_env_create (U32 env_flags = 0)
179 root 1.14 flags: RPCCLIENT
180 root 1.10
181 root 1.38 db_env_open (DB_ENV *env, bdb_filename db_home, U32 open_flags, int mode, SV *callback = &PL_sv_undef)
182 root 1.14 open_flags: INIT_CDB INIT_LOCK INIT_LOG INIT_MPOOL INIT_REP INIT_TXN RECOVER RECOVER_FATAL USE_ENVIRON USE_ENVIRON_ROOT CREATE LOCKDOWN PRIVATE REGISTER SYSTEM_MEM
183 root 1.10 db_env_close (DB_ENV *env, U32 flags = 0, SV *callback = &PL_sv_undef)
184     db_env_txn_checkpoint (DB_ENV *env, U32 kbyte = 0, U32 min = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
185 root 1.14 flags: FORCE
186 root 1.10 db_env_lock_detect (DB_ENV *env, U32 flags = 0, U32 atype = DB_LOCK_DEFAULT, SV *dummy = 0, SV *callback = &PL_sv_undef)
187 root 1.14 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST
188 root 1.10 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = &PL_sv_undef)
189     db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = &PL_sv_undef)
190 root 1.38 db_env_dbremove (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, U32 flags = 0, SV *callback = &PL_sv_undef)
191     db_env_dbrename (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, bdb_filename newname, U32 flags = 0, SV *callback = &PL_sv_undef)
192 root 1.10
193     $db = db_create (DB_ENV *env = 0, U32 flags = 0)
194 root 1.14 flags: XA_CREATE
195 root 1.10
196 root 1.38 db_open (DB *db, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
197 root 1.14 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
198 root 1.10 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
199 root 1.14 flags: DB_NOSYNC
200 root 1.38 db_upgrade (DB *db, bdb_filename file, U32 flags = 0, SV *callback = &PL_sv_undef)
201 root 1.12 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef)
202 root 1.14 flags: FREELIST_ONLY FREE_SPACE
203 root 1.10 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
204     db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef)
205     db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
206 root 1.14 flags: APPEND NODUPDATA NOOVERWRITE
207 root 1.10 db_get (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
208 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
209 root 1.10 db_pget (DB *db, DB_TXN_ornull *txn, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
210 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
211 root 1.10 db_del (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
212     db_txn_commit (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
213 root 1.14 flags: TXN_NOSYNC TXN_SYNC
214 root 1.10 db_txn_abort (DB_TXN *txn, SV *callback = &PL_sv_undef)
215 root 1.14
216 root 1.10 db_c_close (DBC *dbc, SV *callback = &PL_sv_undef)
217     db_c_count (DBC *dbc, SV *count, U32 flags = 0, SV *callback = &PL_sv_undef)
218     db_c_put (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
219 root 1.14 flags: AFTER BEFORE CURRENT KEYFIRST KEYLAST NODUPDATA
220 root 1.10 db_c_get (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
221 root 1.14 flags: CURRENT FIRST GET_BOTH GET_BOTH_RANGE GET_RECNO JOIN_ITEM LAST NEXT NEXT_DUP NEXT_NODUP PREV PREV_DUP PREV_NODUP SET SET_RANGE SET_RECNO READ_UNCOMMITTED MULTIPLE MULTIPLE_KEY RMW
222 root 1.10 db_c_pget (DBC *dbc, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
223     db_c_del (DBC *dbc, U32 flags = 0, SV *callback = &PL_sv_undef)
224    
225     db_sequence_open (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
226 root 1.14 flags: CREATE EXCL
227 root 1.10 db_sequence_close (DB_SEQUENCE *seq, U32 flags = 0, SV *callback = &PL_sv_undef)
228     db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
229 root 1.14 flags: TXN_NOSYNC
230 root 1.10 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
231 root 1.14 flags: TXN_NOSYNC
232 root 1.10
233 root 1.15 =head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
234    
235 root 1.16 This is not actually a Berkeley DB function but a BDB module
236     extension. The background for this exytension is: It is very annoying to
237     have to check every single BDB function for error returns and provide a
238     codepath out of your transaction. While the BDB module still makes this
239     possible, it contains the following extensions:
240 root 1.15
241     When a transaction-protected function returns any operating system
242     error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
243 root 1.16 transaction. This flag is also set by Berkeley DB functions themselves
244 root 1.15 when an operation fails with LOCK_DEADLOCK, and it causes all further
245     operations on that transaction (including C<db_txn_commit>) to fail.
246    
247     The C<db_txn_finish> request will look at this flag, and, if it is set,
248     will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
249 root 1.16 if it isn't set to something else yet). If it isn't set, it will call
250     C<db_txn_commit> and return the error normally.
251 root 1.15
252     How to use this? Easy: just write your transaction normally:
253    
254     my $txn = $db_env->txn_begin;
255     db_get $db, $txn, "key", my $data;
256     db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
257     db_txn_finish $txn;
258     die "transaction failed" if $!;
259    
260     That is, handle only the expected errors. If something unexpected happens
261     (EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
262     requests (db_put in this case) will simply be skipped (they will fail with
263     LOCK_DEADLOCK) and the transaction will be aborted.
264    
265 root 1.16 You can use the C<< $txn->failed >> method to check wether a transaction
266 root 1.15 has failed in this way and abort further processing (excluding
267     C<db_txn_finish>).
268    
269 root 1.10 =head3 DB_ENV/database environment methods
270    
271     Methods available on DB_ENV/$env handles:
272    
273     DESTROY (DB_ENV_ornull *env)
274     CODE:
275     if (env)
276     env->close (env, 0);
277    
278     $int = $env->set_data_dir (const char *dir)
279     $int = $env->set_tmp_dir (const char *dir)
280     $int = $env->set_lg_dir (const char *dir)
281     $int = $env->set_shm_key (long shm_key)
282     $int = $env->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
283 root 1.39 $int = $env->set_flags (U32 flags, int onoff = 1)
284     $int = $env->log_set_config (U32 flags, int onoff = 1) [v4.7]
285     $int = $env->set_intermediate_dir_mode (const char *modestring) [v4.7]
286 root 1.10 $env->set_errfile (FILE *errfile = 0)
287     $env->set_msgfile (FILE *msgfile = 0)
288     $int = $env->set_verbose (U32 which, int onoff = 1)
289     $int = $env->set_encrypt (const char *password, U32 flags = 0)
290 root 1.15 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
291 root 1.10 $int = $env->set_mp_max_openfd (int maxopenfd);
292     $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
293     $int = $env->set_mp_mmapsize (int mmapsize_mb)
294     $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
295     $int = $env->set_lk_max_lockers (U32 max)
296     $int = $env->set_lk_max_locks (U32 max)
297     $int = $env->set_lk_max_objects (U32 max)
298     $int = $env->set_lg_bsize (U32 max)
299     $int = $env->set_lg_max (U32 max)
300 root 1.20 $int = $env->mutex_set_increment (U32 increment)
301     $int = $env->mutex_set_tas_spins (U32 tas_spins)
302     $int = $env->mutex_set_max (U32 max)
303     $int = $env->mutex_set_align (U32 align)
304 root 1.10
305     $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
306 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
307 root 1.10
308 root 1.12 =head4 Example:
309 root 1.10
310     use AnyEvent;
311     use BDB;
312    
313     our $FH; open $FH, "<&=" . BDB::poll_fileno;
314     our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
315    
316     BDB::min_parallel 8;
317    
318     my $env = db_env_create;
319    
320     mkdir "bdtest", 0700;
321     db_env_open
322     $env,
323     "bdtest",
324     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
325     0600;
326    
327     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
328    
329    
330     =head3 DB/database methods
331    
332     Methods available on DB/$db handles:
333    
334     DESTROY (DB_ornull *db)
335     CODE:
336     if (db)
337     {
338     SV *env = (SV *)db->app_private;
339     db->close (db, 0);
340     SvREFCNT_dec (env);
341     }
342    
343     $int = $db->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
344     $int = $db->set_flags (U32 flags)
345 root 1.14 flags: CHKSUM ENCRYPT TXN_NOT_DURABLE
346     Btree: DUP DUPSORT RECNUM REVSPLITOFF
347     Hash: DUP DUPSORT
348     Queue: INORDER
349     Recno: RENUMBER SNAPSHOT
350    
351 root 1.10 $int = $db->set_encrypt (const char *password, U32 flags)
352     $int = $db->set_lorder (int lorder)
353     $int = $db->set_bt_minkey (U32 minkey)
354     $int = $db->set_re_delim (int delim)
355     $int = $db->set_re_pad (int re_pad)
356     $int = $db->set_re_source (char *source)
357     $int = $db->set_re_len (U32 re_len)
358     $int = $db->set_h_ffactor (U32 h_ffactor)
359     $int = $db->set_h_nelem (U32 h_nelem)
360     $int = $db->set_q_extentsize (U32 extentsize)
361    
362     $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0)
363 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT
364 root 1.10 $seq = $db->sequence (U32 flags = 0)
365    
366 root 1.12 =head4 Example:
367 root 1.10
368     my $db = db_create $env;
369     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600;
370    
371     for (1..1000) {
372     db_put $db, undef, "key $_", "data $_";
373    
374     db_key_range $db, undef, "key $_", my $keyrange;
375     my ($lt, $eq, $gt) = @$keyrange;
376     }
377    
378     db_del $db, undef, "key $_" for 1..1000;
379    
380     db_sync $db;
381    
382    
383     =head3 DB_TXN/transaction methods
384    
385     Methods available on DB_TXN/$txn handles:
386    
387     DESTROY (DB_TXN_ornull *txn)
388     CODE:
389     if (txn)
390     txn->abort (txn);
391    
392 root 1.15 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
393 root 1.14 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
394 root 1.10
395 root 1.15 $bool = $txn->failed
396     # see db_txn_finish documentation, above
397    
398 root 1.10
399     =head3 DBC/cursor methods
400    
401     Methods available on DBC/$dbc handles:
402    
403     DESTROY (DBC_ornull *dbc)
404     CODE:
405     if (dbc)
406     dbc->c_close (dbc);
407    
408 root 1.28 $int = $cursor->set_priority ($priority = PRIORITY_*)
409    
410 root 1.12 =head4 Example:
411 root 1.10
412     my $c = $db->cursor;
413    
414     for (;;) {
415     db_c_get $c, my $key, my $data, BDB::NEXT;
416     warn "<$!,$key,$data>";
417     last if $!;
418     }
419    
420     db_c_close $c;
421    
422 root 1.12
423 root 1.10 =head3 DB_SEQUENCE/sequence methods
424    
425     Methods available on DB_SEQUENCE/$seq handles:
426    
427     DESTROY (DB_SEQUENCE_ornull *seq)
428     CODE:
429     if (seq)
430     seq->close (seq, 0);
431    
432     $int = $seq->initial_value (db_seq_t value)
433     $int = $seq->set_cachesize (U32 size)
434     $int = $seq->set_flags (U32 flags)
435 root 1.14 flags: SEQ_DEC SEQ_INC SEQ_WRAP
436 root 1.10 $int = $seq->set_range (db_seq_t min, db_seq_t max)
437    
438 root 1.12 =head4 Example:
439 root 1.10
440     my $seq = $db->sequence;
441    
442     db_sequence_open $seq, undef, "seq", BDB::CREATE;
443     db_sequence_get $seq, undef, 1, my $value;
444    
445    
446 root 1.1 =head2 SUPPORT FUNCTIONS
447    
448     =head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION
449    
450     =over 4
451    
452 root 1.23 =item $msg = BDB::strerror [$errno]
453    
454     Returns the string corresponding to the given errno value. If no argument
455     is given, use C<$!>.
456    
457 root 1.33 Note that the BDB module also patches the C<$!> variable directly, so you
458     should be able to get a bdb error string by simply stringifying C<$!>.
459    
460 root 1.2 =item $fileno = BDB::poll_fileno
461 root 1.1
462     Return the I<request result pipe file descriptor>. This filehandle must be
463     polled for reading by some mechanism outside this module (e.g. Event or
464     select, see below or the SYNOPSIS). If the pipe becomes readable you have
465     to call C<poll_cb> to check the results.
466    
467     See C<poll_cb> for an example.
468    
469 root 1.2 =item BDB::poll_cb
470 root 1.1
471     Process some outstanding events on the result pipe. You have to call this
472     regularly. Returns the number of events processed. Returns immediately
473     when no events are outstanding. The amount of events processed depends on
474 root 1.2 the settings of C<BDB::max_poll_req> and C<BDB::max_poll_time>.
475 root 1.1
476     If not all requests were processed for whatever reason, the filehandle
477     will still be ready when C<poll_cb> returns.
478    
479     Example: Install an Event watcher that automatically calls
480 root 1.2 BDB::poll_cb with high priority:
481 root 1.1
482 root 1.2 Event->io (fd => BDB::poll_fileno,
483 root 1.1 poll => 'r', async => 1,
484 root 1.2 cb => \&BDB::poll_cb);
485 root 1.1
486 root 1.2 =item BDB::max_poll_reqs $nreqs
487 root 1.1
488 root 1.2 =item BDB::max_poll_time $seconds
489 root 1.1
490     These set the maximum number of requests (default C<0>, meaning infinity)
491 root 1.2 that are being processed by C<BDB::poll_cb> in one call, respectively
492 root 1.1 the maximum amount of time (default C<0>, meaning infinity) spent in
493 root 1.2 C<BDB::poll_cb> to process requests (more correctly the mininum amount
494 root 1.1 of time C<poll_cb> is allowed to use).
495    
496     Setting C<max_poll_time> to a non-zero value creates an overhead of one
497     syscall per request processed, which is not normally a problem unless your
498     callbacks are really really fast or your OS is really really slow (I am
499     not mentioning Solaris here). Using C<max_poll_reqs> incurs no overhead.
500    
501     Setting these is useful if you want to ensure some level of
502     interactiveness when perl is not fast enough to process all requests in
503     time.
504    
505     For interactive programs, values such as C<0.01> to C<0.1> should be fine.
506    
507 root 1.18 Example: Install an EV watcher that automatically calls
508 root 1.2 BDB::poll_cb with low priority, to ensure that other parts of the
509 root 1.18 program get the CPU sometimes even under high load.
510 root 1.1
511     # try not to spend much more than 0.1s in poll_cb
512 root 1.2 BDB::max_poll_time 0.1;
513 root 1.1
514 root 1.18 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
515 root 1.1
516 root 1.2 =item BDB::poll_wait
517 root 1.1
518     If there are any outstanding requests and none of them in the result
519     phase, wait till the result filehandle becomes ready for reading (simply
520     does a C<select> on the filehandle. This is useful if you want to
521     synchronously wait for some requests to finish).
522    
523     See C<nreqs> for an example.
524    
525 root 1.2 =item BDB::poll
526 root 1.1
527     Waits until some requests have been handled.
528    
529     Returns the number of requests processed, but is otherwise strictly
530     equivalent to:
531    
532 root 1.2 BDB::poll_wait, BDB::poll_cb
533 root 1.1
534 root 1.2 =item BDB::flush
535 root 1.1
536 root 1.18 Wait till all outstanding BDB requests have been handled.
537 root 1.1
538     Strictly equivalent to:
539    
540 root 1.2 BDB::poll_wait, BDB::poll_cb
541     while BDB::nreqs;
542 root 1.1
543 root 1.8 =back
544    
545 root 1.39 =head3 VERSION CHECKING
546    
547     BerkeleyDB comes in various versions, many of them have minor
548     incompatibilities. This means that traditional "at least version x.x"
549     checks are often not sufficient.
550    
551 root 1.40 Example: set the log_autoremove option in a way compatible with <v.47 and
552     v4.7. Note the use of & on the constants to avoid triggering a compiletime
553     bug when the symbol isn't available.
554    
555     $DB_ENV->set_flags (&BDB::LOG_AUTOREMOVE ) if BDB::VERSION v0, v4.7;
556     $DB_ENV->log_set_config (&BDB::LOG_AUTO_REMOVE) if BDB::VERSION v4.7;
557    
558 root 1.39 =over 4
559    
560     =item BDB::VERSION
561    
562     The C<BDB::VERSION> function, when called without arguments, returns the
563     Berkeley DB version as a v-string (usually with 3 components). You should
564     use C<lt> and C<ge> operators exclusively to make comparisons.
565    
566     Example: check for at least version 4.7.
567    
568     BDB::VERSION ge v4.7 or die;
569    
570     =item BDB::VERSION min-version
571    
572     Returns true if the BDB version is at least the given version (specified
573     as a v-string), false otherwise.
574    
575     Example: check for at least version 4.5.
576    
577     BDB::VERSION v4.7 or die;
578    
579     =item BDB::VERSION min-version, max-version
580    
581     Returns true of the BDB version is at least version C<min-version> (specify C<undef> or C<v0> for any minimum version)
582     and less then C<max-version>.
583    
584     Example: check wether version is strictly less then v4.7.
585    
586     BDB::VERSION v0, v4.7
587     or die "version 4.7 is not yet supported";
588    
589     =back
590    
591     =cut
592    
593     sub VERSION {
594     if (@_ > 0) {
595     return undef if VERSION_v lt $_[0];
596     if (@_ > 1) {
597     return undef if VERSION_v ge $_[1];
598     }
599     }
600    
601     VERSION_v
602     }
603    
604 root 1.1 =head3 CONTROLLING THE NUMBER OF THREADS
605    
606 root 1.8 =over 4
607    
608 root 1.2 =item BDB::min_parallel $nthreads
609 root 1.1
610 root 1.18 Set the minimum number of BDB threads to C<$nthreads>. The current
611 root 1.1 default is C<8>, which means eight asynchronous operations can execute
612     concurrently at any one time (the number of outstanding requests,
613     however, is unlimited).
614    
615 root 1.18 BDB starts threads only on demand, when an BDB request is queued and
616 root 1.1 no free thread exists. Please note that queueing up a hundred requests can
617     create demand for a hundred threads, even if it turns out that everything
618     is in the cache and could have been processed faster by a single thread.
619    
620     It is recommended to keep the number of threads relatively low, as some
621     Linux kernel versions will scale negatively with the number of threads
622     (higher parallelity => MUCH higher latency). With current Linux 2.6
623     versions, 4-32 threads should be fine.
624    
625     Under most circumstances you don't need to call this function, as the
626     module selects a default that is suitable for low to moderate load.
627    
628 root 1.2 =item BDB::max_parallel $nthreads
629 root 1.1
630 root 1.18 Sets the maximum number of BDB threads to C<$nthreads>. If more than the
631 root 1.1 specified number of threads are currently running, this function kills
632     them. This function blocks until the limit is reached.
633    
634     While C<$nthreads> are zero, aio requests get queued but not executed
635     until the number of threads has been increased again.
636    
637     This module automatically runs C<max_parallel 0> at program end, to ensure
638     that all threads are killed and that there are no outstanding requests.
639    
640     Under normal circumstances you don't need to call this function.
641    
642 root 1.2 =item BDB::max_idle $nthreads
643 root 1.1
644     Limit the number of threads (default: 4) that are allowed to idle (i.e.,
645     threads that did not get a request to process within 10 seconds). That
646     means if a thread becomes idle while C<$nthreads> other threads are also
647     idle, it will free its resources and exit.
648    
649     This is useful when you allow a large number of threads (e.g. 100 or 1000)
650     to allow for extremely high load situations, but want to free resources
651     under normal circumstances (1000 threads can easily consume 30MB of RAM).
652    
653     The default is probably ok in most situations, especially if thread
654     creation is fast. If thread creation is very slow on your system you might
655     want to use larger values.
656    
657 root 1.2 =item $oldmaxreqs = BDB::max_outstanding $maxreqs
658 root 1.1
659     This is a very bad function to use in interactive programs because it
660     blocks, and a bad way to reduce concurrency because it is inexact: Better
661     use an C<aio_group> together with a feed callback.
662    
663     Sets the maximum number of outstanding requests to C<$nreqs>. If you
664     to queue up more than this number of requests, the next call to the
665     C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
666     function will block until the limit is no longer exceeded.
667    
668     The default value is very large, so there is no practical limit on the
669     number of outstanding requests.
670    
671     You can still queue as many requests as you want. Therefore,
672     C<max_oustsanding> is mainly useful in simple scripts (with low values) or
673     as a stop gap to shield against fatal memory overflow (with large values).
674    
675 root 1.3 =item BDB::set_sync_prepare $cb
676    
677     Sets a callback that is called whenever a request is created without an
678     explicit callback. It has to return two code references. The first is used
679 root 1.37 as the request callback (it should save the return status), and the second
680     is called to wait until the first callback has been called (it must set
681     C<$!> to the return status).
682    
683     This mechanism can be used to include BDB into other event mechanisms,
684     such as L<AnyEvent::BDB> or L<Coro::BDB>.
685    
686     The default implementation works like this:
687 root 1.3
688     sub {
689     my $status;
690     (
691     sub { $status = $! },
692     sub { BDB::poll while !defined $status; $! = $status },
693     )
694     }
695    
696 root 1.37 It simply blocks the process till the request has finished and then sets
697     C<$!> to the return value. This means that if you don't use a callback,
698     BDB will simply fall back to synchronous operations.
699    
700 root 1.3 =back
701    
702 root 1.1 =head3 STATISTICAL INFORMATION
703    
704 root 1.3 =over 4
705    
706 root 1.2 =item BDB::nreqs
707 root 1.1
708     Returns the number of requests currently in the ready, execute or pending
709     states (i.e. for which their callback has not been invoked yet).
710    
711     Example: wait till there are no outstanding requests anymore:
712    
713 root 1.2 BDB::poll_wait, BDB::poll_cb
714     while BDB::nreqs;
715 root 1.1
716 root 1.2 =item BDB::nready
717 root 1.1
718     Returns the number of requests currently in the ready state (not yet
719     executed).
720    
721 root 1.2 =item BDB::npending
722 root 1.1
723     Returns the number of requests currently in the pending state (executed,
724     but not yet processed by poll_cb).
725    
726     =back
727    
728     =cut
729    
730 root 1.3 set_sync_prepare {
731     my $status;
732     (
733     sub {
734     $status = $!;
735     },
736     sub {
737     BDB::poll while !defined $status;
738     $! = $status;
739     },
740     )
741     };
742    
743 root 1.1 min_parallel 8;
744    
745     END { flush }
746    
747     1;
748    
749     =head2 FORK BEHAVIOUR
750    
751     This module should do "the right thing" when the process using it forks:
752    
753 root 1.18 Before the fork, BDB enters a quiescent state where no requests
754 root 1.1 can be added in other threads and no results will be processed. After
755     the fork the parent simply leaves the quiescent state and continues
756     request/result processing, while the child frees the request/result queue
757     (so that the requests started before the fork will only be handled in the
758     parent). Threads will be started on demand until the limit set in the
759     parent process has been reached again.
760    
761     In short: the parent will, after a short pause, continue as if fork had
762 root 1.18 not been called, while the child will act as if BDB has not been used
763 root 1.1 yet.
764    
765 root 1.26 Win32 note: there is no fork on win32, and perls emulation of it is too
766     broken to be supported, so do not use BDB in a windows pseudo-fork, better
767     yet, switch to a more capable platform.
768    
769 root 1.1 =head2 MEMORY USAGE
770    
771     Per-request usage:
772    
773     Each aio request uses - depending on your architecture - around 100-200
774     bytes of memory. In addition, stat requests need a stat buffer (possibly
775     a few hundred bytes), readdir requires a result buffer and so on. Perl
776     scalars and other data passed into aio requests will also be locked and
777     will consume memory till the request has entered the done state.
778    
779 root 1.13 This is not awfully much, so queuing lots of requests is not usually a
780 root 1.1 problem.
781    
782     Per-thread usage:
783    
784     In the execution phase, some aio requests require more memory for
785     temporary buffers, and each thread requires a stack and other data
786     structures (usually around 16k-128k, depending on the OS).
787    
788     =head1 KNOWN BUGS
789    
790 root 1.15 Known bugs will be fixed in the next release, except:
791    
792     If you use a transaction in any request, and the request returns
793     with an operating system error or DB_LOCK_NOTGRANTED, the internal
794     TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
795     above.
796 root 1.1
797     =head1 SEE ALSO
798    
799 root 1.36 L<AnyEvent::BDB> (event loop integration), L<Coro::BDB> (more natural
800     syntax), L<IO::AIO> (nice to have).
801 root 1.1
802     =head1 AUTHOR
803    
804     Marc Lehmann <schmorp@schmorp.de>
805     http://home.schmorp.de/
806    
807     =cut
808