ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
Revision: 1.53
Committed: Tue Oct 21 02:21:25 2008 UTC (15 years, 7 months ago) by root
Branch: MAIN
CVS Tags: rel-1_81
Changes since 1.52: +53 -28 lines
Log Message:
1.81

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.2 BDB - Asynchronous Berkeley DB access
4 root 1.1
5     =head1 SYNOPSIS
6    
7 root 1.2 use BDB;
8 root 1.1
9 root 1.19 my $env = db_env_create;
10    
11     mkdir "bdtest", 0700;
12     db_env_open
13     $env,
14     "bdtest",
15     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16     | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17     0600;
18    
19     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20    
21     my $db = db_create $env;
22     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23     | BDB::READ_UNCOMMITTED, 0600;
24     db_put $db, undef, "key", "data", 0, sub {
25     db_del $db, undef, "key";
26     };
27     db_sync $db;
28    
29 root 1.28 # when you also use Coro, management is easy:
30     use Coro::BDB;
31    
32 root 1.36 # automatic event loop intergration with AnyEvent:
33     use AnyEvent::BDB;
34 root 1.19
35     # automatic result processing with EV:
36     my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
37    
38     # with Glib:
39     add_watch Glib::IO BDB::poll_fileno,
40     in => sub { BDB::poll_cb; 1 };
41    
42     # or simply flush manually
43     BDB::flush;
44    
45    
46 root 1.1 =head1 DESCRIPTION
47    
48 root 1.10 See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
49 root 1.12 The BDB API is very similar to the C API (the translation has been very faithful).
50 root 1.10
51     See also the example sections in the document below and possibly the eg/
52     subdirectory of the BDB distribution. Last not least see the IO::AIO
53     documentation, as that module uses almost the same asynchronous request
54     model as this module.
55    
56     I know this is woefully inadequate documentation. Send a patch!
57 root 1.7
58 root 1.1
59     =head1 REQUEST ANATOMY AND LIFETIME
60    
61     Every request method creates a request. which is a C data structure not
62     directly visible to Perl.
63    
64     During their existance, bdb requests travel through the following states,
65     in order:
66    
67     =over 4
68    
69     =item ready
70    
71     Immediately after a request is created it is put into the ready state,
72     waiting for a thread to execute it.
73    
74     =item execute
75    
76     A thread has accepted the request for processing and is currently
77     executing it (e.g. blocking in read).
78    
79     =item pending
80    
81     The request has been executed and is waiting for result processing.
82    
83     While request submission and execution is fully asynchronous, result
84     processing is not and relies on the perl interpreter calling C<poll_cb>
85     (or another function with the same effect).
86    
87     =item result
88    
89     The request results are processed synchronously by C<poll_cb>.
90    
91     The C<poll_cb> function will process all outstanding aio requests by
92     calling their callbacks, freeing memory associated with them and managing
93     any groups they are contained in.
94    
95     =item done
96    
97     Request has reached the end of its lifetime and holds no resources anymore
98     (except possibly for the Perl object, but its connection to the actual
99     aio request is severed and calling its methods will either do nothing or
100     result in a runtime error).
101    
102     =back
103    
104     =cut
105    
106 root 1.2 package BDB;
107 root 1.1
108     no warnings;
109     use strict 'vars';
110    
111     use base 'Exporter';
112    
113 root 1.46 our $VERSION;
114    
115 root 1.1 BEGIN {
116 root 1.52 $VERSION = '1.81';
117 root 1.1
118 root 1.3 our @BDB_REQ = qw(
119 root 1.6 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
120 root 1.38 db_env_memp_sync db_env_memp_trickle db_env_dbrename db_env_dbremove
121 root 1.53 db_env_log_archive
122 root 1.25 db_open db_close db_compact db_sync db_upgrade
123 root 1.44 db_put db_exists db_get db_pget db_del db_key_range
124 root 1.15 db_txn_commit db_txn_abort db_txn_finish
125 root 1.5 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
126 root 1.6 db_sequence_open db_sequence_close
127     db_sequence_get db_sequence_remove
128     );
129     our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
130     our @EXPORT_OK = qw(
131     poll_fileno poll_cb poll_wait flush
132     min_parallel max_parallel max_idle
133     nreqs nready npending nthreads
134     max_poll_time max_poll_reqs
135 root 1.3 );
136 root 1.1
137     require XSLoader;
138 root 1.2 XSLoader::load ("BDB", $VERSION);
139 root 1.1 }
140    
141 root 1.53 =head1 BERKELEYDB FUNCTIONS
142 root 1.10
143     All of these are functions. The create functions simply return a new
144 root 1.34 object and never block. All the remaining functions take an optional
145 root 1.24 callback as last argument. If it is missing, then the function will be
146     executed synchronously. In both cases, C<$!> will reflect the return value
147     of the function.
148 root 1.10
149     BDB functions that cannot block (mostly functions that manipulate
150     settings) are method calls on the relevant objects, so the rule of thumb
151 root 1.34 is: if it's a method, it's not blocking, if it's a function, it takes a
152 root 1.10 callback as last argument.
153    
154     In the following, C<$int> signifies an integer return value,
155 root 1.38 C<bdb_filename> is a "filename" (octets on unix, madness on windows),
156     C<U32> is an unsigned 32 bit integer, C<int> is some integer, C<NV> is a
157     floating point value.
158 root 1.10
159 root 1.41 Most C<SV *> types are generic perl scalars (for input and output of data
160     values).
161 root 1.10
162 root 1.11 The various C<DB_ENV> etc. arguments are handles return by
163     C<db_env_create>, C<db_create>, C<txn_begin> and so on. If they have an
164     appended C<_ornull> this means they are optional and you can pass C<undef>
165     for them, resulting a NULL pointer on the C level.
166 root 1.10
167 root 1.41 The C<SV *callback> is the optional callback function to call when the
168     request is completed. This last callback argument is special: the callback
169     is simply the last argument passed. If there are "optional" arguments
170     before the callback they can be left out. The callback itself can be left
171     out or specified as C<undef>, in which case the function will be executed
172     synchronously.
173    
174     For example, C<db_env_txn_checkpoint> usually is called with all integer
175     arguments zero. These can be left out, so all of these specify a call
176     to C<< DB_ENV->txn_checkpoint >>, to be executed asynchronously with a
177     callback to be called:
178    
179     db_env_txn_checkpoint $db_env, 0, 0, 0, sub { };
180     db_env_txn_checkpoint $db_env, 0, 0, sub { };
181     db_env_txn_checkpoint $db_env, sub { };
182    
183     While these all specify a call to C<< DB_ENV->txn_checkpoint >> to be
184     executed synchronously:
185    
186     db_env_txn_checkpoint $db_env, 0, 0, 0, undef;
187     db_env_txn_checkpoint $db_env, 0, 0, 0;
188     db_env_txn_checkpoint $db_env, 0;
189    
190 root 1.53 =head2 BDB functions
191 root 1.10
192     Functions in the BDB namespace, exported by default:
193    
194     $env = db_env_create (U32 env_flags = 0)
195 root 1.14 flags: RPCCLIENT
196 root 1.10
197 root 1.38 db_env_open (DB_ENV *env, bdb_filename db_home, U32 open_flags, int mode, SV *callback = &PL_sv_undef)
198 root 1.14 open_flags: INIT_CDB INIT_LOCK INIT_LOG INIT_MPOOL INIT_REP INIT_TXN RECOVER RECOVER_FATAL USE_ENVIRON USE_ENVIRON_ROOT CREATE LOCKDOWN PRIVATE REGISTER SYSTEM_MEM
199 root 1.10 db_env_close (DB_ENV *env, U32 flags = 0, SV *callback = &PL_sv_undef)
200     db_env_txn_checkpoint (DB_ENV *env, U32 kbyte = 0, U32 min = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
201 root 1.14 flags: FORCE
202 root 1.10 db_env_lock_detect (DB_ENV *env, U32 flags = 0, U32 atype = DB_LOCK_DEFAULT, SV *dummy = 0, SV *callback = &PL_sv_undef)
203 root 1.14 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST
204 root 1.10 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = &PL_sv_undef)
205     db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = &PL_sv_undef)
206 root 1.38 db_env_dbremove (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, U32 flags = 0, SV *callback = &PL_sv_undef)
207     db_env_dbrename (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, bdb_filename newname, U32 flags = 0, SV *callback = &PL_sv_undef)
208 root 1.53 db_env_log_archive (DB_ENV *env, SV *listp, U32 flags = 0, SV *callback = &PL_sv_undef)
209 root 1.10
210     $db = db_create (DB_ENV *env = 0, U32 flags = 0)
211 root 1.14 flags: XA_CREATE
212 root 1.10
213 root 1.38 db_open (DB *db, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
214 root 1.14 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
215 root 1.10 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
216 root 1.14 flags: DB_NOSYNC
217 root 1.38 db_upgrade (DB *db, bdb_filename file, U32 flags = 0, SV *callback = &PL_sv_undef)
218 root 1.12 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef)
219 root 1.14 flags: FREELIST_ONLY FREE_SPACE
220 root 1.10 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
221     db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef)
222     db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
223 root 1.14 flags: APPEND NODUPDATA NOOVERWRITE
224 root 1.45 db_exists (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = 0) (v4.6)
225 root 1.10 db_get (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
226 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
227 root 1.10 db_pget (DB *db, DB_TXN_ornull *txn, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
228 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
229 root 1.10 db_del (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
230     db_txn_commit (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
231 root 1.14 flags: TXN_NOSYNC TXN_SYNC
232 root 1.10 db_txn_abort (DB_TXN *txn, SV *callback = &PL_sv_undef)
233 root 1.14
234 root 1.10 db_c_close (DBC *dbc, SV *callback = &PL_sv_undef)
235     db_c_count (DBC *dbc, SV *count, U32 flags = 0, SV *callback = &PL_sv_undef)
236     db_c_put (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
237 root 1.14 flags: AFTER BEFORE CURRENT KEYFIRST KEYLAST NODUPDATA
238 root 1.10 db_c_get (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
239 root 1.14 flags: CURRENT FIRST GET_BOTH GET_BOTH_RANGE GET_RECNO JOIN_ITEM LAST NEXT NEXT_DUP NEXT_NODUP PREV PREV_DUP PREV_NODUP SET SET_RANGE SET_RECNO READ_UNCOMMITTED MULTIPLE MULTIPLE_KEY RMW
240 root 1.10 db_c_pget (DBC *dbc, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
241     db_c_del (DBC *dbc, U32 flags = 0, SV *callback = &PL_sv_undef)
242    
243     db_sequence_open (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
244 root 1.14 flags: CREATE EXCL
245 root 1.10 db_sequence_close (DB_SEQUENCE *seq, U32 flags = 0, SV *callback = &PL_sv_undef)
246     db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
247 root 1.14 flags: TXN_NOSYNC
248 root 1.10 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
249 root 1.14 flags: TXN_NOSYNC
250 root 1.10
251 root 1.53 =head3 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
252 root 1.15
253 root 1.16 This is not actually a Berkeley DB function but a BDB module
254     extension. The background for this exytension is: It is very annoying to
255     have to check every single BDB function for error returns and provide a
256     codepath out of your transaction. While the BDB module still makes this
257     possible, it contains the following extensions:
258 root 1.15
259     When a transaction-protected function returns any operating system
260     error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
261 root 1.16 transaction. This flag is also set by Berkeley DB functions themselves
262 root 1.15 when an operation fails with LOCK_DEADLOCK, and it causes all further
263     operations on that transaction (including C<db_txn_commit>) to fail.
264    
265     The C<db_txn_finish> request will look at this flag, and, if it is set,
266     will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
267 root 1.16 if it isn't set to something else yet). If it isn't set, it will call
268     C<db_txn_commit> and return the error normally.
269 root 1.15
270     How to use this? Easy: just write your transaction normally:
271    
272     my $txn = $db_env->txn_begin;
273     db_get $db, $txn, "key", my $data;
274     db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
275     db_txn_finish $txn;
276     die "transaction failed" if $!;
277    
278     That is, handle only the expected errors. If something unexpected happens
279     (EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
280     requests (db_put in this case) will simply be skipped (they will fail with
281     LOCK_DEADLOCK) and the transaction will be aborted.
282    
283 root 1.16 You can use the C<< $txn->failed >> method to check wether a transaction
284 root 1.15 has failed in this way and abort further processing (excluding
285     C<db_txn_finish>).
286    
287 root 1.53 =head2 DB_ENV/database environment methods
288 root 1.10
289     Methods available on DB_ENV/$env handles:
290    
291     DESTROY (DB_ENV_ornull *env)
292     CODE:
293     if (env)
294     env->close (env, 0);
295    
296     $int = $env->set_data_dir (const char *dir)
297     $int = $env->set_tmp_dir (const char *dir)
298     $int = $env->set_lg_dir (const char *dir)
299     $int = $env->set_shm_key (long shm_key)
300     $int = $env->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
301 root 1.39 $int = $env->set_flags (U32 flags, int onoff = 1)
302 root 1.45 $int = $env->log_set_config (U32 flags, int onoff = 1) (v4.7)
303     $int = $env->set_intermediate_dir_mode (const char *modestring) (v4.7)
304 root 1.10 $env->set_errfile (FILE *errfile = 0)
305     $env->set_msgfile (FILE *msgfile = 0)
306     $int = $env->set_verbose (U32 which, int onoff = 1)
307     $int = $env->set_encrypt (const char *password, U32 flags = 0)
308 root 1.15 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
309 root 1.10 $int = $env->set_mp_max_openfd (int maxopenfd);
310     $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
311     $int = $env->set_mp_mmapsize (int mmapsize_mb)
312     $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
313     $int = $env->set_lk_max_lockers (U32 max)
314     $int = $env->set_lk_max_locks (U32 max)
315     $int = $env->set_lk_max_objects (U32 max)
316     $int = $env->set_lg_bsize (U32 max)
317     $int = $env->set_lg_max (U32 max)
318 root 1.20 $int = $env->mutex_set_increment (U32 increment)
319     $int = $env->mutex_set_tas_spins (U32 tas_spins)
320     $int = $env->mutex_set_max (U32 max)
321     $int = $env->mutex_set_align (U32 align)
322 root 1.10
323     $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
324 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
325 root 1.45 $txn = $env->cdsgroup_begin; (v4.5)
326 root 1.10
327 root 1.53 =head3 Example:
328 root 1.10
329     use AnyEvent;
330     use BDB;
331    
332     our $FH; open $FH, "<&=" . BDB::poll_fileno;
333     our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
334    
335     BDB::min_parallel 8;
336    
337     my $env = db_env_create;
338    
339     mkdir "bdtest", 0700;
340     db_env_open
341     $env,
342     "bdtest",
343     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
344     0600;
345    
346     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
347    
348    
349 root 1.53 =head2 DB/database methods
350 root 1.10
351     Methods available on DB/$db handles:
352    
353     DESTROY (DB_ornull *db)
354     CODE:
355     if (db)
356     {
357     SV *env = (SV *)db->app_private;
358     db->close (db, 0);
359     SvREFCNT_dec (env);
360     }
361    
362     $int = $db->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
363     $int = $db->set_flags (U32 flags)
364 root 1.14 flags: CHKSUM ENCRYPT TXN_NOT_DURABLE
365     Btree: DUP DUPSORT RECNUM REVSPLITOFF
366     Hash: DUP DUPSORT
367     Queue: INORDER
368     Recno: RENUMBER SNAPSHOT
369    
370 root 1.10 $int = $db->set_encrypt (const char *password, U32 flags)
371     $int = $db->set_lorder (int lorder)
372     $int = $db->set_bt_minkey (U32 minkey)
373     $int = $db->set_re_delim (int delim)
374     $int = $db->set_re_pad (int re_pad)
375     $int = $db->set_re_source (char *source)
376     $int = $db->set_re_len (U32 re_len)
377     $int = $db->set_h_ffactor (U32 h_ffactor)
378     $int = $db->set_h_nelem (U32 h_nelem)
379     $int = $db->set_q_extentsize (U32 extentsize)
380    
381     $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0)
382 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT
383 root 1.10 $seq = $db->sequence (U32 flags = 0)
384    
385 root 1.53 =head3 Example:
386 root 1.10
387     my $db = db_create $env;
388     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600;
389    
390     for (1..1000) {
391     db_put $db, undef, "key $_", "data $_";
392    
393     db_key_range $db, undef, "key $_", my $keyrange;
394     my ($lt, $eq, $gt) = @$keyrange;
395     }
396    
397     db_del $db, undef, "key $_" for 1..1000;
398    
399     db_sync $db;
400    
401    
402 root 1.53 =head2 DB_TXN/transaction methods
403 root 1.10
404     Methods available on DB_TXN/$txn handles:
405    
406     DESTROY (DB_TXN_ornull *txn)
407     CODE:
408     if (txn)
409     txn->abort (txn);
410    
411 root 1.15 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
412 root 1.14 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
413 root 1.10
414 root 1.15 $bool = $txn->failed
415     # see db_txn_finish documentation, above
416    
417 root 1.10
418 root 1.53 =head2 DBC/cursor methods
419 root 1.10
420     Methods available on DBC/$dbc handles:
421    
422     DESTROY (DBC_ornull *dbc)
423     CODE:
424     if (dbc)
425     dbc->c_close (dbc);
426    
427 root 1.45 $int = $cursor->set_priority ($priority = PRIORITY_*) (v4.6)
428 root 1.28
429 root 1.53 =head3 Example:
430 root 1.10
431     my $c = $db->cursor;
432    
433     for (;;) {
434     db_c_get $c, my $key, my $data, BDB::NEXT;
435     warn "<$!,$key,$data>";
436     last if $!;
437     }
438    
439     db_c_close $c;
440    
441 root 1.12
442 root 1.53 =head2 DB_SEQUENCE/sequence methods
443 root 1.10
444     Methods available on DB_SEQUENCE/$seq handles:
445    
446     DESTROY (DB_SEQUENCE_ornull *seq)
447     CODE:
448     if (seq)
449     seq->close (seq, 0);
450    
451     $int = $seq->initial_value (db_seq_t value)
452     $int = $seq->set_cachesize (U32 size)
453     $int = $seq->set_flags (U32 flags)
454 root 1.14 flags: SEQ_DEC SEQ_INC SEQ_WRAP
455 root 1.10 $int = $seq->set_range (db_seq_t min, db_seq_t max)
456    
457 root 1.53 =head3 Example:
458 root 1.10
459     my $seq = $db->sequence;
460    
461     db_sequence_open $seq, undef, "seq", BDB::CREATE;
462     db_sequence_get $seq, undef, 1, my $value;
463    
464    
465 root 1.53 =head1 SUPPORT FUNCTIONS
466 root 1.1
467 root 1.53 =head2 EVENT PROCESSING AND EVENT LOOP INTEGRATION
468 root 1.1
469     =over 4
470    
471 root 1.23 =item $msg = BDB::strerror [$errno]
472    
473     Returns the string corresponding to the given errno value. If no argument
474     is given, use C<$!>.
475    
476 root 1.33 Note that the BDB module also patches the C<$!> variable directly, so you
477     should be able to get a bdb error string by simply stringifying C<$!>.
478    
479 root 1.2 =item $fileno = BDB::poll_fileno
480 root 1.1
481     Return the I<request result pipe file descriptor>. This filehandle must be
482     polled for reading by some mechanism outside this module (e.g. Event or
483     select, see below or the SYNOPSIS). If the pipe becomes readable you have
484     to call C<poll_cb> to check the results.
485    
486     See C<poll_cb> for an example.
487    
488 root 1.2 =item BDB::poll_cb
489 root 1.1
490     Process some outstanding events on the result pipe. You have to call this
491     regularly. Returns the number of events processed. Returns immediately
492     when no events are outstanding. The amount of events processed depends on
493 root 1.2 the settings of C<BDB::max_poll_req> and C<BDB::max_poll_time>.
494 root 1.1
495     If not all requests were processed for whatever reason, the filehandle
496     will still be ready when C<poll_cb> returns.
497    
498     Example: Install an Event watcher that automatically calls
499 root 1.2 BDB::poll_cb with high priority:
500 root 1.1
501 root 1.2 Event->io (fd => BDB::poll_fileno,
502 root 1.1 poll => 'r', async => 1,
503 root 1.2 cb => \&BDB::poll_cb);
504 root 1.1
505 root 1.2 =item BDB::max_poll_reqs $nreqs
506 root 1.1
507 root 1.2 =item BDB::max_poll_time $seconds
508 root 1.1
509     These set the maximum number of requests (default C<0>, meaning infinity)
510 root 1.2 that are being processed by C<BDB::poll_cb> in one call, respectively
511 root 1.1 the maximum amount of time (default C<0>, meaning infinity) spent in
512 root 1.2 C<BDB::poll_cb> to process requests (more correctly the mininum amount
513 root 1.1 of time C<poll_cb> is allowed to use).
514    
515     Setting C<max_poll_time> to a non-zero value creates an overhead of one
516     syscall per request processed, which is not normally a problem unless your
517     callbacks are really really fast or your OS is really really slow (I am
518     not mentioning Solaris here). Using C<max_poll_reqs> incurs no overhead.
519    
520     Setting these is useful if you want to ensure some level of
521     interactiveness when perl is not fast enough to process all requests in
522     time.
523    
524     For interactive programs, values such as C<0.01> to C<0.1> should be fine.
525    
526 root 1.18 Example: Install an EV watcher that automatically calls
527 root 1.2 BDB::poll_cb with low priority, to ensure that other parts of the
528 root 1.18 program get the CPU sometimes even under high load.
529 root 1.1
530     # try not to spend much more than 0.1s in poll_cb
531 root 1.2 BDB::max_poll_time 0.1;
532 root 1.1
533 root 1.18 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
534 root 1.1
535 root 1.2 =item BDB::poll_wait
536 root 1.1
537     If there are any outstanding requests and none of them in the result
538     phase, wait till the result filehandle becomes ready for reading (simply
539     does a C<select> on the filehandle. This is useful if you want to
540     synchronously wait for some requests to finish).
541    
542     See C<nreqs> for an example.
543    
544 root 1.2 =item BDB::poll
545 root 1.1
546     Waits until some requests have been handled.
547    
548     Returns the number of requests processed, but is otherwise strictly
549     equivalent to:
550    
551 root 1.2 BDB::poll_wait, BDB::poll_cb
552 root 1.1
553 root 1.2 =item BDB::flush
554 root 1.1
555 root 1.18 Wait till all outstanding BDB requests have been handled.
556 root 1.1
557     Strictly equivalent to:
558    
559 root 1.2 BDB::poll_wait, BDB::poll_cb
560     while BDB::nreqs;
561 root 1.1
562 root 1.8 =back
563    
564 root 1.53 =head2 VERSION CHECKING
565 root 1.39
566     BerkeleyDB comes in various versions, many of them have minor
567     incompatibilities. This means that traditional "at least version x.x"
568     checks are often not sufficient.
569    
570 root 1.47 Example: set the log_autoremove option in a way compatible with <v4.7 and
571 root 1.40 v4.7. Note the use of & on the constants to avoid triggering a compiletime
572     bug when the symbol isn't available.
573    
574     $DB_ENV->set_flags (&BDB::LOG_AUTOREMOVE ) if BDB::VERSION v0, v4.7;
575     $DB_ENV->log_set_config (&BDB::LOG_AUTO_REMOVE) if BDB::VERSION v4.7;
576    
577 root 1.39 =over 4
578    
579     =item BDB::VERSION
580    
581     The C<BDB::VERSION> function, when called without arguments, returns the
582     Berkeley DB version as a v-string (usually with 3 components). You should
583     use C<lt> and C<ge> operators exclusively to make comparisons.
584    
585     Example: check for at least version 4.7.
586    
587     BDB::VERSION ge v4.7 or die;
588    
589     =item BDB::VERSION min-version
590    
591     Returns true if the BDB version is at least the given version (specified
592     as a v-string), false otherwise.
593    
594     Example: check for at least version 4.5.
595    
596     BDB::VERSION v4.7 or die;
597    
598     =item BDB::VERSION min-version, max-version
599    
600     Returns true of the BDB version is at least version C<min-version> (specify C<undef> or C<v0> for any minimum version)
601     and less then C<max-version>.
602    
603     Example: check wether version is strictly less then v4.7.
604    
605     BDB::VERSION v0, v4.7
606     or die "version 4.7 is not yet supported";
607    
608     =back
609    
610     =cut
611    
612     sub VERSION {
613 root 1.46 # I was dumb enough to override the VERSION method here, so let's try
614     # to fix it up.
615    
616     if ($_[0] eq __PACKAGE__) {
617     $VERSION
618     } else {
619     if (@_ > 0) {
620     return undef if VERSION_v lt $_[0];
621     if (@_ > 1) {
622     return undef if VERSION_v ge $_[1];
623     }
624 root 1.39 }
625 root 1.46
626     VERSION_v
627 root 1.39 }
628     }
629    
630 root 1.53 =head2 CONTROLLING THE NUMBER OF THREADS
631 root 1.1
632 root 1.8 =over 4
633    
634 root 1.2 =item BDB::min_parallel $nthreads
635 root 1.1
636 root 1.18 Set the minimum number of BDB threads to C<$nthreads>. The current
637 root 1.1 default is C<8>, which means eight asynchronous operations can execute
638     concurrently at any one time (the number of outstanding requests,
639     however, is unlimited).
640    
641 root 1.18 BDB starts threads only on demand, when an BDB request is queued and
642 root 1.1 no free thread exists. Please note that queueing up a hundred requests can
643     create demand for a hundred threads, even if it turns out that everything
644     is in the cache and could have been processed faster by a single thread.
645    
646     It is recommended to keep the number of threads relatively low, as some
647     Linux kernel versions will scale negatively with the number of threads
648     (higher parallelity => MUCH higher latency). With current Linux 2.6
649     versions, 4-32 threads should be fine.
650    
651     Under most circumstances you don't need to call this function, as the
652     module selects a default that is suitable for low to moderate load.
653    
654 root 1.2 =item BDB::max_parallel $nthreads
655 root 1.1
656 root 1.18 Sets the maximum number of BDB threads to C<$nthreads>. If more than the
657 root 1.1 specified number of threads are currently running, this function kills
658     them. This function blocks until the limit is reached.
659    
660     While C<$nthreads> are zero, aio requests get queued but not executed
661     until the number of threads has been increased again.
662    
663     This module automatically runs C<max_parallel 0> at program end, to ensure
664     that all threads are killed and that there are no outstanding requests.
665    
666     Under normal circumstances you don't need to call this function.
667    
668 root 1.2 =item BDB::max_idle $nthreads
669 root 1.1
670     Limit the number of threads (default: 4) that are allowed to idle (i.e.,
671     threads that did not get a request to process within 10 seconds). That
672     means if a thread becomes idle while C<$nthreads> other threads are also
673     idle, it will free its resources and exit.
674    
675     This is useful when you allow a large number of threads (e.g. 100 or 1000)
676     to allow for extremely high load situations, but want to free resources
677     under normal circumstances (1000 threads can easily consume 30MB of RAM).
678    
679     The default is probably ok in most situations, especially if thread
680     creation is fast. If thread creation is very slow on your system you might
681     want to use larger values.
682    
683 root 1.2 =item $oldmaxreqs = BDB::max_outstanding $maxreqs
684 root 1.1
685     This is a very bad function to use in interactive programs because it
686     blocks, and a bad way to reduce concurrency because it is inexact: Better
687     use an C<aio_group> together with a feed callback.
688    
689     Sets the maximum number of outstanding requests to C<$nreqs>. If you
690     to queue up more than this number of requests, the next call to the
691     C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
692     function will block until the limit is no longer exceeded.
693    
694     The default value is very large, so there is no practical limit on the
695     number of outstanding requests.
696    
697     You can still queue as many requests as you want. Therefore,
698     C<max_oustsanding> is mainly useful in simple scripts (with low values) or
699     as a stop gap to shield against fatal memory overflow (with large values).
700    
701 root 1.50 =item $old_cb = BDB::set_sync_prepare $cb
702 root 1.3
703     Sets a callback that is called whenever a request is created without an
704     explicit callback. It has to return two code references. The first is used
705 root 1.37 as the request callback (it should save the return status), and the second
706     is called to wait until the first callback has been called (it must set
707     C<$!> to the return status).
708    
709     This mechanism can be used to include BDB into other event mechanisms,
710 root 1.50 such as L<Coro::BDB>.
711 root 1.37
712 root 1.50 To allow other, callback-based, events to be executed while callback-less
713     ones are run, you could use this sync prepare function:
714 root 1.3
715     sub {
716     my $status;
717     (
718     sub { $status = $! },
719     sub { BDB::poll while !defined $status; $! = $status },
720     )
721     }
722    
723 root 1.50 It works by polling for results till the request has finished and then
724     sets C<$!> to the return value. This means that if you don't use a
725     callback, BDB would simply fall back to synchronous operations.
726    
727     By default, or if the sync prepare function is set to C<undef>, is to
728     execute callback-less BDB requests in the foreground thread, setting C<$!>
729     to the return value, without polling for other events.
730 root 1.37
731 root 1.3 =back
732    
733 root 1.53 =head2 STATISTICAL INFORMATION
734 root 1.1
735 root 1.3 =over 4
736    
737 root 1.2 =item BDB::nreqs
738 root 1.1
739     Returns the number of requests currently in the ready, execute or pending
740     states (i.e. for which their callback has not been invoked yet).
741    
742     Example: wait till there are no outstanding requests anymore:
743    
744 root 1.2 BDB::poll_wait, BDB::poll_cb
745     while BDB::nreqs;
746 root 1.1
747 root 1.2 =item BDB::nready
748 root 1.1
749     Returns the number of requests currently in the ready state (not yet
750     executed).
751    
752 root 1.2 =item BDB::npending
753 root 1.1
754     Returns the number of requests currently in the pending state (executed,
755     but not yet processed by poll_cb).
756    
757     =back
758    
759     =cut
760    
761 root 1.50 set_sync_prepare (undef);
762 root 1.3
763 root 1.1 min_parallel 8;
764    
765     END { flush }
766    
767     1;
768    
769 root 1.53 =head1 COMMON PITFALLS
770    
771     =head2 Unexpected Crashes
772    
773     Remember that, by default, BDB will execute requests in parallel, in
774     somewhat random order. That means that it is easy to run a C<db_get>
775     request on thesa me database as a concurrent C<db_close> request, leading
776     to a crash, silent data corruption, eventually the next world war on
777     terrorism.
778    
779     If you only ever use foreground requests (without a callback), this will
780     not be an issue.
781    
782     =head2 Unexpected Freezes or Deadlocks
783    
784     Remember that, by default, BDB will execute requests in parallel, which
785     easily leads to deadlocks (even concurrent put's on the same database can
786     deadlock).
787    
788     You either need to run deadlock detection (and handle the resulting
789     errors), or make sure only one process ever updates the database, ine one
790     thread, e.g. by using only foreground requests (without a callback).
791    
792     =head1 FORK BEHAVIOUR
793 root 1.1
794     This module should do "the right thing" when the process using it forks:
795    
796 root 1.18 Before the fork, BDB enters a quiescent state where no requests
797 root 1.1 can be added in other threads and no results will be processed. After
798     the fork the parent simply leaves the quiescent state and continues
799     request/result processing, while the child frees the request/result queue
800     (so that the requests started before the fork will only be handled in the
801     parent). Threads will be started on demand until the limit set in the
802     parent process has been reached again.
803    
804     In short: the parent will, after a short pause, continue as if fork had
805 root 1.18 not been called, while the child will act as if BDB has not been used
806 root 1.1 yet.
807    
808 root 1.26 Win32 note: there is no fork on win32, and perls emulation of it is too
809     broken to be supported, so do not use BDB in a windows pseudo-fork, better
810     yet, switch to a more capable platform.
811    
812 root 1.53 =head1 MEMORY USAGE
813 root 1.1
814     Per-request usage:
815    
816     Each aio request uses - depending on your architecture - around 100-200
817     bytes of memory. In addition, stat requests need a stat buffer (possibly
818     a few hundred bytes), readdir requires a result buffer and so on. Perl
819     scalars and other data passed into aio requests will also be locked and
820     will consume memory till the request has entered the done state.
821    
822 root 1.13 This is not awfully much, so queuing lots of requests is not usually a
823 root 1.1 problem.
824    
825     Per-thread usage:
826    
827     In the execution phase, some aio requests require more memory for
828     temporary buffers, and each thread requires a stack and other data
829     structures (usually around 16k-128k, depending on the OS).
830    
831 root 1.53 =head1 WIN32 FILENAMES/DATABASE NAME MESS
832    
833     Perl on Win32 supports only ASCII filenames (the reason is that it abuses
834     an internal flag to store wether a filename is Unicode or ANSI, but that
835     flag is used for somethign else in the perl core, so there is no way to
836     detect wether a filename is ANSI or Unicode-encoded). The BDB module
837     tries to work around this issue by assuming that the filename is an ANSI
838     filename and BDB was built for unicode support.
839    
840 root 1.1 =head1 KNOWN BUGS
841    
842 root 1.15 Known bugs will be fixed in the next release, except:
843    
844     If you use a transaction in any request, and the request returns
845     with an operating system error or DB_LOCK_NOTGRANTED, the internal
846     TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
847     above.
848 root 1.1
849     =head1 SEE ALSO
850    
851 root 1.36 L<AnyEvent::BDB> (event loop integration), L<Coro::BDB> (more natural
852     syntax), L<IO::AIO> (nice to have).
853 root 1.1
854     =head1 AUTHOR
855    
856     Marc Lehmann <schmorp@schmorp.de>
857     http://home.schmorp.de/
858    
859     =cut
860