ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
Revision: 1.69
Committed: Tue Feb 2 04:28:06 2016 UTC (8 years, 3 months ago) by root
Branch: MAIN
Changes since 1.68: +2 -1 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.2 BDB - Asynchronous Berkeley DB access
4 root 1.1
5     =head1 SYNOPSIS
6    
7 root 1.2 use BDB;
8 root 1.1
9 root 1.19 my $env = db_env_create;
10    
11     mkdir "bdtest", 0700;
12     db_env_open
13     $env,
14     "bdtest",
15     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16     | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17     0600;
18    
19     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20    
21     my $db = db_create $env;
22     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23     | BDB::READ_UNCOMMITTED, 0600;
24     db_put $db, undef, "key", "data", 0, sub {
25     db_del $db, undef, "key";
26     };
27     db_sync $db;
28    
29 root 1.28 # when you also use Coro, management is easy:
30     use Coro::BDB;
31    
32 root 1.58 # automatic event loop integration with AnyEvent:
33 root 1.36 use AnyEvent::BDB;
34 root 1.19
35     # automatic result processing with EV:
36     my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
37    
38     # with Glib:
39     add_watch Glib::IO BDB::poll_fileno,
40     in => sub { BDB::poll_cb; 1 };
41    
42     # or simply flush manually
43     BDB::flush;
44    
45    
46 root 1.1 =head1 DESCRIPTION
47    
48 root 1.10 See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
49 root 1.12 The BDB API is very similar to the C API (the translation has been very faithful).
50 root 1.10
51     See also the example sections in the document below and possibly the eg/
52     subdirectory of the BDB distribution. Last not least see the IO::AIO
53     documentation, as that module uses almost the same asynchronous request
54     model as this module.
55    
56     I know this is woefully inadequate documentation. Send a patch!
57 root 1.7
58 root 1.1
59     =head1 REQUEST ANATOMY AND LIFETIME
60    
61     Every request method creates a request. which is a C data structure not
62     directly visible to Perl.
63    
64     During their existance, bdb requests travel through the following states,
65     in order:
66    
67     =over 4
68    
69     =item ready
70    
71     Immediately after a request is created it is put into the ready state,
72     waiting for a thread to execute it.
73    
74     =item execute
75    
76     A thread has accepted the request for processing and is currently
77     executing it (e.g. blocking in read).
78    
79     =item pending
80    
81     The request has been executed and is waiting for result processing.
82    
83     While request submission and execution is fully asynchronous, result
84     processing is not and relies on the perl interpreter calling C<poll_cb>
85     (or another function with the same effect).
86    
87     =item result
88    
89     The request results are processed synchronously by C<poll_cb>.
90    
91     The C<poll_cb> function will process all outstanding aio requests by
92     calling their callbacks, freeing memory associated with them and managing
93     any groups they are contained in.
94    
95     =item done
96    
97     Request has reached the end of its lifetime and holds no resources anymore
98     (except possibly for the Perl object, but its connection to the actual
99     aio request is severed and calling its methods will either do nothing or
100     result in a runtime error).
101    
102     =back
103    
104     =cut
105    
106 root 1.2 package BDB;
107 root 1.1
108 root 1.57 use common::sense;
109 root 1.1
110     use base 'Exporter';
111    
112 root 1.46 our $VERSION;
113    
114 root 1.1 BEGIN {
115 root 1.66 $VERSION = '1.91';
116 root 1.1
117 root 1.3 our @BDB_REQ = qw(
118 root 1.6 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
119 root 1.38 db_env_memp_sync db_env_memp_trickle db_env_dbrename db_env_dbremove
120 root 1.69 db_env_log_archive db_env_lsn_reset db_env_fileid_reset
121 root 1.56 db_open db_close db_compact db_sync db_verify db_upgrade
122 root 1.44 db_put db_exists db_get db_pget db_del db_key_range
123 root 1.15 db_txn_commit db_txn_abort db_txn_finish
124 root 1.5 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
125 root 1.6 db_sequence_open db_sequence_close
126     db_sequence_get db_sequence_remove
127     );
128     our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
129     our @EXPORT_OK = qw(
130     poll_fileno poll_cb poll_wait flush
131     min_parallel max_parallel max_idle
132     nreqs nready npending nthreads
133     max_poll_time max_poll_reqs
134 root 1.3 );
135 root 1.1
136     require XSLoader;
137 root 1.2 XSLoader::load ("BDB", $VERSION);
138 root 1.1 }
139    
140 root 1.53 =head1 BERKELEYDB FUNCTIONS
141 root 1.10
142     All of these are functions. The create functions simply return a new
143 root 1.34 object and never block. All the remaining functions take an optional
144 root 1.24 callback as last argument. If it is missing, then the function will be
145     executed synchronously. In both cases, C<$!> will reflect the return value
146     of the function.
147 root 1.10
148     BDB functions that cannot block (mostly functions that manipulate
149     settings) are method calls on the relevant objects, so the rule of thumb
150 root 1.34 is: if it's a method, it's not blocking, if it's a function, it takes a
151 root 1.10 callback as last argument.
152    
153     In the following, C<$int> signifies an integer return value,
154 root 1.38 C<bdb_filename> is a "filename" (octets on unix, madness on windows),
155     C<U32> is an unsigned 32 bit integer, C<int> is some integer, C<NV> is a
156     floating point value.
157 root 1.10
158 root 1.41 Most C<SV *> types are generic perl scalars (for input and output of data
159     values).
160 root 1.10
161 root 1.11 The various C<DB_ENV> etc. arguments are handles return by
162     C<db_env_create>, C<db_create>, C<txn_begin> and so on. If they have an
163     appended C<_ornull> this means they are optional and you can pass C<undef>
164     for them, resulting a NULL pointer on the C level.
165 root 1.10
166 root 1.41 The C<SV *callback> is the optional callback function to call when the
167     request is completed. This last callback argument is special: the callback
168     is simply the last argument passed. If there are "optional" arguments
169     before the callback they can be left out. The callback itself can be left
170     out or specified as C<undef>, in which case the function will be executed
171     synchronously.
172    
173     For example, C<db_env_txn_checkpoint> usually is called with all integer
174     arguments zero. These can be left out, so all of these specify a call
175     to C<< DB_ENV->txn_checkpoint >>, to be executed asynchronously with a
176     callback to be called:
177    
178     db_env_txn_checkpoint $db_env, 0, 0, 0, sub { };
179     db_env_txn_checkpoint $db_env, 0, 0, sub { };
180     db_env_txn_checkpoint $db_env, sub { };
181    
182     While these all specify a call to C<< DB_ENV->txn_checkpoint >> to be
183     executed synchronously:
184    
185     db_env_txn_checkpoint $db_env, 0, 0, 0, undef;
186     db_env_txn_checkpoint $db_env, 0, 0, 0;
187     db_env_txn_checkpoint $db_env, 0;
188    
189 root 1.53 =head2 BDB functions
190 root 1.10
191     Functions in the BDB namespace, exported by default:
192    
193     $env = db_env_create (U32 env_flags = 0)
194 root 1.14 flags: RPCCLIENT
195 root 1.10
196 root 1.56 db_env_open (DB_ENV *env, bdb_filename db_home, U32 open_flags, int mode, SV *callback = 0)
197 root 1.14 open_flags: INIT_CDB INIT_LOCK INIT_LOG INIT_MPOOL INIT_REP INIT_TXN RECOVER RECOVER_FATAL USE_ENVIRON USE_ENVIRON_ROOT CREATE LOCKDOWN PRIVATE REGISTER SYSTEM_MEM
198 root 1.56 db_env_close (DB_ENV *env, U32 flags = 0, SV *callback = 0)
199     db_env_txn_checkpoint (DB_ENV *env, U32 kbyte = 0, U32 min = 0, U32 flags = 0, SV *callback = 0)
200 root 1.14 flags: FORCE
201 root 1.56 db_env_lock_detect (DB_ENV *env, U32 flags = 0, U32 atype = DB_LOCK_DEFAULT, SV *dummy = 0, SV *callback = 0)
202 root 1.14 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST
203 root 1.56 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = 0)
204     db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = 0)
205     db_env_dbremove (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, U32 flags = 0, SV *callback = 0)
206     db_env_dbrename (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, bdb_filename newname, U32 flags = 0, SV *callback = 0)
207     db_env_log_archive (DB_ENV *env, SV *listp, U32 flags = 0, SV *callback = 0)
208 root 1.67 db_env_lsn_reset (DB_ENV *env, bdb_filename db, U32 flags = 0, SV *callback = 0)
209 root 1.69 db_env_fileid_reset (DB_ENV *env, bdb_filename db, U32 flags = 0, SV *callback = 0)
210 root 1.10
211     $db = db_create (DB_ENV *env = 0, U32 flags = 0)
212 root 1.14 flags: XA_CREATE
213 root 1.10
214 root 1.56 db_open (DB *db, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, int type, U32 flags, int mode, SV *callback = 0)
215 root 1.14 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
216 root 1.56 db_close (DB *db, U32 flags = 0, SV *callback = 0)
217 root 1.14 flags: DB_NOSYNC
218 root 1.56 db_verify (DB *db, bdb_filename file, bdb_filename database = 0, SV *dummy = 0, U32 flags = 0, SV *callback = 0)
219     db_upgrade (DB *db, bdb_filename file, U32 flags = 0, SV *callback = 0)
220     db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = 0)
221 root 1.14 flags: FREELIST_ONLY FREE_SPACE
222 root 1.56 db_sync (DB *db, U32 flags = 0, SV *callback = 0)
223     db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = 0)
224     db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = 0)
225 root 1.14 flags: APPEND NODUPDATA NOOVERWRITE
226 root 1.45 db_exists (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = 0) (v4.6)
227 root 1.56 db_get (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = 0)
228 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
229 root 1.56 db_pget (DB *db, DB_TXN_ornull *txn, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = 0)
230 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
231 root 1.56 db_del (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = 0)
232     db_txn_commit (DB_TXN *txn, U32 flags = 0, SV *callback = 0)
233 root 1.14 flags: TXN_NOSYNC TXN_SYNC
234 root 1.56 db_txn_abort (DB_TXN *txn, SV *callback = 0)
235 root 1.14
236 root 1.56 db_c_close (DBC *dbc, SV *callback = 0)
237     db_c_count (DBC *dbc, SV *count, U32 flags = 0, SV *callback = 0)
238     db_c_put (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = 0)
239 root 1.14 flags: AFTER BEFORE CURRENT KEYFIRST KEYLAST NODUPDATA
240 root 1.56 db_c_get (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = 0)
241 root 1.14 flags: CURRENT FIRST GET_BOTH GET_BOTH_RANGE GET_RECNO JOIN_ITEM LAST NEXT NEXT_DUP NEXT_NODUP PREV PREV_DUP PREV_NODUP SET SET_RANGE SET_RECNO READ_UNCOMMITTED MULTIPLE MULTIPLE_KEY RMW
242 root 1.56 db_c_pget (DBC *dbc, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = 0)
243     db_c_del (DBC *dbc, U32 flags = 0, SV *callback = 0)
244 root 1.10
245 root 1.56 db_sequence_open (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, SV *key, U32 flags = 0, SV *callback = 0)
246 root 1.14 flags: CREATE EXCL
247 root 1.56 db_sequence_close (DB_SEQUENCE *seq, U32 flags = 0, SV *callback = 0)
248     db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = 0)
249 root 1.14 flags: TXN_NOSYNC
250 root 1.56 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = 0)
251 root 1.14 flags: TXN_NOSYNC
252 root 1.10
253 root 1.56 =head3 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = 0)
254 root 1.15
255 root 1.16 This is not actually a Berkeley DB function but a BDB module
256     extension. The background for this exytension is: It is very annoying to
257     have to check every single BDB function for error returns and provide a
258     codepath out of your transaction. While the BDB module still makes this
259     possible, it contains the following extensions:
260 root 1.15
261     When a transaction-protected function returns any operating system
262     error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
263 root 1.16 transaction. This flag is also set by Berkeley DB functions themselves
264 root 1.15 when an operation fails with LOCK_DEADLOCK, and it causes all further
265     operations on that transaction (including C<db_txn_commit>) to fail.
266    
267     The C<db_txn_finish> request will look at this flag, and, if it is set,
268     will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
269 root 1.16 if it isn't set to something else yet). If it isn't set, it will call
270     C<db_txn_commit> and return the error normally.
271 root 1.15
272     How to use this? Easy: just write your transaction normally:
273    
274     my $txn = $db_env->txn_begin;
275     db_get $db, $txn, "key", my $data;
276     db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
277     db_txn_finish $txn;
278     die "transaction failed" if $!;
279    
280     That is, handle only the expected errors. If something unexpected happens
281     (EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
282     requests (db_put in this case) will simply be skipped (they will fail with
283     LOCK_DEADLOCK) and the transaction will be aborted.
284    
285 root 1.16 You can use the C<< $txn->failed >> method to check wether a transaction
286 root 1.15 has failed in this way and abort further processing (excluding
287     C<db_txn_finish>).
288    
289 root 1.54
290 root 1.53 =head2 DB_ENV/database environment methods
291 root 1.10
292     Methods available on DB_ENV/$env handles:
293    
294     DESTROY (DB_ENV_ornull *env)
295     CODE:
296     if (env)
297     env->close (env, 0);
298    
299     $int = $env->set_data_dir (const char *dir)
300     $int = $env->set_tmp_dir (const char *dir)
301     $int = $env->set_lg_dir (const char *dir)
302     $int = $env->set_shm_key (long shm_key)
303     $int = $env->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
304 root 1.39 $int = $env->set_flags (U32 flags, int onoff = 1)
305 root 1.45 $int = $env->log_set_config (U32 flags, int onoff = 1) (v4.7)
306     $int = $env->set_intermediate_dir_mode (const char *modestring) (v4.7)
307 root 1.10 $env->set_errfile (FILE *errfile = 0)
308     $env->set_msgfile (FILE *msgfile = 0)
309     $int = $env->set_verbose (U32 which, int onoff = 1)
310     $int = $env->set_encrypt (const char *password, U32 flags = 0)
311 root 1.15 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
312 root 1.10 $int = $env->set_mp_max_openfd (int maxopenfd);
313     $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
314     $int = $env->set_mp_mmapsize (int mmapsize_mb)
315     $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
316     $int = $env->set_lk_max_lockers (U32 max)
317     $int = $env->set_lk_max_locks (U32 max)
318     $int = $env->set_lk_max_objects (U32 max)
319     $int = $env->set_lg_bsize (U32 max)
320     $int = $env->set_lg_max (U32 max)
321 root 1.20 $int = $env->mutex_set_increment (U32 increment)
322     $int = $env->mutex_set_tas_spins (U32 tas_spins)
323     $int = $env->mutex_set_max (U32 max)
324     $int = $env->mutex_set_align (U32 align)
325 root 1.10
326     $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
327 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
328 root 1.45 $txn = $env->cdsgroup_begin; (v4.5)
329 root 1.10
330 root 1.53 =head3 Example:
331 root 1.10
332     use AnyEvent;
333     use BDB;
334    
335     our $FH; open $FH, "<&=" . BDB::poll_fileno;
336     our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
337    
338     BDB::min_parallel 8;
339    
340     my $env = db_env_create;
341    
342     mkdir "bdtest", 0700;
343     db_env_open
344     $env,
345     "bdtest",
346     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
347     0600;
348    
349     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
350    
351    
352 root 1.53 =head2 DB/database methods
353 root 1.10
354     Methods available on DB/$db handles:
355    
356     DESTROY (DB_ornull *db)
357     CODE:
358     if (db)
359     {
360     SV *env = (SV *)db->app_private;
361     db->close (db, 0);
362     SvREFCNT_dec (env);
363     }
364    
365     $int = $db->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
366     $int = $db->set_flags (U32 flags)
367 root 1.14 flags: CHKSUM ENCRYPT TXN_NOT_DURABLE
368     Btree: DUP DUPSORT RECNUM REVSPLITOFF
369     Hash: DUP DUPSORT
370     Queue: INORDER
371     Recno: RENUMBER SNAPSHOT
372    
373 root 1.10 $int = $db->set_encrypt (const char *password, U32 flags)
374     $int = $db->set_lorder (int lorder)
375     $int = $db->set_bt_minkey (U32 minkey)
376     $int = $db->set_re_delim (int delim)
377     $int = $db->set_re_pad (int re_pad)
378     $int = $db->set_re_source (char *source)
379     $int = $db->set_re_len (U32 re_len)
380     $int = $db->set_h_ffactor (U32 h_ffactor)
381     $int = $db->set_h_nelem (U32 h_nelem)
382     $int = $db->set_q_extentsize (U32 extentsize)
383    
384     $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0)
385 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT
386 root 1.10 $seq = $db->sequence (U32 flags = 0)
387    
388 root 1.53 =head3 Example:
389 root 1.10
390     my $db = db_create $env;
391     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600;
392    
393     for (1..1000) {
394     db_put $db, undef, "key $_", "data $_";
395    
396     db_key_range $db, undef, "key $_", my $keyrange;
397     my ($lt, $eq, $gt) = @$keyrange;
398     }
399    
400     db_del $db, undef, "key $_" for 1..1000;
401    
402     db_sync $db;
403    
404    
405 root 1.53 =head2 DB_TXN/transaction methods
406 root 1.10
407     Methods available on DB_TXN/$txn handles:
408    
409     DESTROY (DB_TXN_ornull *txn)
410     CODE:
411     if (txn)
412     txn->abort (txn);
413    
414 root 1.15 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
415 root 1.14 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
416 root 1.10
417 root 1.15 $bool = $txn->failed
418     # see db_txn_finish documentation, above
419    
420 root 1.10
421 root 1.53 =head2 DBC/cursor methods
422 root 1.10
423     Methods available on DBC/$dbc handles:
424    
425     DESTROY (DBC_ornull *dbc)
426     CODE:
427     if (dbc)
428     dbc->c_close (dbc);
429    
430 root 1.45 $int = $cursor->set_priority ($priority = PRIORITY_*) (v4.6)
431 root 1.28
432 root 1.53 =head3 Example:
433 root 1.10
434     my $c = $db->cursor;
435    
436     for (;;) {
437     db_c_get $c, my $key, my $data, BDB::NEXT;
438     warn "<$!,$key,$data>";
439     last if $!;
440     }
441    
442     db_c_close $c;
443    
444 root 1.12
445 root 1.53 =head2 DB_SEQUENCE/sequence methods
446 root 1.10
447     Methods available on DB_SEQUENCE/$seq handles:
448    
449     DESTROY (DB_SEQUENCE_ornull *seq)
450     CODE:
451     if (seq)
452     seq->close (seq, 0);
453    
454     $int = $seq->initial_value (db_seq_t value)
455     $int = $seq->set_cachesize (U32 size)
456     $int = $seq->set_flags (U32 flags)
457 root 1.14 flags: SEQ_DEC SEQ_INC SEQ_WRAP
458 root 1.10 $int = $seq->set_range (db_seq_t min, db_seq_t max)
459    
460 root 1.53 =head3 Example:
461 root 1.10
462     my $seq = $db->sequence;
463    
464     db_sequence_open $seq, undef, "seq", BDB::CREATE;
465     db_sequence_get $seq, undef, 1, my $value;
466    
467    
468 root 1.53 =head1 SUPPORT FUNCTIONS
469 root 1.1
470 root 1.53 =head2 EVENT PROCESSING AND EVENT LOOP INTEGRATION
471 root 1.1
472     =over 4
473    
474 root 1.23 =item $msg = BDB::strerror [$errno]
475    
476     Returns the string corresponding to the given errno value. If no argument
477     is given, use C<$!>.
478    
479 root 1.33 Note that the BDB module also patches the C<$!> variable directly, so you
480     should be able to get a bdb error string by simply stringifying C<$!>.
481    
482 root 1.2 =item $fileno = BDB::poll_fileno
483 root 1.1
484     Return the I<request result pipe file descriptor>. This filehandle must be
485     polled for reading by some mechanism outside this module (e.g. Event or
486     select, see below or the SYNOPSIS). If the pipe becomes readable you have
487     to call C<poll_cb> to check the results.
488    
489     See C<poll_cb> for an example.
490    
491 root 1.2 =item BDB::poll_cb
492 root 1.1
493     Process some outstanding events on the result pipe. You have to call this
494     regularly. Returns the number of events processed. Returns immediately
495     when no events are outstanding. The amount of events processed depends on
496 root 1.2 the settings of C<BDB::max_poll_req> and C<BDB::max_poll_time>.
497 root 1.1
498     If not all requests were processed for whatever reason, the filehandle
499     will still be ready when C<poll_cb> returns.
500    
501     Example: Install an Event watcher that automatically calls
502 root 1.2 BDB::poll_cb with high priority:
503 root 1.1
504 root 1.2 Event->io (fd => BDB::poll_fileno,
505 root 1.1 poll => 'r', async => 1,
506 root 1.2 cb => \&BDB::poll_cb);
507 root 1.1
508 root 1.2 =item BDB::max_poll_reqs $nreqs
509 root 1.1
510 root 1.2 =item BDB::max_poll_time $seconds
511 root 1.1
512     These set the maximum number of requests (default C<0>, meaning infinity)
513 root 1.2 that are being processed by C<BDB::poll_cb> in one call, respectively
514 root 1.1 the maximum amount of time (default C<0>, meaning infinity) spent in
515 root 1.2 C<BDB::poll_cb> to process requests (more correctly the mininum amount
516 root 1.1 of time C<poll_cb> is allowed to use).
517    
518     Setting C<max_poll_time> to a non-zero value creates an overhead of one
519     syscall per request processed, which is not normally a problem unless your
520     callbacks are really really fast or your OS is really really slow (I am
521     not mentioning Solaris here). Using C<max_poll_reqs> incurs no overhead.
522    
523     Setting these is useful if you want to ensure some level of
524     interactiveness when perl is not fast enough to process all requests in
525     time.
526    
527     For interactive programs, values such as C<0.01> to C<0.1> should be fine.
528    
529 root 1.18 Example: Install an EV watcher that automatically calls
530 root 1.2 BDB::poll_cb with low priority, to ensure that other parts of the
531 root 1.18 program get the CPU sometimes even under high load.
532 root 1.1
533     # try not to spend much more than 0.1s in poll_cb
534 root 1.2 BDB::max_poll_time 0.1;
535 root 1.1
536 root 1.18 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
537 root 1.1
538 root 1.2 =item BDB::poll_wait
539 root 1.1
540     If there are any outstanding requests and none of them in the result
541     phase, wait till the result filehandle becomes ready for reading (simply
542     does a C<select> on the filehandle. This is useful if you want to
543     synchronously wait for some requests to finish).
544    
545     See C<nreqs> for an example.
546    
547 root 1.2 =item BDB::poll
548 root 1.1
549     Waits until some requests have been handled.
550    
551     Returns the number of requests processed, but is otherwise strictly
552     equivalent to:
553    
554 root 1.2 BDB::poll_wait, BDB::poll_cb
555 root 1.1
556 root 1.2 =item BDB::flush
557 root 1.1
558 root 1.18 Wait till all outstanding BDB requests have been handled.
559 root 1.1
560     Strictly equivalent to:
561    
562 root 1.2 BDB::poll_wait, BDB::poll_cb
563     while BDB::nreqs;
564 root 1.1
565 root 1.8 =back
566    
567 root 1.53 =head2 VERSION CHECKING
568 root 1.39
569     BerkeleyDB comes in various versions, many of them have minor
570     incompatibilities. This means that traditional "at least version x.x"
571     checks are often not sufficient.
572    
573 root 1.47 Example: set the log_autoremove option in a way compatible with <v4.7 and
574 root 1.40 v4.7. Note the use of & on the constants to avoid triggering a compiletime
575     bug when the symbol isn't available.
576    
577     $DB_ENV->set_flags (&BDB::LOG_AUTOREMOVE ) if BDB::VERSION v0, v4.7;
578     $DB_ENV->log_set_config (&BDB::LOG_AUTO_REMOVE) if BDB::VERSION v4.7;
579    
580 root 1.39 =over 4
581    
582     =item BDB::VERSION
583    
584     The C<BDB::VERSION> function, when called without arguments, returns the
585     Berkeley DB version as a v-string (usually with 3 components). You should
586     use C<lt> and C<ge> operators exclusively to make comparisons.
587    
588     Example: check for at least version 4.7.
589    
590     BDB::VERSION ge v4.7 or die;
591    
592     =item BDB::VERSION min-version
593    
594     Returns true if the BDB version is at least the given version (specified
595     as a v-string), false otherwise.
596    
597     Example: check for at least version 4.5.
598    
599     BDB::VERSION v4.7 or die;
600    
601     =item BDB::VERSION min-version, max-version
602    
603     Returns true of the BDB version is at least version C<min-version> (specify C<undef> or C<v0> for any minimum version)
604     and less then C<max-version>.
605    
606     Example: check wether version is strictly less then v4.7.
607    
608     BDB::VERSION v0, v4.7
609     or die "version 4.7 is not yet supported";
610    
611     =back
612    
613     =cut
614    
615     sub VERSION {
616 root 1.46 # I was dumb enough to override the VERSION method here, so let's try
617     # to fix it up.
618    
619     if ($_[0] eq __PACKAGE__) {
620     $VERSION
621     } else {
622     if (@_ > 0) {
623     return undef if VERSION_v lt $_[0];
624     if (@_ > 1) {
625     return undef if VERSION_v ge $_[1];
626     }
627 root 1.39 }
628 root 1.46
629     VERSION_v
630 root 1.39 }
631     }
632    
633 root 1.53 =head2 CONTROLLING THE NUMBER OF THREADS
634 root 1.1
635 root 1.8 =over 4
636    
637 root 1.2 =item BDB::min_parallel $nthreads
638 root 1.1
639 root 1.18 Set the minimum number of BDB threads to C<$nthreads>. The current
640 root 1.1 default is C<8>, which means eight asynchronous operations can execute
641     concurrently at any one time (the number of outstanding requests,
642     however, is unlimited).
643    
644 root 1.18 BDB starts threads only on demand, when an BDB request is queued and
645 root 1.1 no free thread exists. Please note that queueing up a hundred requests can
646     create demand for a hundred threads, even if it turns out that everything
647     is in the cache and could have been processed faster by a single thread.
648    
649     It is recommended to keep the number of threads relatively low, as some
650     Linux kernel versions will scale negatively with the number of threads
651     (higher parallelity => MUCH higher latency). With current Linux 2.6
652     versions, 4-32 threads should be fine.
653    
654     Under most circumstances you don't need to call this function, as the
655     module selects a default that is suitable for low to moderate load.
656    
657 root 1.2 =item BDB::max_parallel $nthreads
658 root 1.1
659 root 1.18 Sets the maximum number of BDB threads to C<$nthreads>. If more than the
660 root 1.1 specified number of threads are currently running, this function kills
661     them. This function blocks until the limit is reached.
662    
663     While C<$nthreads> are zero, aio requests get queued but not executed
664     until the number of threads has been increased again.
665    
666     This module automatically runs C<max_parallel 0> at program end, to ensure
667     that all threads are killed and that there are no outstanding requests.
668    
669     Under normal circumstances you don't need to call this function.
670    
671 root 1.2 =item BDB::max_idle $nthreads
672 root 1.1
673     Limit the number of threads (default: 4) that are allowed to idle (i.e.,
674     threads that did not get a request to process within 10 seconds). That
675     means if a thread becomes idle while C<$nthreads> other threads are also
676     idle, it will free its resources and exit.
677    
678     This is useful when you allow a large number of threads (e.g. 100 or 1000)
679     to allow for extremely high load situations, but want to free resources
680     under normal circumstances (1000 threads can easily consume 30MB of RAM).
681    
682     The default is probably ok in most situations, especially if thread
683     creation is fast. If thread creation is very slow on your system you might
684     want to use larger values.
685    
686 root 1.2 =item $oldmaxreqs = BDB::max_outstanding $maxreqs
687 root 1.1
688     This is a very bad function to use in interactive programs because it
689     blocks, and a bad way to reduce concurrency because it is inexact: Better
690     use an C<aio_group> together with a feed callback.
691    
692     Sets the maximum number of outstanding requests to C<$nreqs>. If you
693     to queue up more than this number of requests, the next call to the
694     C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
695     function will block until the limit is no longer exceeded.
696    
697     The default value is very large, so there is no practical limit on the
698     number of outstanding requests.
699    
700     You can still queue as many requests as you want. Therefore,
701     C<max_oustsanding> is mainly useful in simple scripts (with low values) or
702     as a stop gap to shield against fatal memory overflow (with large values).
703    
704 root 1.50 =item $old_cb = BDB::set_sync_prepare $cb
705 root 1.3
706     Sets a callback that is called whenever a request is created without an
707     explicit callback. It has to return two code references. The first is used
708 root 1.37 as the request callback (it should save the return status), and the second
709     is called to wait until the first callback has been called (it must set
710     C<$!> to the return status).
711    
712     This mechanism can be used to include BDB into other event mechanisms,
713 root 1.50 such as L<Coro::BDB>.
714 root 1.37
715 root 1.50 To allow other, callback-based, events to be executed while callback-less
716     ones are run, you could use this sync prepare function:
717 root 1.3
718     sub {
719     my $status;
720     (
721     sub { $status = $! },
722     sub { BDB::poll while !defined $status; $! = $status },
723     )
724     }
725    
726 root 1.50 It works by polling for results till the request has finished and then
727     sets C<$!> to the return value. This means that if you don't use a
728     callback, BDB would simply fall back to synchronous operations.
729    
730     By default, or if the sync prepare function is set to C<undef>, is to
731     execute callback-less BDB requests in the foreground thread, setting C<$!>
732     to the return value, without polling for other events.
733 root 1.37
734 root 1.3 =back
735    
736 root 1.53 =head2 STATISTICAL INFORMATION
737 root 1.1
738 root 1.3 =over 4
739    
740 root 1.2 =item BDB::nreqs
741 root 1.1
742     Returns the number of requests currently in the ready, execute or pending
743     states (i.e. for which their callback has not been invoked yet).
744    
745     Example: wait till there are no outstanding requests anymore:
746    
747 root 1.2 BDB::poll_wait, BDB::poll_cb
748     while BDB::nreqs;
749 root 1.1
750 root 1.2 =item BDB::nready
751 root 1.1
752     Returns the number of requests currently in the ready state (not yet
753     executed).
754    
755 root 1.2 =item BDB::npending
756 root 1.1
757     Returns the number of requests currently in the pending state (executed,
758     but not yet processed by poll_cb).
759    
760     =back
761    
762     =cut
763    
764 root 1.50 set_sync_prepare (undef);
765 root 1.3
766 root 1.1 min_parallel 8;
767    
768     END { flush }
769    
770     1;
771    
772 root 1.53 =head1 COMMON PITFALLS
773    
774     =head2 Unexpected Crashes
775    
776     Remember that, by default, BDB will execute requests in parallel, in
777     somewhat random order. That means that it is easy to run a C<db_get>
778 root 1.65 request on the same database as a concurrent C<db_close> request, leading
779 root 1.53 to a crash, silent data corruption, eventually the next world war on
780     terrorism.
781    
782     If you only ever use foreground requests (without a callback), this will
783 root 1.65 not be an issue (unless you use threads).
784 root 1.53
785     =head2 Unexpected Freezes or Deadlocks
786    
787     Remember that, by default, BDB will execute requests in parallel, which
788     easily leads to deadlocks (even concurrent put's on the same database can
789     deadlock).
790    
791     You either need to run deadlock detection (and handle the resulting
792     errors), or make sure only one process ever updates the database, ine one
793     thread, e.g. by using only foreground requests (without a callback).
794    
795     =head1 FORK BEHAVIOUR
796 root 1.1
797     This module should do "the right thing" when the process using it forks:
798    
799 root 1.18 Before the fork, BDB enters a quiescent state where no requests
800 root 1.1 can be added in other threads and no results will be processed. After
801     the fork the parent simply leaves the quiescent state and continues
802     request/result processing, while the child frees the request/result queue
803     (so that the requests started before the fork will only be handled in the
804     parent). Threads will be started on demand until the limit set in the
805     parent process has been reached again.
806    
807     In short: the parent will, after a short pause, continue as if fork had
808 root 1.18 not been called, while the child will act as if BDB has not been used
809 root 1.1 yet.
810    
811 root 1.26 Win32 note: there is no fork on win32, and perls emulation of it is too
812     broken to be supported, so do not use BDB in a windows pseudo-fork, better
813     yet, switch to a more capable platform.
814    
815 root 1.53 =head1 MEMORY USAGE
816 root 1.1
817     Per-request usage:
818    
819     Each aio request uses - depending on your architecture - around 100-200
820     bytes of memory. In addition, stat requests need a stat buffer (possibly
821     a few hundred bytes), readdir requires a result buffer and so on. Perl
822     scalars and other data passed into aio requests will also be locked and
823     will consume memory till the request has entered the done state.
824    
825 root 1.13 This is not awfully much, so queuing lots of requests is not usually a
826 root 1.1 problem.
827    
828     Per-thread usage:
829    
830     In the execution phase, some aio requests require more memory for
831     temporary buffers, and each thread requires a stack and other data
832     structures (usually around 16k-128k, depending on the OS).
833    
834 root 1.53 =head1 WIN32 FILENAMES/DATABASE NAME MESS
835    
836     Perl on Win32 supports only ASCII filenames (the reason is that it abuses
837     an internal flag to store wether a filename is Unicode or ANSI, but that
838     flag is used for somethign else in the perl core, so there is no way to
839     detect wether a filename is ANSI or Unicode-encoded). The BDB module
840     tries to work around this issue by assuming that the filename is an ANSI
841     filename and BDB was built for unicode support.
842    
843 root 1.1 =head1 KNOWN BUGS
844    
845 root 1.15 Known bugs will be fixed in the next release, except:
846    
847     If you use a transaction in any request, and the request returns
848     with an operating system error or DB_LOCK_NOTGRANTED, the internal
849     TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
850     above.
851 root 1.1
852     =head1 SEE ALSO
853    
854 root 1.36 L<AnyEvent::BDB> (event loop integration), L<Coro::BDB> (more natural
855     syntax), L<IO::AIO> (nice to have).
856 root 1.1
857     =head1 AUTHOR
858    
859     Marc Lehmann <schmorp@schmorp.de>
860     http://home.schmorp.de/
861    
862     =cut
863