ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
Revision: 1.48
Committed: Tue Jul 29 03:33:16 2008 UTC (15 years, 9 months ago) by root
Branch: MAIN
CVS Tags: rel-1_71
Changes since 1.47: +1 -1 lines
Log Message:
1.71

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.2 BDB - Asynchronous Berkeley DB access
4 root 1.1
5     =head1 SYNOPSIS
6    
7 root 1.2 use BDB;
8 root 1.1
9 root 1.19 my $env = db_env_create;
10    
11     mkdir "bdtest", 0700;
12     db_env_open
13     $env,
14     "bdtest",
15     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16     | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17     0600;
18    
19     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20    
21     my $db = db_create $env;
22     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23     | BDB::READ_UNCOMMITTED, 0600;
24     db_put $db, undef, "key", "data", 0, sub {
25     db_del $db, undef, "key";
26     };
27     db_sync $db;
28    
29 root 1.28 # when you also use Coro, management is easy:
30     use Coro::BDB;
31    
32 root 1.36 # automatic event loop intergration with AnyEvent:
33     use AnyEvent::BDB;
34 root 1.19
35     # automatic result processing with EV:
36     my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
37    
38     # with Glib:
39     add_watch Glib::IO BDB::poll_fileno,
40     in => sub { BDB::poll_cb; 1 };
41    
42     # or simply flush manually
43     BDB::flush;
44    
45    
46 root 1.1 =head1 DESCRIPTION
47    
48 root 1.10 See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
49 root 1.12 The BDB API is very similar to the C API (the translation has been very faithful).
50 root 1.10
51     See also the example sections in the document below and possibly the eg/
52     subdirectory of the BDB distribution. Last not least see the IO::AIO
53     documentation, as that module uses almost the same asynchronous request
54     model as this module.
55    
56     I know this is woefully inadequate documentation. Send a patch!
57 root 1.7
58 root 1.1
59     =head1 REQUEST ANATOMY AND LIFETIME
60    
61     Every request method creates a request. which is a C data structure not
62     directly visible to Perl.
63    
64     During their existance, bdb requests travel through the following states,
65     in order:
66    
67     =over 4
68    
69     =item ready
70    
71     Immediately after a request is created it is put into the ready state,
72     waiting for a thread to execute it.
73    
74     =item execute
75    
76     A thread has accepted the request for processing and is currently
77     executing it (e.g. blocking in read).
78    
79     =item pending
80    
81     The request has been executed and is waiting for result processing.
82    
83     While request submission and execution is fully asynchronous, result
84     processing is not and relies on the perl interpreter calling C<poll_cb>
85     (or another function with the same effect).
86    
87     =item result
88    
89     The request results are processed synchronously by C<poll_cb>.
90    
91     The C<poll_cb> function will process all outstanding aio requests by
92     calling their callbacks, freeing memory associated with them and managing
93     any groups they are contained in.
94    
95     =item done
96    
97     Request has reached the end of its lifetime and holds no resources anymore
98     (except possibly for the Perl object, but its connection to the actual
99     aio request is severed and calling its methods will either do nothing or
100     result in a runtime error).
101    
102     =back
103    
104     =cut
105    
106 root 1.2 package BDB;
107 root 1.1
108     no warnings;
109     use strict 'vars';
110    
111     use base 'Exporter';
112    
113 root 1.46 our $VERSION;
114    
115 root 1.1 BEGIN {
116 root 1.48 $VERSION = '1.71';
117 root 1.1
118 root 1.3 our @BDB_REQ = qw(
119 root 1.6 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
120 root 1.38 db_env_memp_sync db_env_memp_trickle db_env_dbrename db_env_dbremove
121 root 1.25 db_open db_close db_compact db_sync db_upgrade
122 root 1.44 db_put db_exists db_get db_pget db_del db_key_range
123 root 1.15 db_txn_commit db_txn_abort db_txn_finish
124 root 1.5 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
125 root 1.6 db_sequence_open db_sequence_close
126     db_sequence_get db_sequence_remove
127     );
128     our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
129     our @EXPORT_OK = qw(
130     poll_fileno poll_cb poll_wait flush
131     min_parallel max_parallel max_idle
132     nreqs nready npending nthreads
133     max_poll_time max_poll_reqs
134 root 1.3 );
135 root 1.1
136     require XSLoader;
137 root 1.2 XSLoader::load ("BDB", $VERSION);
138 root 1.1 }
139    
140 root 1.35 =head2 WIN32 FILENAMES/DATABASE NAME MESS
141 root 1.34
142 root 1.35 Perl on Win32 supports only ASCII filenames (the reason is that it abuses
143     an internal flag to store wether a filename is Unicode or ANSI, but that
144     flag is used for somethign else in the perl core, so there is no way to
145     detect wether a filename is ANSI or Unicode-encoded). The BDB module
146     tries to work around this issue by assuming that the filename is an ANSI
147     filename and BDB was built for unicode support.
148 root 1.34
149 root 1.10 =head2 BERKELEYDB FUNCTIONS
150    
151     All of these are functions. The create functions simply return a new
152 root 1.34 object and never block. All the remaining functions take an optional
153 root 1.24 callback as last argument. If it is missing, then the function will be
154     executed synchronously. In both cases, C<$!> will reflect the return value
155     of the function.
156 root 1.10
157     BDB functions that cannot block (mostly functions that manipulate
158     settings) are method calls on the relevant objects, so the rule of thumb
159 root 1.34 is: if it's a method, it's not blocking, if it's a function, it takes a
160 root 1.10 callback as last argument.
161    
162     In the following, C<$int> signifies an integer return value,
163 root 1.38 C<bdb_filename> is a "filename" (octets on unix, madness on windows),
164     C<U32> is an unsigned 32 bit integer, C<int> is some integer, C<NV> is a
165     floating point value.
166 root 1.10
167 root 1.41 Most C<SV *> types are generic perl scalars (for input and output of data
168     values).
169 root 1.10
170 root 1.11 The various C<DB_ENV> etc. arguments are handles return by
171     C<db_env_create>, C<db_create>, C<txn_begin> and so on. If they have an
172     appended C<_ornull> this means they are optional and you can pass C<undef>
173     for them, resulting a NULL pointer on the C level.
174 root 1.10
175 root 1.41 The C<SV *callback> is the optional callback function to call when the
176     request is completed. This last callback argument is special: the callback
177     is simply the last argument passed. If there are "optional" arguments
178     before the callback they can be left out. The callback itself can be left
179     out or specified as C<undef>, in which case the function will be executed
180     synchronously.
181    
182     For example, C<db_env_txn_checkpoint> usually is called with all integer
183     arguments zero. These can be left out, so all of these specify a call
184     to C<< DB_ENV->txn_checkpoint >>, to be executed asynchronously with a
185     callback to be called:
186    
187     db_env_txn_checkpoint $db_env, 0, 0, 0, sub { };
188     db_env_txn_checkpoint $db_env, 0, 0, sub { };
189     db_env_txn_checkpoint $db_env, sub { };
190    
191     While these all specify a call to C<< DB_ENV->txn_checkpoint >> to be
192     executed synchronously:
193    
194     db_env_txn_checkpoint $db_env, 0, 0, 0, undef;
195     db_env_txn_checkpoint $db_env, 0, 0, 0;
196     db_env_txn_checkpoint $db_env, 0;
197    
198 root 1.10 =head3 BDB functions
199    
200     Functions in the BDB namespace, exported by default:
201    
202     $env = db_env_create (U32 env_flags = 0)
203 root 1.14 flags: RPCCLIENT
204 root 1.10
205 root 1.38 db_env_open (DB_ENV *env, bdb_filename db_home, U32 open_flags, int mode, SV *callback = &PL_sv_undef)
206 root 1.14 open_flags: INIT_CDB INIT_LOCK INIT_LOG INIT_MPOOL INIT_REP INIT_TXN RECOVER RECOVER_FATAL USE_ENVIRON USE_ENVIRON_ROOT CREATE LOCKDOWN PRIVATE REGISTER SYSTEM_MEM
207 root 1.10 db_env_close (DB_ENV *env, U32 flags = 0, SV *callback = &PL_sv_undef)
208     db_env_txn_checkpoint (DB_ENV *env, U32 kbyte = 0, U32 min = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
209 root 1.14 flags: FORCE
210 root 1.10 db_env_lock_detect (DB_ENV *env, U32 flags = 0, U32 atype = DB_LOCK_DEFAULT, SV *dummy = 0, SV *callback = &PL_sv_undef)
211 root 1.14 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST
212 root 1.10 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = &PL_sv_undef)
213     db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = &PL_sv_undef)
214 root 1.38 db_env_dbremove (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, U32 flags = 0, SV *callback = &PL_sv_undef)
215     db_env_dbrename (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, bdb_filename newname, U32 flags = 0, SV *callback = &PL_sv_undef)
216 root 1.10
217     $db = db_create (DB_ENV *env = 0, U32 flags = 0)
218 root 1.14 flags: XA_CREATE
219 root 1.10
220 root 1.38 db_open (DB *db, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
221 root 1.14 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
222 root 1.10 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
223 root 1.14 flags: DB_NOSYNC
224 root 1.38 db_upgrade (DB *db, bdb_filename file, U32 flags = 0, SV *callback = &PL_sv_undef)
225 root 1.12 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef)
226 root 1.14 flags: FREELIST_ONLY FREE_SPACE
227 root 1.10 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
228     db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef)
229     db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
230 root 1.14 flags: APPEND NODUPDATA NOOVERWRITE
231 root 1.45 db_exists (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = 0) (v4.6)
232 root 1.10 db_get (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
233 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
234 root 1.10 db_pget (DB *db, DB_TXN_ornull *txn, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
235 root 1.14 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
236 root 1.10 db_del (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
237     db_txn_commit (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
238 root 1.14 flags: TXN_NOSYNC TXN_SYNC
239 root 1.10 db_txn_abort (DB_TXN *txn, SV *callback = &PL_sv_undef)
240 root 1.14
241 root 1.10 db_c_close (DBC *dbc, SV *callback = &PL_sv_undef)
242     db_c_count (DBC *dbc, SV *count, U32 flags = 0, SV *callback = &PL_sv_undef)
243     db_c_put (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
244 root 1.14 flags: AFTER BEFORE CURRENT KEYFIRST KEYLAST NODUPDATA
245 root 1.10 db_c_get (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
246 root 1.14 flags: CURRENT FIRST GET_BOTH GET_BOTH_RANGE GET_RECNO JOIN_ITEM LAST NEXT NEXT_DUP NEXT_NODUP PREV PREV_DUP PREV_NODUP SET SET_RANGE SET_RECNO READ_UNCOMMITTED MULTIPLE MULTIPLE_KEY RMW
247 root 1.10 db_c_pget (DBC *dbc, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
248     db_c_del (DBC *dbc, U32 flags = 0, SV *callback = &PL_sv_undef)
249    
250     db_sequence_open (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
251 root 1.14 flags: CREATE EXCL
252 root 1.10 db_sequence_close (DB_SEQUENCE *seq, U32 flags = 0, SV *callback = &PL_sv_undef)
253     db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
254 root 1.14 flags: TXN_NOSYNC
255 root 1.10 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
256 root 1.14 flags: TXN_NOSYNC
257 root 1.10
258 root 1.15 =head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
259    
260 root 1.16 This is not actually a Berkeley DB function but a BDB module
261     extension. The background for this exytension is: It is very annoying to
262     have to check every single BDB function for error returns and provide a
263     codepath out of your transaction. While the BDB module still makes this
264     possible, it contains the following extensions:
265 root 1.15
266     When a transaction-protected function returns any operating system
267     error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
268 root 1.16 transaction. This flag is also set by Berkeley DB functions themselves
269 root 1.15 when an operation fails with LOCK_DEADLOCK, and it causes all further
270     operations on that transaction (including C<db_txn_commit>) to fail.
271    
272     The C<db_txn_finish> request will look at this flag, and, if it is set,
273     will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
274 root 1.16 if it isn't set to something else yet). If it isn't set, it will call
275     C<db_txn_commit> and return the error normally.
276 root 1.15
277     How to use this? Easy: just write your transaction normally:
278    
279     my $txn = $db_env->txn_begin;
280     db_get $db, $txn, "key", my $data;
281     db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
282     db_txn_finish $txn;
283     die "transaction failed" if $!;
284    
285     That is, handle only the expected errors. If something unexpected happens
286     (EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
287     requests (db_put in this case) will simply be skipped (they will fail with
288     LOCK_DEADLOCK) and the transaction will be aborted.
289    
290 root 1.16 You can use the C<< $txn->failed >> method to check wether a transaction
291 root 1.15 has failed in this way and abort further processing (excluding
292     C<db_txn_finish>).
293    
294 root 1.10 =head3 DB_ENV/database environment methods
295    
296     Methods available on DB_ENV/$env handles:
297    
298     DESTROY (DB_ENV_ornull *env)
299     CODE:
300     if (env)
301     env->close (env, 0);
302    
303     $int = $env->set_data_dir (const char *dir)
304     $int = $env->set_tmp_dir (const char *dir)
305     $int = $env->set_lg_dir (const char *dir)
306     $int = $env->set_shm_key (long shm_key)
307     $int = $env->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
308 root 1.39 $int = $env->set_flags (U32 flags, int onoff = 1)
309 root 1.45 $int = $env->log_set_config (U32 flags, int onoff = 1) (v4.7)
310     $int = $env->set_intermediate_dir_mode (const char *modestring) (v4.7)
311 root 1.10 $env->set_errfile (FILE *errfile = 0)
312     $env->set_msgfile (FILE *msgfile = 0)
313     $int = $env->set_verbose (U32 which, int onoff = 1)
314     $int = $env->set_encrypt (const char *password, U32 flags = 0)
315 root 1.15 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
316 root 1.10 $int = $env->set_mp_max_openfd (int maxopenfd);
317     $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
318     $int = $env->set_mp_mmapsize (int mmapsize_mb)
319     $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
320     $int = $env->set_lk_max_lockers (U32 max)
321     $int = $env->set_lk_max_locks (U32 max)
322     $int = $env->set_lk_max_objects (U32 max)
323     $int = $env->set_lg_bsize (U32 max)
324     $int = $env->set_lg_max (U32 max)
325 root 1.20 $int = $env->mutex_set_increment (U32 increment)
326     $int = $env->mutex_set_tas_spins (U32 tas_spins)
327     $int = $env->mutex_set_max (U32 max)
328     $int = $env->mutex_set_align (U32 align)
329 root 1.10
330     $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
331 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
332 root 1.45 $txn = $env->cdsgroup_begin; (v4.5)
333 root 1.10
334 root 1.12 =head4 Example:
335 root 1.10
336     use AnyEvent;
337     use BDB;
338    
339     our $FH; open $FH, "<&=" . BDB::poll_fileno;
340     our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
341    
342     BDB::min_parallel 8;
343    
344     my $env = db_env_create;
345    
346     mkdir "bdtest", 0700;
347     db_env_open
348     $env,
349     "bdtest",
350     BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
351     0600;
352    
353     $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
354    
355    
356     =head3 DB/database methods
357    
358     Methods available on DB/$db handles:
359    
360     DESTROY (DB_ornull *db)
361     CODE:
362     if (db)
363     {
364     SV *env = (SV *)db->app_private;
365     db->close (db, 0);
366     SvREFCNT_dec (env);
367     }
368    
369     $int = $db->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
370     $int = $db->set_flags (U32 flags)
371 root 1.14 flags: CHKSUM ENCRYPT TXN_NOT_DURABLE
372     Btree: DUP DUPSORT RECNUM REVSPLITOFF
373     Hash: DUP DUPSORT
374     Queue: INORDER
375     Recno: RENUMBER SNAPSHOT
376    
377 root 1.10 $int = $db->set_encrypt (const char *password, U32 flags)
378     $int = $db->set_lorder (int lorder)
379     $int = $db->set_bt_minkey (U32 minkey)
380     $int = $db->set_re_delim (int delim)
381     $int = $db->set_re_pad (int re_pad)
382     $int = $db->set_re_source (char *source)
383     $int = $db->set_re_len (U32 re_len)
384     $int = $db->set_h_ffactor (U32 h_ffactor)
385     $int = $db->set_h_nelem (U32 h_nelem)
386     $int = $db->set_q_extentsize (U32 extentsize)
387    
388     $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0)
389 root 1.14 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT
390 root 1.10 $seq = $db->sequence (U32 flags = 0)
391    
392 root 1.12 =head4 Example:
393 root 1.10
394     my $db = db_create $env;
395     db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600;
396    
397     for (1..1000) {
398     db_put $db, undef, "key $_", "data $_";
399    
400     db_key_range $db, undef, "key $_", my $keyrange;
401     my ($lt, $eq, $gt) = @$keyrange;
402     }
403    
404     db_del $db, undef, "key $_" for 1..1000;
405    
406     db_sync $db;
407    
408    
409     =head3 DB_TXN/transaction methods
410    
411     Methods available on DB_TXN/$txn handles:
412    
413     DESTROY (DB_TXN_ornull *txn)
414     CODE:
415     if (txn)
416     txn->abort (txn);
417    
418 root 1.15 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
419 root 1.14 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
420 root 1.10
421 root 1.15 $bool = $txn->failed
422     # see db_txn_finish documentation, above
423    
424 root 1.10
425     =head3 DBC/cursor methods
426    
427     Methods available on DBC/$dbc handles:
428    
429     DESTROY (DBC_ornull *dbc)
430     CODE:
431     if (dbc)
432     dbc->c_close (dbc);
433    
434 root 1.45 $int = $cursor->set_priority ($priority = PRIORITY_*) (v4.6)
435 root 1.28
436 root 1.12 =head4 Example:
437 root 1.10
438     my $c = $db->cursor;
439    
440     for (;;) {
441     db_c_get $c, my $key, my $data, BDB::NEXT;
442     warn "<$!,$key,$data>";
443     last if $!;
444     }
445    
446     db_c_close $c;
447    
448 root 1.12
449 root 1.10 =head3 DB_SEQUENCE/sequence methods
450    
451     Methods available on DB_SEQUENCE/$seq handles:
452    
453     DESTROY (DB_SEQUENCE_ornull *seq)
454     CODE:
455     if (seq)
456     seq->close (seq, 0);
457    
458     $int = $seq->initial_value (db_seq_t value)
459     $int = $seq->set_cachesize (U32 size)
460     $int = $seq->set_flags (U32 flags)
461 root 1.14 flags: SEQ_DEC SEQ_INC SEQ_WRAP
462 root 1.10 $int = $seq->set_range (db_seq_t min, db_seq_t max)
463    
464 root 1.12 =head4 Example:
465 root 1.10
466     my $seq = $db->sequence;
467    
468     db_sequence_open $seq, undef, "seq", BDB::CREATE;
469     db_sequence_get $seq, undef, 1, my $value;
470    
471    
472 root 1.1 =head2 SUPPORT FUNCTIONS
473    
474     =head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION
475    
476     =over 4
477    
478 root 1.23 =item $msg = BDB::strerror [$errno]
479    
480     Returns the string corresponding to the given errno value. If no argument
481     is given, use C<$!>.
482    
483 root 1.33 Note that the BDB module also patches the C<$!> variable directly, so you
484     should be able to get a bdb error string by simply stringifying C<$!>.
485    
486 root 1.2 =item $fileno = BDB::poll_fileno
487 root 1.1
488     Return the I<request result pipe file descriptor>. This filehandle must be
489     polled for reading by some mechanism outside this module (e.g. Event or
490     select, see below or the SYNOPSIS). If the pipe becomes readable you have
491     to call C<poll_cb> to check the results.
492    
493     See C<poll_cb> for an example.
494    
495 root 1.2 =item BDB::poll_cb
496 root 1.1
497     Process some outstanding events on the result pipe. You have to call this
498     regularly. Returns the number of events processed. Returns immediately
499     when no events are outstanding. The amount of events processed depends on
500 root 1.2 the settings of C<BDB::max_poll_req> and C<BDB::max_poll_time>.
501 root 1.1
502     If not all requests were processed for whatever reason, the filehandle
503     will still be ready when C<poll_cb> returns.
504    
505     Example: Install an Event watcher that automatically calls
506 root 1.2 BDB::poll_cb with high priority:
507 root 1.1
508 root 1.2 Event->io (fd => BDB::poll_fileno,
509 root 1.1 poll => 'r', async => 1,
510 root 1.2 cb => \&BDB::poll_cb);
511 root 1.1
512 root 1.2 =item BDB::max_poll_reqs $nreqs
513 root 1.1
514 root 1.2 =item BDB::max_poll_time $seconds
515 root 1.1
516     These set the maximum number of requests (default C<0>, meaning infinity)
517 root 1.2 that are being processed by C<BDB::poll_cb> in one call, respectively
518 root 1.1 the maximum amount of time (default C<0>, meaning infinity) spent in
519 root 1.2 C<BDB::poll_cb> to process requests (more correctly the mininum amount
520 root 1.1 of time C<poll_cb> is allowed to use).
521    
522     Setting C<max_poll_time> to a non-zero value creates an overhead of one
523     syscall per request processed, which is not normally a problem unless your
524     callbacks are really really fast or your OS is really really slow (I am
525     not mentioning Solaris here). Using C<max_poll_reqs> incurs no overhead.
526    
527     Setting these is useful if you want to ensure some level of
528     interactiveness when perl is not fast enough to process all requests in
529     time.
530    
531     For interactive programs, values such as C<0.01> to C<0.1> should be fine.
532    
533 root 1.18 Example: Install an EV watcher that automatically calls
534 root 1.2 BDB::poll_cb with low priority, to ensure that other parts of the
535 root 1.18 program get the CPU sometimes even under high load.
536 root 1.1
537     # try not to spend much more than 0.1s in poll_cb
538 root 1.2 BDB::max_poll_time 0.1;
539 root 1.1
540 root 1.18 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
541 root 1.1
542 root 1.2 =item BDB::poll_wait
543 root 1.1
544     If there are any outstanding requests and none of them in the result
545     phase, wait till the result filehandle becomes ready for reading (simply
546     does a C<select> on the filehandle. This is useful if you want to
547     synchronously wait for some requests to finish).
548    
549     See C<nreqs> for an example.
550    
551 root 1.2 =item BDB::poll
552 root 1.1
553     Waits until some requests have been handled.
554    
555     Returns the number of requests processed, but is otherwise strictly
556     equivalent to:
557    
558 root 1.2 BDB::poll_wait, BDB::poll_cb
559 root 1.1
560 root 1.2 =item BDB::flush
561 root 1.1
562 root 1.18 Wait till all outstanding BDB requests have been handled.
563 root 1.1
564     Strictly equivalent to:
565    
566 root 1.2 BDB::poll_wait, BDB::poll_cb
567     while BDB::nreqs;
568 root 1.1
569 root 1.8 =back
570    
571 root 1.39 =head3 VERSION CHECKING
572    
573     BerkeleyDB comes in various versions, many of them have minor
574     incompatibilities. This means that traditional "at least version x.x"
575     checks are often not sufficient.
576    
577 root 1.47 Example: set the log_autoremove option in a way compatible with <v4.7 and
578 root 1.40 v4.7. Note the use of & on the constants to avoid triggering a compiletime
579     bug when the symbol isn't available.
580    
581     $DB_ENV->set_flags (&BDB::LOG_AUTOREMOVE ) if BDB::VERSION v0, v4.7;
582     $DB_ENV->log_set_config (&BDB::LOG_AUTO_REMOVE) if BDB::VERSION v4.7;
583    
584 root 1.39 =over 4
585    
586     =item BDB::VERSION
587    
588     The C<BDB::VERSION> function, when called without arguments, returns the
589     Berkeley DB version as a v-string (usually with 3 components). You should
590     use C<lt> and C<ge> operators exclusively to make comparisons.
591    
592     Example: check for at least version 4.7.
593    
594     BDB::VERSION ge v4.7 or die;
595    
596     =item BDB::VERSION min-version
597    
598     Returns true if the BDB version is at least the given version (specified
599     as a v-string), false otherwise.
600    
601     Example: check for at least version 4.5.
602    
603     BDB::VERSION v4.7 or die;
604    
605     =item BDB::VERSION min-version, max-version
606    
607     Returns true of the BDB version is at least version C<min-version> (specify C<undef> or C<v0> for any minimum version)
608     and less then C<max-version>.
609    
610     Example: check wether version is strictly less then v4.7.
611    
612     BDB::VERSION v0, v4.7
613     or die "version 4.7 is not yet supported";
614    
615     =back
616    
617     =cut
618    
619     sub VERSION {
620 root 1.46 # I was dumb enough to override the VERSION method here, so let's try
621     # to fix it up.
622    
623     if ($_[0] eq __PACKAGE__) {
624     $VERSION
625     } else {
626     if (@_ > 0) {
627     return undef if VERSION_v lt $_[0];
628     if (@_ > 1) {
629     return undef if VERSION_v ge $_[1];
630     }
631 root 1.39 }
632 root 1.46
633     VERSION_v
634 root 1.39 }
635     }
636    
637 root 1.1 =head3 CONTROLLING THE NUMBER OF THREADS
638    
639 root 1.8 =over 4
640    
641 root 1.2 =item BDB::min_parallel $nthreads
642 root 1.1
643 root 1.18 Set the minimum number of BDB threads to C<$nthreads>. The current
644 root 1.1 default is C<8>, which means eight asynchronous operations can execute
645     concurrently at any one time (the number of outstanding requests,
646     however, is unlimited).
647    
648 root 1.18 BDB starts threads only on demand, when an BDB request is queued and
649 root 1.1 no free thread exists. Please note that queueing up a hundred requests can
650     create demand for a hundred threads, even if it turns out that everything
651     is in the cache and could have been processed faster by a single thread.
652    
653     It is recommended to keep the number of threads relatively low, as some
654     Linux kernel versions will scale negatively with the number of threads
655     (higher parallelity => MUCH higher latency). With current Linux 2.6
656     versions, 4-32 threads should be fine.
657    
658     Under most circumstances you don't need to call this function, as the
659     module selects a default that is suitable for low to moderate load.
660    
661 root 1.2 =item BDB::max_parallel $nthreads
662 root 1.1
663 root 1.18 Sets the maximum number of BDB threads to C<$nthreads>. If more than the
664 root 1.1 specified number of threads are currently running, this function kills
665     them. This function blocks until the limit is reached.
666    
667     While C<$nthreads> are zero, aio requests get queued but not executed
668     until the number of threads has been increased again.
669    
670     This module automatically runs C<max_parallel 0> at program end, to ensure
671     that all threads are killed and that there are no outstanding requests.
672    
673     Under normal circumstances you don't need to call this function.
674    
675 root 1.2 =item BDB::max_idle $nthreads
676 root 1.1
677     Limit the number of threads (default: 4) that are allowed to idle (i.e.,
678     threads that did not get a request to process within 10 seconds). That
679     means if a thread becomes idle while C<$nthreads> other threads are also
680     idle, it will free its resources and exit.
681    
682     This is useful when you allow a large number of threads (e.g. 100 or 1000)
683     to allow for extremely high load situations, but want to free resources
684     under normal circumstances (1000 threads can easily consume 30MB of RAM).
685    
686     The default is probably ok in most situations, especially if thread
687     creation is fast. If thread creation is very slow on your system you might
688     want to use larger values.
689    
690 root 1.2 =item $oldmaxreqs = BDB::max_outstanding $maxreqs
691 root 1.1
692     This is a very bad function to use in interactive programs because it
693     blocks, and a bad way to reduce concurrency because it is inexact: Better
694     use an C<aio_group> together with a feed callback.
695    
696     Sets the maximum number of outstanding requests to C<$nreqs>. If you
697     to queue up more than this number of requests, the next call to the
698     C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
699     function will block until the limit is no longer exceeded.
700    
701     The default value is very large, so there is no practical limit on the
702     number of outstanding requests.
703    
704     You can still queue as many requests as you want. Therefore,
705     C<max_oustsanding> is mainly useful in simple scripts (with low values) or
706     as a stop gap to shield against fatal memory overflow (with large values).
707    
708 root 1.3 =item BDB::set_sync_prepare $cb
709    
710     Sets a callback that is called whenever a request is created without an
711     explicit callback. It has to return two code references. The first is used
712 root 1.37 as the request callback (it should save the return status), and the second
713     is called to wait until the first callback has been called (it must set
714     C<$!> to the return status).
715    
716     This mechanism can be used to include BDB into other event mechanisms,
717     such as L<AnyEvent::BDB> or L<Coro::BDB>.
718    
719     The default implementation works like this:
720 root 1.3
721     sub {
722     my $status;
723     (
724     sub { $status = $! },
725     sub { BDB::poll while !defined $status; $! = $status },
726     )
727     }
728    
729 root 1.37 It simply blocks the process till the request has finished and then sets
730     C<$!> to the return value. This means that if you don't use a callback,
731     BDB will simply fall back to synchronous operations.
732    
733 root 1.3 =back
734    
735 root 1.1 =head3 STATISTICAL INFORMATION
736    
737 root 1.3 =over 4
738    
739 root 1.2 =item BDB::nreqs
740 root 1.1
741     Returns the number of requests currently in the ready, execute or pending
742     states (i.e. for which their callback has not been invoked yet).
743    
744     Example: wait till there are no outstanding requests anymore:
745    
746 root 1.2 BDB::poll_wait, BDB::poll_cb
747     while BDB::nreqs;
748 root 1.1
749 root 1.2 =item BDB::nready
750 root 1.1
751     Returns the number of requests currently in the ready state (not yet
752     executed).
753    
754 root 1.2 =item BDB::npending
755 root 1.1
756     Returns the number of requests currently in the pending state (executed,
757     but not yet processed by poll_cb).
758    
759     =back
760    
761     =cut
762    
763 root 1.3 set_sync_prepare {
764     my $status;
765     (
766     sub {
767     $status = $!;
768     },
769     sub {
770     BDB::poll while !defined $status;
771     $! = $status;
772     },
773     )
774     };
775    
776 root 1.1 min_parallel 8;
777    
778     END { flush }
779    
780     1;
781    
782     =head2 FORK BEHAVIOUR
783    
784     This module should do "the right thing" when the process using it forks:
785    
786 root 1.18 Before the fork, BDB enters a quiescent state where no requests
787 root 1.1 can be added in other threads and no results will be processed. After
788     the fork the parent simply leaves the quiescent state and continues
789     request/result processing, while the child frees the request/result queue
790     (so that the requests started before the fork will only be handled in the
791     parent). Threads will be started on demand until the limit set in the
792     parent process has been reached again.
793    
794     In short: the parent will, after a short pause, continue as if fork had
795 root 1.18 not been called, while the child will act as if BDB has not been used
796 root 1.1 yet.
797    
798 root 1.26 Win32 note: there is no fork on win32, and perls emulation of it is too
799     broken to be supported, so do not use BDB in a windows pseudo-fork, better
800     yet, switch to a more capable platform.
801    
802 root 1.1 =head2 MEMORY USAGE
803    
804     Per-request usage:
805    
806     Each aio request uses - depending on your architecture - around 100-200
807     bytes of memory. In addition, stat requests need a stat buffer (possibly
808     a few hundred bytes), readdir requires a result buffer and so on. Perl
809     scalars and other data passed into aio requests will also be locked and
810     will consume memory till the request has entered the done state.
811    
812 root 1.13 This is not awfully much, so queuing lots of requests is not usually a
813 root 1.1 problem.
814    
815     Per-thread usage:
816    
817     In the execution phase, some aio requests require more memory for
818     temporary buffers, and each thread requires a stack and other data
819     structures (usually around 16k-128k, depending on the OS).
820    
821     =head1 KNOWN BUGS
822    
823 root 1.15 Known bugs will be fixed in the next release, except:
824    
825     If you use a transaction in any request, and the request returns
826     with an operating system error or DB_LOCK_NOTGRANTED, the internal
827     TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
828     above.
829 root 1.1
830     =head1 SEE ALSO
831    
832 root 1.36 L<AnyEvent::BDB> (event loop integration), L<Coro::BDB> (more natural
833     syntax), L<IO::AIO> (nice to have).
834 root 1.1
835     =head1 AUTHOR
836    
837     Marc Lehmann <schmorp@schmorp.de>
838     http://home.schmorp.de/
839    
840     =cut
841