ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
Revision: 1.39
Committed: Tue Jul 8 08:25:31 2008 UTC (15 years, 10 months ago) by root
Branch: MAIN
Changes since 1.38: +56 -2 lines
Log Message:
*** empty log message ***

File Contents

# Content
1 =head1 NAME
2
3 BDB - Asynchronous Berkeley DB access
4
5 =head1 SYNOPSIS
6
7 use BDB;
8
9 my $env = db_env_create;
10
11 mkdir "bdtest", 0700;
12 db_env_open
13 $env,
14 "bdtest",
15 BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16 | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17 0600;
18
19 $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20
21 my $db = db_create $env;
22 db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23 | BDB::READ_UNCOMMITTED, 0600;
24 db_put $db, undef, "key", "data", 0, sub {
25 db_del $db, undef, "key";
26 };
27 db_sync $db;
28
29 # when you also use Coro, management is easy:
30 use Coro::BDB;
31
32 # automatic event loop intergration with AnyEvent:
33 use AnyEvent::BDB;
34
35 # automatic result processing with EV:
36 my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
37
38 # with Glib:
39 add_watch Glib::IO BDB::poll_fileno,
40 in => sub { BDB::poll_cb; 1 };
41
42 # or simply flush manually
43 BDB::flush;
44
45
46 =head1 DESCRIPTION
47
48 See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
49 The BDB API is very similar to the C API (the translation has been very faithful).
50
51 See also the example sections in the document below and possibly the eg/
52 subdirectory of the BDB distribution. Last not least see the IO::AIO
53 documentation, as that module uses almost the same asynchronous request
54 model as this module.
55
56 I know this is woefully inadequate documentation. Send a patch!
57
58
59 =head1 REQUEST ANATOMY AND LIFETIME
60
61 Every request method creates a request. which is a C data structure not
62 directly visible to Perl.
63
64 During their existance, bdb requests travel through the following states,
65 in order:
66
67 =over 4
68
69 =item ready
70
71 Immediately after a request is created it is put into the ready state,
72 waiting for a thread to execute it.
73
74 =item execute
75
76 A thread has accepted the request for processing and is currently
77 executing it (e.g. blocking in read).
78
79 =item pending
80
81 The request has been executed and is waiting for result processing.
82
83 While request submission and execution is fully asynchronous, result
84 processing is not and relies on the perl interpreter calling C<poll_cb>
85 (or another function with the same effect).
86
87 =item result
88
89 The request results are processed synchronously by C<poll_cb>.
90
91 The C<poll_cb> function will process all outstanding aio requests by
92 calling their callbacks, freeing memory associated with them and managing
93 any groups they are contained in.
94
95 =item done
96
97 Request has reached the end of its lifetime and holds no resources anymore
98 (except possibly for the Perl object, but its connection to the actual
99 aio request is severed and calling its methods will either do nothing or
100 result in a runtime error).
101
102 =back
103
104 =cut
105
106 package BDB;
107
108 no warnings;
109 use strict 'vars';
110
111 use base 'Exporter';
112
113 BEGIN {
114 our $VERSION = '1.6';
115
116 our @BDB_REQ = qw(
117 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
118 db_env_memp_sync db_env_memp_trickle db_env_dbrename db_env_dbremove
119 db_open db_close db_compact db_sync db_upgrade
120 db_put db_get db_pget db_del db_key_range
121 db_txn_commit db_txn_abort db_txn_finish
122 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
123 db_sequence_open db_sequence_close
124 db_sequence_get db_sequence_remove
125 );
126 our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
127 our @EXPORT_OK = qw(
128 poll_fileno poll_cb poll_wait flush
129 min_parallel max_parallel max_idle
130 nreqs nready npending nthreads
131 max_poll_time max_poll_reqs
132 );
133
134 require XSLoader;
135 XSLoader::load ("BDB", $VERSION);
136 }
137
138 =head2 WIN32 FILENAMES/DATABASE NAME MESS
139
140 Perl on Win32 supports only ASCII filenames (the reason is that it abuses
141 an internal flag to store wether a filename is Unicode or ANSI, but that
142 flag is used for somethign else in the perl core, so there is no way to
143 detect wether a filename is ANSI or Unicode-encoded). The BDB module
144 tries to work around this issue by assuming that the filename is an ANSI
145 filename and BDB was built for unicode support.
146
147 =head2 BERKELEYDB FUNCTIONS
148
149 All of these are functions. The create functions simply return a new
150 object and never block. All the remaining functions take an optional
151 callback as last argument. If it is missing, then the function will be
152 executed synchronously. In both cases, C<$!> will reflect the return value
153 of the function.
154
155 BDB functions that cannot block (mostly functions that manipulate
156 settings) are method calls on the relevant objects, so the rule of thumb
157 is: if it's a method, it's not blocking, if it's a function, it takes a
158 callback as last argument.
159
160 In the following, C<$int> signifies an integer return value,
161 C<bdb_filename> is a "filename" (octets on unix, madness on windows),
162 C<U32> is an unsigned 32 bit integer, C<int> is some integer, C<NV> is a
163 floating point value.
164
165 The C<SV *> types are generic perl scalars (for input and output of data
166 values), and the C<SV *callback> is the optional callback function to call
167 when the request is completed.
168
169 The various C<DB_ENV> etc. arguments are handles return by
170 C<db_env_create>, C<db_create>, C<txn_begin> and so on. If they have an
171 appended C<_ornull> this means they are optional and you can pass C<undef>
172 for them, resulting a NULL pointer on the C level.
173
174 =head3 BDB functions
175
176 Functions in the BDB namespace, exported by default:
177
178 $env = db_env_create (U32 env_flags = 0)
179 flags: RPCCLIENT
180
181 db_env_open (DB_ENV *env, bdb_filename db_home, U32 open_flags, int mode, SV *callback = &PL_sv_undef)
182 open_flags: INIT_CDB INIT_LOCK INIT_LOG INIT_MPOOL INIT_REP INIT_TXN RECOVER RECOVER_FATAL USE_ENVIRON USE_ENVIRON_ROOT CREATE LOCKDOWN PRIVATE REGISTER SYSTEM_MEM
183 db_env_close (DB_ENV *env, U32 flags = 0, SV *callback = &PL_sv_undef)
184 db_env_txn_checkpoint (DB_ENV *env, U32 kbyte = 0, U32 min = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
185 flags: FORCE
186 db_env_lock_detect (DB_ENV *env, U32 flags = 0, U32 atype = DB_LOCK_DEFAULT, SV *dummy = 0, SV *callback = &PL_sv_undef)
187 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST
188 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = &PL_sv_undef)
189 db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = &PL_sv_undef)
190 db_env_dbremove (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, U32 flags = 0, SV *callback = &PL_sv_undef)
191 db_env_dbrename (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, bdb_filename newname, U32 flags = 0, SV *callback = &PL_sv_undef)
192
193 $db = db_create (DB_ENV *env = 0, U32 flags = 0)
194 flags: XA_CREATE
195
196 db_open (DB *db, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
197 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
198 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
199 flags: DB_NOSYNC
200 db_upgrade (DB *db, bdb_filename file, U32 flags = 0, SV *callback = &PL_sv_undef)
201 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef)
202 flags: FREELIST_ONLY FREE_SPACE
203 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
204 db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef)
205 db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
206 flags: APPEND NODUPDATA NOOVERWRITE
207 db_get (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
208 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
209 db_pget (DB *db, DB_TXN_ornull *txn, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
210 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
211 db_del (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
212 db_txn_commit (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
213 flags: TXN_NOSYNC TXN_SYNC
214 db_txn_abort (DB_TXN *txn, SV *callback = &PL_sv_undef)
215
216 db_c_close (DBC *dbc, SV *callback = &PL_sv_undef)
217 db_c_count (DBC *dbc, SV *count, U32 flags = 0, SV *callback = &PL_sv_undef)
218 db_c_put (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
219 flags: AFTER BEFORE CURRENT KEYFIRST KEYLAST NODUPDATA
220 db_c_get (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
221 flags: CURRENT FIRST GET_BOTH GET_BOTH_RANGE GET_RECNO JOIN_ITEM LAST NEXT NEXT_DUP NEXT_NODUP PREV PREV_DUP PREV_NODUP SET SET_RANGE SET_RECNO READ_UNCOMMITTED MULTIPLE MULTIPLE_KEY RMW
222 db_c_pget (DBC *dbc, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
223 db_c_del (DBC *dbc, U32 flags = 0, SV *callback = &PL_sv_undef)
224
225 db_sequence_open (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
226 flags: CREATE EXCL
227 db_sequence_close (DB_SEQUENCE *seq, U32 flags = 0, SV *callback = &PL_sv_undef)
228 db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
229 flags: TXN_NOSYNC
230 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
231 flags: TXN_NOSYNC
232
233 =head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
234
235 This is not actually a Berkeley DB function but a BDB module
236 extension. The background for this exytension is: It is very annoying to
237 have to check every single BDB function for error returns and provide a
238 codepath out of your transaction. While the BDB module still makes this
239 possible, it contains the following extensions:
240
241 When a transaction-protected function returns any operating system
242 error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
243 transaction. This flag is also set by Berkeley DB functions themselves
244 when an operation fails with LOCK_DEADLOCK, and it causes all further
245 operations on that transaction (including C<db_txn_commit>) to fail.
246
247 The C<db_txn_finish> request will look at this flag, and, if it is set,
248 will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
249 if it isn't set to something else yet). If it isn't set, it will call
250 C<db_txn_commit> and return the error normally.
251
252 How to use this? Easy: just write your transaction normally:
253
254 my $txn = $db_env->txn_begin;
255 db_get $db, $txn, "key", my $data;
256 db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
257 db_txn_finish $txn;
258 die "transaction failed" if $!;
259
260 That is, handle only the expected errors. If something unexpected happens
261 (EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
262 requests (db_put in this case) will simply be skipped (they will fail with
263 LOCK_DEADLOCK) and the transaction will be aborted.
264
265 You can use the C<< $txn->failed >> method to check wether a transaction
266 has failed in this way and abort further processing (excluding
267 C<db_txn_finish>).
268
269 =head3 DB_ENV/database environment methods
270
271 Methods available on DB_ENV/$env handles:
272
273 DESTROY (DB_ENV_ornull *env)
274 CODE:
275 if (env)
276 env->close (env, 0);
277
278 $int = $env->set_data_dir (const char *dir)
279 $int = $env->set_tmp_dir (const char *dir)
280 $int = $env->set_lg_dir (const char *dir)
281 $int = $env->set_shm_key (long shm_key)
282 $int = $env->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
283 $int = $env->set_flags (U32 flags, int onoff = 1)
284 $int = $env->log_set_config (U32 flags, int onoff = 1) [v4.7]
285 $int = $env->set_intermediate_dir_mode (const char *modestring) [v4.7]
286 $env->set_errfile (FILE *errfile = 0)
287 $env->set_msgfile (FILE *msgfile = 0)
288 $int = $env->set_verbose (U32 which, int onoff = 1)
289 $int = $env->set_encrypt (const char *password, U32 flags = 0)
290 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
291 $int = $env->set_mp_max_openfd (int maxopenfd);
292 $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
293 $int = $env->set_mp_mmapsize (int mmapsize_mb)
294 $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
295 $int = $env->set_lk_max_lockers (U32 max)
296 $int = $env->set_lk_max_locks (U32 max)
297 $int = $env->set_lk_max_objects (U32 max)
298 $int = $env->set_lg_bsize (U32 max)
299 $int = $env->set_lg_max (U32 max)
300 $int = $env->mutex_set_increment (U32 increment)
301 $int = $env->mutex_set_tas_spins (U32 tas_spins)
302 $int = $env->mutex_set_max (U32 max)
303 $int = $env->mutex_set_align (U32 align)
304
305 $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
306 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
307
308 =head4 Example:
309
310 use AnyEvent;
311 use BDB;
312
313 our $FH; open $FH, "<&=" . BDB::poll_fileno;
314 our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
315
316 BDB::min_parallel 8;
317
318 my $env = db_env_create;
319
320 mkdir "bdtest", 0700;
321 db_env_open
322 $env,
323 "bdtest",
324 BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
325 0600;
326
327 $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
328
329
330 =head3 DB/database methods
331
332 Methods available on DB/$db handles:
333
334 DESTROY (DB_ornull *db)
335 CODE:
336 if (db)
337 {
338 SV *env = (SV *)db->app_private;
339 db->close (db, 0);
340 SvREFCNT_dec (env);
341 }
342
343 $int = $db->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
344 $int = $db->set_flags (U32 flags)
345 flags: CHKSUM ENCRYPT TXN_NOT_DURABLE
346 Btree: DUP DUPSORT RECNUM REVSPLITOFF
347 Hash: DUP DUPSORT
348 Queue: INORDER
349 Recno: RENUMBER SNAPSHOT
350
351 $int = $db->set_encrypt (const char *password, U32 flags)
352 $int = $db->set_lorder (int lorder)
353 $int = $db->set_bt_minkey (U32 minkey)
354 $int = $db->set_re_delim (int delim)
355 $int = $db->set_re_pad (int re_pad)
356 $int = $db->set_re_source (char *source)
357 $int = $db->set_re_len (U32 re_len)
358 $int = $db->set_h_ffactor (U32 h_ffactor)
359 $int = $db->set_h_nelem (U32 h_nelem)
360 $int = $db->set_q_extentsize (U32 extentsize)
361
362 $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0)
363 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT
364 $seq = $db->sequence (U32 flags = 0)
365
366 =head4 Example:
367
368 my $db = db_create $env;
369 db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600;
370
371 for (1..1000) {
372 db_put $db, undef, "key $_", "data $_";
373
374 db_key_range $db, undef, "key $_", my $keyrange;
375 my ($lt, $eq, $gt) = @$keyrange;
376 }
377
378 db_del $db, undef, "key $_" for 1..1000;
379
380 db_sync $db;
381
382
383 =head3 DB_TXN/transaction methods
384
385 Methods available on DB_TXN/$txn handles:
386
387 DESTROY (DB_TXN_ornull *txn)
388 CODE:
389 if (txn)
390 txn->abort (txn);
391
392 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
393 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
394
395 $bool = $txn->failed
396 # see db_txn_finish documentation, above
397
398
399 =head3 DBC/cursor methods
400
401 Methods available on DBC/$dbc handles:
402
403 DESTROY (DBC_ornull *dbc)
404 CODE:
405 if (dbc)
406 dbc->c_close (dbc);
407
408 $int = $cursor->set_priority ($priority = PRIORITY_*)
409
410 =head4 Example:
411
412 my $c = $db->cursor;
413
414 for (;;) {
415 db_c_get $c, my $key, my $data, BDB::NEXT;
416 warn "<$!,$key,$data>";
417 last if $!;
418 }
419
420 db_c_close $c;
421
422
423 =head3 DB_SEQUENCE/sequence methods
424
425 Methods available on DB_SEQUENCE/$seq handles:
426
427 DESTROY (DB_SEQUENCE_ornull *seq)
428 CODE:
429 if (seq)
430 seq->close (seq, 0);
431
432 $int = $seq->initial_value (db_seq_t value)
433 $int = $seq->set_cachesize (U32 size)
434 $int = $seq->set_flags (U32 flags)
435 flags: SEQ_DEC SEQ_INC SEQ_WRAP
436 $int = $seq->set_range (db_seq_t min, db_seq_t max)
437
438 =head4 Example:
439
440 my $seq = $db->sequence;
441
442 db_sequence_open $seq, undef, "seq", BDB::CREATE;
443 db_sequence_get $seq, undef, 1, my $value;
444
445
446 =head2 SUPPORT FUNCTIONS
447
448 =head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION
449
450 =over 4
451
452 =item $msg = BDB::strerror [$errno]
453
454 Returns the string corresponding to the given errno value. If no argument
455 is given, use C<$!>.
456
457 Note that the BDB module also patches the C<$!> variable directly, so you
458 should be able to get a bdb error string by simply stringifying C<$!>.
459
460 =item $fileno = BDB::poll_fileno
461
462 Return the I<request result pipe file descriptor>. This filehandle must be
463 polled for reading by some mechanism outside this module (e.g. Event or
464 select, see below or the SYNOPSIS). If the pipe becomes readable you have
465 to call C<poll_cb> to check the results.
466
467 See C<poll_cb> for an example.
468
469 =item BDB::poll_cb
470
471 Process some outstanding events on the result pipe. You have to call this
472 regularly. Returns the number of events processed. Returns immediately
473 when no events are outstanding. The amount of events processed depends on
474 the settings of C<BDB::max_poll_req> and C<BDB::max_poll_time>.
475
476 If not all requests were processed for whatever reason, the filehandle
477 will still be ready when C<poll_cb> returns.
478
479 Example: Install an Event watcher that automatically calls
480 BDB::poll_cb with high priority:
481
482 Event->io (fd => BDB::poll_fileno,
483 poll => 'r', async => 1,
484 cb => \&BDB::poll_cb);
485
486 =item BDB::max_poll_reqs $nreqs
487
488 =item BDB::max_poll_time $seconds
489
490 These set the maximum number of requests (default C<0>, meaning infinity)
491 that are being processed by C<BDB::poll_cb> in one call, respectively
492 the maximum amount of time (default C<0>, meaning infinity) spent in
493 C<BDB::poll_cb> to process requests (more correctly the mininum amount
494 of time C<poll_cb> is allowed to use).
495
496 Setting C<max_poll_time> to a non-zero value creates an overhead of one
497 syscall per request processed, which is not normally a problem unless your
498 callbacks are really really fast or your OS is really really slow (I am
499 not mentioning Solaris here). Using C<max_poll_reqs> incurs no overhead.
500
501 Setting these is useful if you want to ensure some level of
502 interactiveness when perl is not fast enough to process all requests in
503 time.
504
505 For interactive programs, values such as C<0.01> to C<0.1> should be fine.
506
507 Example: Install an EV watcher that automatically calls
508 BDB::poll_cb with low priority, to ensure that other parts of the
509 program get the CPU sometimes even under high load.
510
511 # try not to spend much more than 0.1s in poll_cb
512 BDB::max_poll_time 0.1;
513
514 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
515
516 =item BDB::poll_wait
517
518 If there are any outstanding requests and none of them in the result
519 phase, wait till the result filehandle becomes ready for reading (simply
520 does a C<select> on the filehandle. This is useful if you want to
521 synchronously wait for some requests to finish).
522
523 See C<nreqs> for an example.
524
525 =item BDB::poll
526
527 Waits until some requests have been handled.
528
529 Returns the number of requests processed, but is otherwise strictly
530 equivalent to:
531
532 BDB::poll_wait, BDB::poll_cb
533
534 =item BDB::flush
535
536 Wait till all outstanding BDB requests have been handled.
537
538 Strictly equivalent to:
539
540 BDB::poll_wait, BDB::poll_cb
541 while BDB::nreqs;
542
543 =back
544
545 =head3 VERSION CHECKING
546
547 BerkeleyDB comes in various versions, many of them have minor
548 incompatibilities. This means that traditional "at least version x.x"
549 checks are often not sufficient.
550
551 =over 4
552
553 =item BDB::VERSION
554
555 The C<BDB::VERSION> function, when called without arguments, returns the
556 Berkeley DB version as a v-string (usually with 3 components). You should
557 use C<lt> and C<ge> operators exclusively to make comparisons.
558
559 Example: check for at least version 4.7.
560
561 BDB::VERSION ge v4.7 or die;
562
563 =item BDB::VERSION min-version
564
565 Returns true if the BDB version is at least the given version (specified
566 as a v-string), false otherwise.
567
568 Example: check for at least version 4.5.
569
570 BDB::VERSION v4.7 or die;
571
572 =item BDB::VERSION min-version, max-version
573
574 Returns true of the BDB version is at least version C<min-version> (specify C<undef> or C<v0> for any minimum version)
575 and less then C<max-version>.
576
577 Example: check wether version is strictly less then v4.7.
578
579 BDB::VERSION v0, v4.7
580 or die "version 4.7 is not yet supported";
581
582 =back
583
584 =cut
585
586 sub VERSION {
587 if (@_ > 0) {
588 return undef if VERSION_v lt $_[0];
589 if (@_ > 1) {
590 return undef if VERSION_v ge $_[1];
591 }
592 }
593
594 VERSION_v
595 }
596
597 =head3 CONTROLLING THE NUMBER OF THREADS
598
599 =over 4
600
601 =item BDB::min_parallel $nthreads
602
603 Set the minimum number of BDB threads to C<$nthreads>. The current
604 default is C<8>, which means eight asynchronous operations can execute
605 concurrently at any one time (the number of outstanding requests,
606 however, is unlimited).
607
608 BDB starts threads only on demand, when an BDB request is queued and
609 no free thread exists. Please note that queueing up a hundred requests can
610 create demand for a hundred threads, even if it turns out that everything
611 is in the cache and could have been processed faster by a single thread.
612
613 It is recommended to keep the number of threads relatively low, as some
614 Linux kernel versions will scale negatively with the number of threads
615 (higher parallelity => MUCH higher latency). With current Linux 2.6
616 versions, 4-32 threads should be fine.
617
618 Under most circumstances you don't need to call this function, as the
619 module selects a default that is suitable for low to moderate load.
620
621 =item BDB::max_parallel $nthreads
622
623 Sets the maximum number of BDB threads to C<$nthreads>. If more than the
624 specified number of threads are currently running, this function kills
625 them. This function blocks until the limit is reached.
626
627 While C<$nthreads> are zero, aio requests get queued but not executed
628 until the number of threads has been increased again.
629
630 This module automatically runs C<max_parallel 0> at program end, to ensure
631 that all threads are killed and that there are no outstanding requests.
632
633 Under normal circumstances you don't need to call this function.
634
635 =item BDB::max_idle $nthreads
636
637 Limit the number of threads (default: 4) that are allowed to idle (i.e.,
638 threads that did not get a request to process within 10 seconds). That
639 means if a thread becomes idle while C<$nthreads> other threads are also
640 idle, it will free its resources and exit.
641
642 This is useful when you allow a large number of threads (e.g. 100 or 1000)
643 to allow for extremely high load situations, but want to free resources
644 under normal circumstances (1000 threads can easily consume 30MB of RAM).
645
646 The default is probably ok in most situations, especially if thread
647 creation is fast. If thread creation is very slow on your system you might
648 want to use larger values.
649
650 =item $oldmaxreqs = BDB::max_outstanding $maxreqs
651
652 This is a very bad function to use in interactive programs because it
653 blocks, and a bad way to reduce concurrency because it is inexact: Better
654 use an C<aio_group> together with a feed callback.
655
656 Sets the maximum number of outstanding requests to C<$nreqs>. If you
657 to queue up more than this number of requests, the next call to the
658 C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
659 function will block until the limit is no longer exceeded.
660
661 The default value is very large, so there is no practical limit on the
662 number of outstanding requests.
663
664 You can still queue as many requests as you want. Therefore,
665 C<max_oustsanding> is mainly useful in simple scripts (with low values) or
666 as a stop gap to shield against fatal memory overflow (with large values).
667
668 =item BDB::set_sync_prepare $cb
669
670 Sets a callback that is called whenever a request is created without an
671 explicit callback. It has to return two code references. The first is used
672 as the request callback (it should save the return status), and the second
673 is called to wait until the first callback has been called (it must set
674 C<$!> to the return status).
675
676 This mechanism can be used to include BDB into other event mechanisms,
677 such as L<AnyEvent::BDB> or L<Coro::BDB>.
678
679 The default implementation works like this:
680
681 sub {
682 my $status;
683 (
684 sub { $status = $! },
685 sub { BDB::poll while !defined $status; $! = $status },
686 )
687 }
688
689 It simply blocks the process till the request has finished and then sets
690 C<$!> to the return value. This means that if you don't use a callback,
691 BDB will simply fall back to synchronous operations.
692
693 =back
694
695 =head3 STATISTICAL INFORMATION
696
697 =over 4
698
699 =item BDB::nreqs
700
701 Returns the number of requests currently in the ready, execute or pending
702 states (i.e. for which their callback has not been invoked yet).
703
704 Example: wait till there are no outstanding requests anymore:
705
706 BDB::poll_wait, BDB::poll_cb
707 while BDB::nreqs;
708
709 =item BDB::nready
710
711 Returns the number of requests currently in the ready state (not yet
712 executed).
713
714 =item BDB::npending
715
716 Returns the number of requests currently in the pending state (executed,
717 but not yet processed by poll_cb).
718
719 =back
720
721 =cut
722
723 set_sync_prepare {
724 my $status;
725 (
726 sub {
727 $status = $!;
728 },
729 sub {
730 BDB::poll while !defined $status;
731 $! = $status;
732 },
733 )
734 };
735
736 min_parallel 8;
737
738 END { flush }
739
740 1;
741
742 =head2 FORK BEHAVIOUR
743
744 This module should do "the right thing" when the process using it forks:
745
746 Before the fork, BDB enters a quiescent state where no requests
747 can be added in other threads and no results will be processed. After
748 the fork the parent simply leaves the quiescent state and continues
749 request/result processing, while the child frees the request/result queue
750 (so that the requests started before the fork will only be handled in the
751 parent). Threads will be started on demand until the limit set in the
752 parent process has been reached again.
753
754 In short: the parent will, after a short pause, continue as if fork had
755 not been called, while the child will act as if BDB has not been used
756 yet.
757
758 Win32 note: there is no fork on win32, and perls emulation of it is too
759 broken to be supported, so do not use BDB in a windows pseudo-fork, better
760 yet, switch to a more capable platform.
761
762 =head2 MEMORY USAGE
763
764 Per-request usage:
765
766 Each aio request uses - depending on your architecture - around 100-200
767 bytes of memory. In addition, stat requests need a stat buffer (possibly
768 a few hundred bytes), readdir requires a result buffer and so on. Perl
769 scalars and other data passed into aio requests will also be locked and
770 will consume memory till the request has entered the done state.
771
772 This is not awfully much, so queuing lots of requests is not usually a
773 problem.
774
775 Per-thread usage:
776
777 In the execution phase, some aio requests require more memory for
778 temporary buffers, and each thread requires a stack and other data
779 structures (usually around 16k-128k, depending on the OS).
780
781 =head1 KNOWN BUGS
782
783 Known bugs will be fixed in the next release, except:
784
785 If you use a transaction in any request, and the request returns
786 with an operating system error or DB_LOCK_NOTGRANTED, the internal
787 TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
788 above.
789
790 =head1 SEE ALSO
791
792 L<AnyEvent::BDB> (event loop integration), L<Coro::BDB> (more natural
793 syntax), L<IO::AIO> (nice to have).
794
795 =head1 AUTHOR
796
797 Marc Lehmann <schmorp@schmorp.de>
798 http://home.schmorp.de/
799
800 =cut
801