ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
(Generate patch)

Comparing BDB/BDB.pm (file contents):
Revision 1.14 by root, Thu Sep 13 12:29:49 2007 UTC vs.
Revision 1.31 by root, Sun Jan 13 09:43:21 2008 UTC

3BDB - Asynchronous Berkeley DB access 3BDB - Asynchronous Berkeley DB access
4 4
5=head1 SYNOPSIS 5=head1 SYNOPSIS
6 6
7 use BDB; 7 use BDB;
8
9 my $env = db_env_create;
10
11 mkdir "bdtest", 0700;
12 db_env_open
13 $env,
14 "bdtest",
15 BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16 | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17 0600;
18
19 $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20
21 my $db = db_create $env;
22 db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23 | BDB::READ_UNCOMMITTED, 0600;
24 db_put $db, undef, "key", "data", 0, sub {
25 db_del $db, undef, "key";
26 };
27 db_sync $db;
28
29 # when you also use Coro, management is easy:
30 use Coro::BDB;
31
32 # automatic result processing with AnyEvent:
33 our $FH; open $FH, "<&=" . BDB::poll_fileno;
34 our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
35
36 # automatic result processing with EV:
37 my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
38
39 # with Glib:
40 add_watch Glib::IO BDB::poll_fileno,
41 in => sub { BDB::poll_cb; 1 };
42
43 # or simply flush manually
44 BDB::flush;
45
8 46
9=head1 DESCRIPTION 47=head1 DESCRIPTION
10 48
11See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>). 49See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
12The BDB API is very similar to the C API (the translation has been very faithful). 50The BDB API is very similar to the C API (the translation has been very faithful).
72use strict 'vars'; 110use strict 'vars';
73 111
74use base 'Exporter'; 112use base 'Exporter';
75 113
76BEGIN { 114BEGIN {
77 our $VERSION = '1.0'; 115 our $VERSION = '1.43';
78 116
79 our @BDB_REQ = qw( 117 our @BDB_REQ = qw(
80 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect 118 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
81 db_env_memp_sync db_env_memp_trickle 119 db_env_memp_sync db_env_memp_trickle
82 db_open db_close db_compact db_sync db_put db_get db_pget db_del db_key_range 120 db_open db_close db_compact db_sync db_upgrade
121 db_put db_get db_pget db_del db_key_range
83 db_txn_commit db_txn_abort 122 db_txn_commit db_txn_abort db_txn_finish
84 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del 123 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
85 db_sequence_open db_sequence_close 124 db_sequence_open db_sequence_close
86 db_sequence_get db_sequence_remove 125 db_sequence_get db_sequence_remove
87 ); 126 );
88 our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create)); 127 our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
99 138
100=head2 BERKELEYDB FUNCTIONS 139=head2 BERKELEYDB FUNCTIONS
101 140
102All of these are functions. The create functions simply return a new 141All of these are functions. The create functions simply return a new
103object and never block. All the remaining functions all take an optional 142object and never block. All the remaining functions all take an optional
104callback as last argument. If it is missing, then the fucntion will be 143callback as last argument. If it is missing, then the function will be
105executed synchronously. 144executed synchronously. In both cases, C<$!> will reflect the return value
145of the function.
106 146
107BDB functions that cannot block (mostly functions that manipulate 147BDB functions that cannot block (mostly functions that manipulate
108settings) are method calls on the relevant objects, so the rule of thumb 148settings) are method calls on the relevant objects, so the rule of thumb
109is: if its a method, its not blocking, if its a function, it takes a 149is: if its a method, its not blocking, if its a function, it takes a
110callback as last argument. 150callback as last argument.
145 185
146 db_open (DB *db, DB_TXN_ornull *txnid, octetstring file, octetstring database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef) 186 db_open (DB *db, DB_TXN_ornull *txnid, octetstring file, octetstring database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
147 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE 187 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
148 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef) 188 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
149 flags: DB_NOSYNC 189 flags: DB_NOSYNC
190 db_upgrade (DB *db, octetstring file, U32 flags = 0, SV *callback = &PL_sv_undef)
150 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef) 191 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef)
151 flags: FREELIST_ONLY FREE_SPACE 192 flags: FREELIST_ONLY FREE_SPACE
152 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef) 193 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
153 db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef) 194 db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef)
154 db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef) 195 db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
177 db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef) 218 db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
178 flags: TXN_NOSYNC 219 flags: TXN_NOSYNC
179 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef) 220 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
180 flags: TXN_NOSYNC 221 flags: TXN_NOSYNC
181 222
223=head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
224
225This is not actually a Berkeley DB function but a BDB module
226extension. The background for this exytension is: It is very annoying to
227have to check every single BDB function for error returns and provide a
228codepath out of your transaction. While the BDB module still makes this
229possible, it contains the following extensions:
230
231When a transaction-protected function returns any operating system
232error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
233transaction. This flag is also set by Berkeley DB functions themselves
234when an operation fails with LOCK_DEADLOCK, and it causes all further
235operations on that transaction (including C<db_txn_commit>) to fail.
236
237The C<db_txn_finish> request will look at this flag, and, if it is set,
238will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
239if it isn't set to something else yet). If it isn't set, it will call
240C<db_txn_commit> and return the error normally.
241
242How to use this? Easy: just write your transaction normally:
243
244 my $txn = $db_env->txn_begin;
245 db_get $db, $txn, "key", my $data;
246 db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
247 db_txn_finish $txn;
248 die "transaction failed" if $!;
249
250That is, handle only the expected errors. If something unexpected happens
251(EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
252requests (db_put in this case) will simply be skipped (they will fail with
253LOCK_DEADLOCK) and the transaction will be aborted.
254
255You can use the C<< $txn->failed >> method to check wether a transaction
256has failed in this way and abort further processing (excluding
257C<db_txn_finish>).
258
182=head3 DB_ENV/database environment methods 259=head3 DB_ENV/database environment methods
183 260
184Methods available on DB_ENV/$env handles: 261Methods available on DB_ENV/$env handles:
185 262
186 DESTROY (DB_ENV_ornull *env) 263 DESTROY (DB_ENV_ornull *env)
196 $int = $env->set_flags (U32 flags, int onoff) 273 $int = $env->set_flags (U32 flags, int onoff)
197 $env->set_errfile (FILE *errfile = 0) 274 $env->set_errfile (FILE *errfile = 0)
198 $env->set_msgfile (FILE *msgfile = 0) 275 $env->set_msgfile (FILE *msgfile = 0)
199 $int = $env->set_verbose (U32 which, int onoff = 1) 276 $int = $env->set_verbose (U32 which, int onoff = 1)
200 $int = $env->set_encrypt (const char *password, U32 flags = 0) 277 $int = $env->set_encrypt (const char *password, U32 flags = 0)
201 $int = $env->set_timeout (NV timeout, U32 flags) 278 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
202 $int = $env->set_mp_max_openfd (int maxopenfd); 279 $int = $env->set_mp_max_openfd (int maxopenfd);
203 $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep); 280 $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
204 $int = $env->set_mp_mmapsize (int mmapsize_mb) 281 $int = $env->set_mp_mmapsize (int mmapsize_mb)
205 $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT) 282 $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
206 $int = $env->set_lk_max_lockers (U32 max) 283 $int = $env->set_lk_max_lockers (U32 max)
207 $int = $env->set_lk_max_locks (U32 max) 284 $int = $env->set_lk_max_locks (U32 max)
208 $int = $env->set_lk_max_objects (U32 max) 285 $int = $env->set_lk_max_objects (U32 max)
209 $int = $env->set_lg_bsize (U32 max) 286 $int = $env->set_lg_bsize (U32 max)
210 $int = $env->set_lg_max (U32 max) 287 $int = $env->set_lg_max (U32 max)
288 $int = $env->mutex_set_increment (U32 increment)
289 $int = $env->mutex_set_tas_spins (U32 tas_spins)
290 $int = $env->mutex_set_max (U32 max)
291 $int = $env->mutex_set_align (U32 align)
211 292
212 $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0) 293 $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
213 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC 294 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
214 295
215=head4 Example: 296=head4 Example:
294 DESTROY (DB_TXN_ornull *txn) 375 DESTROY (DB_TXN_ornull *txn)
295 CODE: 376 CODE:
296 if (txn) 377 if (txn)
297 txn->abort (txn); 378 txn->abort (txn);
298 379
299 $int = $txn->set_timeout (NV timeout, U32 flags) 380 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
300 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT 381 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
382
383 $bool = $txn->failed
384 # see db_txn_finish documentation, above
301 385
302 386
303=head3 DBC/cursor methods 387=head3 DBC/cursor methods
304 388
305Methods available on DBC/$dbc handles: 389Methods available on DBC/$dbc handles:
306 390
307 DESTROY (DBC_ornull *dbc) 391 DESTROY (DBC_ornull *dbc)
308 CODE: 392 CODE:
309 if (dbc) 393 if (dbc)
310 dbc->c_close (dbc); 394 dbc->c_close (dbc);
395
396 $int = $cursor->set_priority ($priority = PRIORITY_*)
311 397
312=head4 Example: 398=head4 Example:
313 399
314 my $c = $db->cursor; 400 my $c = $db->cursor;
315 401
349 435
350=head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION 436=head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION
351 437
352=over 4 438=over 4
353 439
440=item $msg = BDB::strerror [$errno]
441
442Returns the string corresponding to the given errno value. If no argument
443is given, use C<$!>.
444
354=item $fileno = BDB::poll_fileno 445=item $fileno = BDB::poll_fileno
355 446
356Return the I<request result pipe file descriptor>. This filehandle must be 447Return the I<request result pipe file descriptor>. This filehandle must be
357polled for reading by some mechanism outside this module (e.g. Event or 448polled for reading by some mechanism outside this module (e.g. Event or
358select, see below or the SYNOPSIS). If the pipe becomes readable you have 449select, see below or the SYNOPSIS). If the pipe becomes readable you have
396interactiveness when perl is not fast enough to process all requests in 487interactiveness when perl is not fast enough to process all requests in
397time. 488time.
398 489
399For interactive programs, values such as C<0.01> to C<0.1> should be fine. 490For interactive programs, values such as C<0.01> to C<0.1> should be fine.
400 491
401Example: Install an Event watcher that automatically calls 492Example: Install an EV watcher that automatically calls
402BDB::poll_cb with low priority, to ensure that other parts of the 493BDB::poll_cb with low priority, to ensure that other parts of the
403program get the CPU sometimes even under high AIO load. 494program get the CPU sometimes even under high load.
404 495
405 # try not to spend much more than 0.1s in poll_cb 496 # try not to spend much more than 0.1s in poll_cb
406 BDB::max_poll_time 0.1; 497 BDB::max_poll_time 0.1;
407 498
408 # use a low priority so other tasks have priority 499 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
409 Event->io (fd => BDB::poll_fileno,
410 poll => 'r', nice => 1,
411 cb => &BDB::poll_cb);
412 500
413=item BDB::poll_wait 501=item BDB::poll_wait
414 502
415If there are any outstanding requests and none of them in the result 503If there are any outstanding requests and none of them in the result
416phase, wait till the result filehandle becomes ready for reading (simply 504phase, wait till the result filehandle becomes ready for reading (simply
428 516
429 BDB::poll_wait, BDB::poll_cb 517 BDB::poll_wait, BDB::poll_cb
430 518
431=item BDB::flush 519=item BDB::flush
432 520
433Wait till all outstanding AIO requests have been handled. 521Wait till all outstanding BDB requests have been handled.
434 522
435Strictly equivalent to: 523Strictly equivalent to:
436 524
437 BDB::poll_wait, BDB::poll_cb 525 BDB::poll_wait, BDB::poll_cb
438 while BDB::nreqs; 526 while BDB::nreqs;
443 531
444=over 4 532=over 4
445 533
446=item BDB::min_parallel $nthreads 534=item BDB::min_parallel $nthreads
447 535
448Set the minimum number of AIO threads to C<$nthreads>. The current 536Set the minimum number of BDB threads to C<$nthreads>. The current
449default is C<8>, which means eight asynchronous operations can execute 537default is C<8>, which means eight asynchronous operations can execute
450concurrently at any one time (the number of outstanding requests, 538concurrently at any one time (the number of outstanding requests,
451however, is unlimited). 539however, is unlimited).
452 540
453BDB starts threads only on demand, when an AIO request is queued and 541BDB starts threads only on demand, when an BDB request is queued and
454no free thread exists. Please note that queueing up a hundred requests can 542no free thread exists. Please note that queueing up a hundred requests can
455create demand for a hundred threads, even if it turns out that everything 543create demand for a hundred threads, even if it turns out that everything
456is in the cache and could have been processed faster by a single thread. 544is in the cache and could have been processed faster by a single thread.
457 545
458It is recommended to keep the number of threads relatively low, as some 546It is recommended to keep the number of threads relatively low, as some
463Under most circumstances you don't need to call this function, as the 551Under most circumstances you don't need to call this function, as the
464module selects a default that is suitable for low to moderate load. 552module selects a default that is suitable for low to moderate load.
465 553
466=item BDB::max_parallel $nthreads 554=item BDB::max_parallel $nthreads
467 555
468Sets the maximum number of AIO threads to C<$nthreads>. If more than the 556Sets the maximum number of BDB threads to C<$nthreads>. If more than the
469specified number of threads are currently running, this function kills 557specified number of threads are currently running, this function kills
470them. This function blocks until the limit is reached. 558them. This function blocks until the limit is reached.
471 559
472While C<$nthreads> are zero, aio requests get queued but not executed 560While C<$nthreads> are zero, aio requests get queued but not executed
473until the number of threads has been increased again. 561until the number of threads has been increased again.
576 664
577=head2 FORK BEHAVIOUR 665=head2 FORK BEHAVIOUR
578 666
579This module should do "the right thing" when the process using it forks: 667This module should do "the right thing" when the process using it forks:
580 668
581Before the fork, IO::AIO enters a quiescent state where no requests 669Before the fork, BDB enters a quiescent state where no requests
582can be added in other threads and no results will be processed. After 670can be added in other threads and no results will be processed. After
583the fork the parent simply leaves the quiescent state and continues 671the fork the parent simply leaves the quiescent state and continues
584request/result processing, while the child frees the request/result queue 672request/result processing, while the child frees the request/result queue
585(so that the requests started before the fork will only be handled in the 673(so that the requests started before the fork will only be handled in the
586parent). Threads will be started on demand until the limit set in the 674parent). Threads will be started on demand until the limit set in the
587parent process has been reached again. 675parent process has been reached again.
588 676
589In short: the parent will, after a short pause, continue as if fork had 677In short: the parent will, after a short pause, continue as if fork had
590not been called, while the child will act as if IO::AIO has not been used 678not been called, while the child will act as if BDB has not been used
591yet. 679yet.
680
681Win32 note: there is no fork on win32, and perls emulation of it is too
682broken to be supported, so do not use BDB in a windows pseudo-fork, better
683yet, switch to a more capable platform.
592 684
593=head2 MEMORY USAGE 685=head2 MEMORY USAGE
594 686
595Per-request usage: 687Per-request usage:
596 688
609temporary buffers, and each thread requires a stack and other data 701temporary buffers, and each thread requires a stack and other data
610structures (usually around 16k-128k, depending on the OS). 702structures (usually around 16k-128k, depending on the OS).
611 703
612=head1 KNOWN BUGS 704=head1 KNOWN BUGS
613 705
614Known bugs will be fixed in the next release. 706Known bugs will be fixed in the next release, except:
707
708 If you use a transaction in any request, and the request returns
709 with an operating system error or DB_LOCK_NOTGRANTED, the internal
710 TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
711 above.
615 712
616=head1 SEE ALSO 713=head1 SEE ALSO
617 714
618L<Coro::AIO>. 715L<Coro::BDB>, L<IO::AIO>.
619 716
620=head1 AUTHOR 717=head1 AUTHOR
621 718
622 Marc Lehmann <schmorp@schmorp.de> 719 Marc Lehmann <schmorp@schmorp.de>
623 http://home.schmorp.de/ 720 http://home.schmorp.de/

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines