ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
(Generate patch)

Comparing BDB/BDB.pm (file contents):
Revision 1.14 by root, Thu Sep 13 12:29:49 2007 UTC vs.
Revision 1.22 by root, Mon Dec 10 03:57:27 2007 UTC

3BDB - Asynchronous Berkeley DB access 3BDB - Asynchronous Berkeley DB access
4 4
5=head1 SYNOPSIS 5=head1 SYNOPSIS
6 6
7 use BDB; 7 use BDB;
8
9 my $env = db_env_create;
10
11 mkdir "bdtest", 0700;
12 db_env_open
13 $env,
14 "bdtest",
15 BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16 | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17 0600;
18
19 $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20
21 my $db = db_create $env;
22 db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23 | BDB::READ_UNCOMMITTED, 0600;
24 db_put $db, undef, "key", "data", 0, sub {
25 db_del $db, undef, "key";
26 };
27 db_sync $db;
28
29 # automatic result processing with AnyEvent:
30 our $FH; open $FH, "<&=" . BDB::poll_fileno;
31 our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
32
33 # automatic result processing with EV:
34 my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
35
36 # with Glib:
37 add_watch Glib::IO BDB::poll_fileno,
38 in => sub { BDB::poll_cb; 1 };
39
40 # or simply flush manually
41 BDB::flush;
42
8 43
9=head1 DESCRIPTION 44=head1 DESCRIPTION
10 45
11See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>). 46See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
12The BDB API is very similar to the C API (the translation has been very faithful). 47The BDB API is very similar to the C API (the translation has been very faithful).
72use strict 'vars'; 107use strict 'vars';
73 108
74use base 'Exporter'; 109use base 'Exporter';
75 110
76BEGIN { 111BEGIN {
77 our $VERSION = '1.0'; 112 our $VERSION = '1.3';
78 113
79 our @BDB_REQ = qw( 114 our @BDB_REQ = qw(
80 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect 115 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
81 db_env_memp_sync db_env_memp_trickle 116 db_env_memp_sync db_env_memp_trickle
82 db_open db_close db_compact db_sync db_put db_get db_pget db_del db_key_range 117 db_open db_close db_compact db_sync db_put db_get db_pget db_del db_key_range
83 db_txn_commit db_txn_abort 118 db_txn_commit db_txn_abort db_txn_finish
84 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del 119 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
85 db_sequence_open db_sequence_close 120 db_sequence_open db_sequence_close
86 db_sequence_get db_sequence_remove 121 db_sequence_get db_sequence_remove
87 ); 122 );
88 our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create)); 123 our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
99 134
100=head2 BERKELEYDB FUNCTIONS 135=head2 BERKELEYDB FUNCTIONS
101 136
102All of these are functions. The create functions simply return a new 137All of these are functions. The create functions simply return a new
103object and never block. All the remaining functions all take an optional 138object and never block. All the remaining functions all take an optional
104callback as last argument. If it is missing, then the fucntion will be 139callback as last argument that gets passed the return value. If it is
105executed synchronously. 140missing, then the function will be executed synchronously, and the return
141value is returned as normally.
106 142
107BDB functions that cannot block (mostly functions that manipulate 143BDB functions that cannot block (mostly functions that manipulate
108settings) are method calls on the relevant objects, so the rule of thumb 144settings) are method calls on the relevant objects, so the rule of thumb
109is: if its a method, its not blocking, if its a function, it takes a 145is: if its a method, its not blocking, if its a function, it takes a
110callback as last argument. 146callback as last argument.
177 db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef) 213 db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
178 flags: TXN_NOSYNC 214 flags: TXN_NOSYNC
179 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef) 215 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
180 flags: TXN_NOSYNC 216 flags: TXN_NOSYNC
181 217
218=head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
219
220This is not actually a Berkeley DB function but a BDB module
221extension. The background for this exytension is: It is very annoying to
222have to check every single BDB function for error returns and provide a
223codepath out of your transaction. While the BDB module still makes this
224possible, it contains the following extensions:
225
226When a transaction-protected function returns any operating system
227error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
228transaction. This flag is also set by Berkeley DB functions themselves
229when an operation fails with LOCK_DEADLOCK, and it causes all further
230operations on that transaction (including C<db_txn_commit>) to fail.
231
232The C<db_txn_finish> request will look at this flag, and, if it is set,
233will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
234if it isn't set to something else yet). If it isn't set, it will call
235C<db_txn_commit> and return the error normally.
236
237How to use this? Easy: just write your transaction normally:
238
239 my $txn = $db_env->txn_begin;
240 db_get $db, $txn, "key", my $data;
241 db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
242 db_txn_finish $txn;
243 die "transaction failed" if $!;
244
245That is, handle only the expected errors. If something unexpected happens
246(EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
247requests (db_put in this case) will simply be skipped (they will fail with
248LOCK_DEADLOCK) and the transaction will be aborted.
249
250You can use the C<< $txn->failed >> method to check wether a transaction
251has failed in this way and abort further processing (excluding
252C<db_txn_finish>).
253
182=head3 DB_ENV/database environment methods 254=head3 DB_ENV/database environment methods
183 255
184Methods available on DB_ENV/$env handles: 256Methods available on DB_ENV/$env handles:
185 257
186 DESTROY (DB_ENV_ornull *env) 258 DESTROY (DB_ENV_ornull *env)
196 $int = $env->set_flags (U32 flags, int onoff) 268 $int = $env->set_flags (U32 flags, int onoff)
197 $env->set_errfile (FILE *errfile = 0) 269 $env->set_errfile (FILE *errfile = 0)
198 $env->set_msgfile (FILE *msgfile = 0) 270 $env->set_msgfile (FILE *msgfile = 0)
199 $int = $env->set_verbose (U32 which, int onoff = 1) 271 $int = $env->set_verbose (U32 which, int onoff = 1)
200 $int = $env->set_encrypt (const char *password, U32 flags = 0) 272 $int = $env->set_encrypt (const char *password, U32 flags = 0)
201 $int = $env->set_timeout (NV timeout, U32 flags) 273 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
202 $int = $env->set_mp_max_openfd (int maxopenfd); 274 $int = $env->set_mp_max_openfd (int maxopenfd);
203 $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep); 275 $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
204 $int = $env->set_mp_mmapsize (int mmapsize_mb) 276 $int = $env->set_mp_mmapsize (int mmapsize_mb)
205 $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT) 277 $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
206 $int = $env->set_lk_max_lockers (U32 max) 278 $int = $env->set_lk_max_lockers (U32 max)
207 $int = $env->set_lk_max_locks (U32 max) 279 $int = $env->set_lk_max_locks (U32 max)
208 $int = $env->set_lk_max_objects (U32 max) 280 $int = $env->set_lk_max_objects (U32 max)
209 $int = $env->set_lg_bsize (U32 max) 281 $int = $env->set_lg_bsize (U32 max)
210 $int = $env->set_lg_max (U32 max) 282 $int = $env->set_lg_max (U32 max)
283 $int = $env->mutex_set_increment (U32 increment)
284 $int = $env->mutex_set_tas_spins (U32 tas_spins)
285 $int = $env->mutex_set_max (U32 max)
286 $int = $env->mutex_set_align (U32 align)
211 287
212 $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0) 288 $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
213 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC 289 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
214 290
215=head4 Example: 291=head4 Example:
294 DESTROY (DB_TXN_ornull *txn) 370 DESTROY (DB_TXN_ornull *txn)
295 CODE: 371 CODE:
296 if (txn) 372 if (txn)
297 txn->abort (txn); 373 txn->abort (txn);
298 374
299 $int = $txn->set_timeout (NV timeout, U32 flags) 375 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
300 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT 376 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
377
378 $bool = $txn->failed
379 # see db_txn_finish documentation, above
301 380
302 381
303=head3 DBC/cursor methods 382=head3 DBC/cursor methods
304 383
305Methods available on DBC/$dbc handles: 384Methods available on DBC/$dbc handles:
396interactiveness when perl is not fast enough to process all requests in 475interactiveness when perl is not fast enough to process all requests in
397time. 476time.
398 477
399For interactive programs, values such as C<0.01> to C<0.1> should be fine. 478For interactive programs, values such as C<0.01> to C<0.1> should be fine.
400 479
401Example: Install an Event watcher that automatically calls 480Example: Install an EV watcher that automatically calls
402BDB::poll_cb with low priority, to ensure that other parts of the 481BDB::poll_cb with low priority, to ensure that other parts of the
403program get the CPU sometimes even under high AIO load. 482program get the CPU sometimes even under high load.
404 483
405 # try not to spend much more than 0.1s in poll_cb 484 # try not to spend much more than 0.1s in poll_cb
406 BDB::max_poll_time 0.1; 485 BDB::max_poll_time 0.1;
407 486
408 # use a low priority so other tasks have priority 487 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
409 Event->io (fd => BDB::poll_fileno,
410 poll => 'r', nice => 1,
411 cb => &BDB::poll_cb);
412 488
413=item BDB::poll_wait 489=item BDB::poll_wait
414 490
415If there are any outstanding requests and none of them in the result 491If there are any outstanding requests and none of them in the result
416phase, wait till the result filehandle becomes ready for reading (simply 492phase, wait till the result filehandle becomes ready for reading (simply
428 504
429 BDB::poll_wait, BDB::poll_cb 505 BDB::poll_wait, BDB::poll_cb
430 506
431=item BDB::flush 507=item BDB::flush
432 508
433Wait till all outstanding AIO requests have been handled. 509Wait till all outstanding BDB requests have been handled.
434 510
435Strictly equivalent to: 511Strictly equivalent to:
436 512
437 BDB::poll_wait, BDB::poll_cb 513 BDB::poll_wait, BDB::poll_cb
438 while BDB::nreqs; 514 while BDB::nreqs;
443 519
444=over 4 520=over 4
445 521
446=item BDB::min_parallel $nthreads 522=item BDB::min_parallel $nthreads
447 523
448Set the minimum number of AIO threads to C<$nthreads>. The current 524Set the minimum number of BDB threads to C<$nthreads>. The current
449default is C<8>, which means eight asynchronous operations can execute 525default is C<8>, which means eight asynchronous operations can execute
450concurrently at any one time (the number of outstanding requests, 526concurrently at any one time (the number of outstanding requests,
451however, is unlimited). 527however, is unlimited).
452 528
453BDB starts threads only on demand, when an AIO request is queued and 529BDB starts threads only on demand, when an BDB request is queued and
454no free thread exists. Please note that queueing up a hundred requests can 530no free thread exists. Please note that queueing up a hundred requests can
455create demand for a hundred threads, even if it turns out that everything 531create demand for a hundred threads, even if it turns out that everything
456is in the cache and could have been processed faster by a single thread. 532is in the cache and could have been processed faster by a single thread.
457 533
458It is recommended to keep the number of threads relatively low, as some 534It is recommended to keep the number of threads relatively low, as some
463Under most circumstances you don't need to call this function, as the 539Under most circumstances you don't need to call this function, as the
464module selects a default that is suitable for low to moderate load. 540module selects a default that is suitable for low to moderate load.
465 541
466=item BDB::max_parallel $nthreads 542=item BDB::max_parallel $nthreads
467 543
468Sets the maximum number of AIO threads to C<$nthreads>. If more than the 544Sets the maximum number of BDB threads to C<$nthreads>. If more than the
469specified number of threads are currently running, this function kills 545specified number of threads are currently running, this function kills
470them. This function blocks until the limit is reached. 546them. This function blocks until the limit is reached.
471 547
472While C<$nthreads> are zero, aio requests get queued but not executed 548While C<$nthreads> are zero, aio requests get queued but not executed
473until the number of threads has been increased again. 549until the number of threads has been increased again.
576 652
577=head2 FORK BEHAVIOUR 653=head2 FORK BEHAVIOUR
578 654
579This module should do "the right thing" when the process using it forks: 655This module should do "the right thing" when the process using it forks:
580 656
581Before the fork, IO::AIO enters a quiescent state where no requests 657Before the fork, BDB enters a quiescent state where no requests
582can be added in other threads and no results will be processed. After 658can be added in other threads and no results will be processed. After
583the fork the parent simply leaves the quiescent state and continues 659the fork the parent simply leaves the quiescent state and continues
584request/result processing, while the child frees the request/result queue 660request/result processing, while the child frees the request/result queue
585(so that the requests started before the fork will only be handled in the 661(so that the requests started before the fork will only be handled in the
586parent). Threads will be started on demand until the limit set in the 662parent). Threads will be started on demand until the limit set in the
587parent process has been reached again. 663parent process has been reached again.
588 664
589In short: the parent will, after a short pause, continue as if fork had 665In short: the parent will, after a short pause, continue as if fork had
590not been called, while the child will act as if IO::AIO has not been used 666not been called, while the child will act as if BDB has not been used
591yet. 667yet.
592 668
593=head2 MEMORY USAGE 669=head2 MEMORY USAGE
594 670
595Per-request usage: 671Per-request usage:
609temporary buffers, and each thread requires a stack and other data 685temporary buffers, and each thread requires a stack and other data
610structures (usually around 16k-128k, depending on the OS). 686structures (usually around 16k-128k, depending on the OS).
611 687
612=head1 KNOWN BUGS 688=head1 KNOWN BUGS
613 689
614Known bugs will be fixed in the next release. 690Known bugs will be fixed in the next release, except:
691
692 If you use a transaction in any request, and the request returns
693 with an operating system error or DB_LOCK_NOTGRANTED, the internal
694 TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
695 above.
615 696
616=head1 SEE ALSO 697=head1 SEE ALSO
617 698
618L<Coro::AIO>. 699L<Coro::BDB>, L<IO::AIO>.
619 700
620=head1 AUTHOR 701=head1 AUTHOR
621 702
622 Marc Lehmann <schmorp@schmorp.de> 703 Marc Lehmann <schmorp@schmorp.de>
623 http://home.schmorp.de/ 704 http://home.schmorp.de/

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines