ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
(Generate patch)

Comparing BDB/BDB.pm (file contents):
Revision 1.15 by root, Thu Sep 13 21:34:00 2007 UTC vs.
Revision 1.34 by root, Sun Mar 30 06:17:31 2008 UTC

3BDB - Asynchronous Berkeley DB access 3BDB - Asynchronous Berkeley DB access
4 4
5=head1 SYNOPSIS 5=head1 SYNOPSIS
6 6
7 use BDB; 7 use BDB;
8
9 my $env = db_env_create;
10
11 mkdir "bdtest", 0700;
12 db_env_open
13 $env,
14 "bdtest",
15 BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL
16 | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
17 0600;
18
19 $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
20
21 my $db = db_create $env;
22 db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE
23 | BDB::READ_UNCOMMITTED, 0600;
24 db_put $db, undef, "key", "data", 0, sub {
25 db_del $db, undef, "key";
26 };
27 db_sync $db;
28
29 # when you also use Coro, management is easy:
30 use Coro::BDB;
31
32 # automatic result processing with AnyEvent:
33 our $FH; open $FH, "<&=" . BDB::poll_fileno;
34 our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
35
36 # automatic result processing with EV:
37 my $WATCHER = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb;
38
39 # with Glib:
40 add_watch Glib::IO BDB::poll_fileno,
41 in => sub { BDB::poll_cb; 1 };
42
43 # or simply flush manually
44 BDB::flush;
45
8 46
9=head1 DESCRIPTION 47=head1 DESCRIPTION
10 48
11See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>). 49See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
12The BDB API is very similar to the C API (the translation has been very faithful). 50The BDB API is very similar to the C API (the translation has been very faithful).
72use strict 'vars'; 110use strict 'vars';
73 111
74use base 'Exporter'; 112use base 'Exporter';
75 113
76BEGIN { 114BEGIN {
77 our $VERSION = '1.1'; 115 our $VERSION = '1.44';
78 116
79 our @BDB_REQ = qw( 117 our @BDB_REQ = qw(
80 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect 118 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
81 db_env_memp_sync db_env_memp_trickle 119 db_env_memp_sync db_env_memp_trickle
82 db_open db_close db_compact db_sync db_put db_get db_pget db_del db_key_range 120 db_open db_close db_compact db_sync db_upgrade
121 db_put db_get db_pget db_del db_key_range
83 db_txn_commit db_txn_abort db_txn_finish 122 db_txn_commit db_txn_abort db_txn_finish
84 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del 123 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
85 db_sequence_open db_sequence_close 124 db_sequence_open db_sequence_close
86 db_sequence_get db_sequence_remove 125 db_sequence_get db_sequence_remove
87 ); 126 );
95 134
96 require XSLoader; 135 require XSLoader;
97 XSLoader::load ("BDB", $VERSION); 136 XSLoader::load ("BDB", $VERSION);
98} 137}
99 138
139=head2 FILENAMES/DATABASE NAMES
140
141The BDB expects "binary" filenames (octet strings) for pathnames on POSIX
142systems, and "unicode" filenames (strings with characters potentially
143>255) on Win32 (expecting a Unicode win32 build - win32 is a total mess).
144
100=head2 BERKELEYDB FUNCTIONS 145=head2 BERKELEYDB FUNCTIONS
101 146
102All of these are functions. The create functions simply return a new 147All of these are functions. The create functions simply return a new
103object and never block. All the remaining functions all take an optional 148object and never block. All the remaining functions take an optional
104callback as last argument. If it is missing, then the fucntion will be 149callback as last argument. If it is missing, then the function will be
105executed synchronously. 150executed synchronously. In both cases, C<$!> will reflect the return value
151of the function.
106 152
107BDB functions that cannot block (mostly functions that manipulate 153BDB functions that cannot block (mostly functions that manipulate
108settings) are method calls on the relevant objects, so the rule of thumb 154settings) are method calls on the relevant objects, so the rule of thumb
109is: if its a method, its not blocking, if its a function, it takes a 155is: if it's a method, it's not blocking, if it's a function, it takes a
110callback as last argument. 156callback as last argument.
111 157
112In the following, C<$int> signifies an integer return value, 158In the following, C<$int> signifies an integer return value,
113C<octetstring> is a "binary string" (i.e. a perl string with no character 159C<octetstring> is a "binary string" (i.e. a perl string with no character
114indices >255), C<U32> is an unsigned 32 bit integer, C<int> is some 160indices >255), C<U32> is an unsigned 32 bit integer, C<int> is some
145 191
146 db_open (DB *db, DB_TXN_ornull *txnid, octetstring file, octetstring database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef) 192 db_open (DB *db, DB_TXN_ornull *txnid, octetstring file, octetstring database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
147 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE 193 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
148 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef) 194 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
149 flags: DB_NOSYNC 195 flags: DB_NOSYNC
196 db_upgrade (DB *db, octetstring file, U32 flags = 0, SV *callback = &PL_sv_undef)
150 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef) 197 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef)
151 flags: FREELIST_ONLY FREE_SPACE 198 flags: FREELIST_ONLY FREE_SPACE
152 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef) 199 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
153 db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef) 200 db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef)
154 db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef) 201 db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
179 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef) 226 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
180 flags: TXN_NOSYNC 227 flags: TXN_NOSYNC
181 228
182=head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef) 229=head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
183 230
184This is not a Berkeley DB function but a BDB module extension. It is very 231This is not actually a Berkeley DB function but a BDB module
232extension. The background for this exytension is: It is very annoying to
185annoying to have to check every single BDB function for error returns and 233have to check every single BDB function for error returns and provide a
186provide a codepath out of your transaction. While the BDB module still 234codepath out of your transaction. While the BDB module still makes this
187makes this possible, it contains the following extensions: 235possible, it contains the following extensions:
188 236
189When a transaction-protected function returns any operating system 237When a transaction-protected function returns any operating system
190error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the 238error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
191transaction. This flag is also set by Berkeley DB functions externally 239transaction. This flag is also set by Berkeley DB functions themselves
192when an operation fails with LOCK_DEADLOCK, and it causes all further 240when an operation fails with LOCK_DEADLOCK, and it causes all further
193operations on that transaction (including C<db_txn_commit>) to fail. 241operations on that transaction (including C<db_txn_commit>) to fail.
194 242
195The C<db_txn_finish> request will look at this flag, and, if it is set, 243The C<db_txn_finish> request will look at this flag, and, if it is set,
196will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK> 244will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
197if it isn't set). If it isn't set, it will call C<db_txn_commit> and 245if it isn't set to something else yet). If it isn't set, it will call
198return the error normally. 246C<db_txn_commit> and return the error normally.
199 247
200How to use this? Easy: just write your transaction normally: 248How to use this? Easy: just write your transaction normally:
201 249
202 my $txn = $db_env->txn_begin; 250 my $txn = $db_env->txn_begin;
203 db_get $db, $txn, "key", my $data; 251 db_get $db, $txn, "key", my $data;
208That is, handle only the expected errors. If something unexpected happens 256That is, handle only the expected errors. If something unexpected happens
209(EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining 257(EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
210requests (db_put in this case) will simply be skipped (they will fail with 258requests (db_put in this case) will simply be skipped (they will fail with
211LOCK_DEADLOCK) and the transaction will be aborted. 259LOCK_DEADLOCK) and the transaction will be aborted.
212 260
213You cna use the C<< $txn->failed >> method to check wether a transaction 261You can use the C<< $txn->failed >> method to check wether a transaction
214has failed in this way and abort further processing (excluding 262has failed in this way and abort further processing (excluding
215C<db_txn_finish>). 263C<db_txn_finish>).
216 264
217=head3 DB_ENV/database environment methods 265=head3 DB_ENV/database environment methods
218 266
241 $int = $env->set_lk_max_lockers (U32 max) 289 $int = $env->set_lk_max_lockers (U32 max)
242 $int = $env->set_lk_max_locks (U32 max) 290 $int = $env->set_lk_max_locks (U32 max)
243 $int = $env->set_lk_max_objects (U32 max) 291 $int = $env->set_lk_max_objects (U32 max)
244 $int = $env->set_lg_bsize (U32 max) 292 $int = $env->set_lg_bsize (U32 max)
245 $int = $env->set_lg_max (U32 max) 293 $int = $env->set_lg_max (U32 max)
294 $int = $env->mutex_set_increment (U32 increment)
295 $int = $env->mutex_set_tas_spins (U32 tas_spins)
296 $int = $env->mutex_set_max (U32 max)
297 $int = $env->mutex_set_align (U32 align)
246 298
247 $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0) 299 $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
248 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC 300 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
249 301
250=head4 Example: 302=head4 Example:
345 DESTROY (DBC_ornull *dbc) 397 DESTROY (DBC_ornull *dbc)
346 CODE: 398 CODE:
347 if (dbc) 399 if (dbc)
348 dbc->c_close (dbc); 400 dbc->c_close (dbc);
349 401
402 $int = $cursor->set_priority ($priority = PRIORITY_*)
403
350=head4 Example: 404=head4 Example:
351 405
352 my $c = $db->cursor; 406 my $c = $db->cursor;
353 407
354 for (;;) { 408 for (;;) {
387 441
388=head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION 442=head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION
389 443
390=over 4 444=over 4
391 445
446=item $msg = BDB::strerror [$errno]
447
448Returns the string corresponding to the given errno value. If no argument
449is given, use C<$!>.
450
451Note that the BDB module also patches the C<$!> variable directly, so you
452should be able to get a bdb error string by simply stringifying C<$!>.
453
392=item $fileno = BDB::poll_fileno 454=item $fileno = BDB::poll_fileno
393 455
394Return the I<request result pipe file descriptor>. This filehandle must be 456Return the I<request result pipe file descriptor>. This filehandle must be
395polled for reading by some mechanism outside this module (e.g. Event or 457polled for reading by some mechanism outside this module (e.g. Event or
396select, see below or the SYNOPSIS). If the pipe becomes readable you have 458select, see below or the SYNOPSIS). If the pipe becomes readable you have
434interactiveness when perl is not fast enough to process all requests in 496interactiveness when perl is not fast enough to process all requests in
435time. 497time.
436 498
437For interactive programs, values such as C<0.01> to C<0.1> should be fine. 499For interactive programs, values such as C<0.01> to C<0.1> should be fine.
438 500
439Example: Install an Event watcher that automatically calls 501Example: Install an EV watcher that automatically calls
440BDB::poll_cb with low priority, to ensure that other parts of the 502BDB::poll_cb with low priority, to ensure that other parts of the
441program get the CPU sometimes even under high AIO load. 503program get the CPU sometimes even under high load.
442 504
443 # try not to spend much more than 0.1s in poll_cb 505 # try not to spend much more than 0.1s in poll_cb
444 BDB::max_poll_time 0.1; 506 BDB::max_poll_time 0.1;
445 507
446 # use a low priority so other tasks have priority 508 my $bdb_poll = EV::io BDB::poll_fileno, EV::READ, \&BDB::poll_cb);
447 Event->io (fd => BDB::poll_fileno,
448 poll => 'r', nice => 1,
449 cb => &BDB::poll_cb);
450 509
451=item BDB::poll_wait 510=item BDB::poll_wait
452 511
453If there are any outstanding requests and none of them in the result 512If there are any outstanding requests and none of them in the result
454phase, wait till the result filehandle becomes ready for reading (simply 513phase, wait till the result filehandle becomes ready for reading (simply
466 525
467 BDB::poll_wait, BDB::poll_cb 526 BDB::poll_wait, BDB::poll_cb
468 527
469=item BDB::flush 528=item BDB::flush
470 529
471Wait till all outstanding AIO requests have been handled. 530Wait till all outstanding BDB requests have been handled.
472 531
473Strictly equivalent to: 532Strictly equivalent to:
474 533
475 BDB::poll_wait, BDB::poll_cb 534 BDB::poll_wait, BDB::poll_cb
476 while BDB::nreqs; 535 while BDB::nreqs;
481 540
482=over 4 541=over 4
483 542
484=item BDB::min_parallel $nthreads 543=item BDB::min_parallel $nthreads
485 544
486Set the minimum number of AIO threads to C<$nthreads>. The current 545Set the minimum number of BDB threads to C<$nthreads>. The current
487default is C<8>, which means eight asynchronous operations can execute 546default is C<8>, which means eight asynchronous operations can execute
488concurrently at any one time (the number of outstanding requests, 547concurrently at any one time (the number of outstanding requests,
489however, is unlimited). 548however, is unlimited).
490 549
491BDB starts threads only on demand, when an AIO request is queued and 550BDB starts threads only on demand, when an BDB request is queued and
492no free thread exists. Please note that queueing up a hundred requests can 551no free thread exists. Please note that queueing up a hundred requests can
493create demand for a hundred threads, even if it turns out that everything 552create demand for a hundred threads, even if it turns out that everything
494is in the cache and could have been processed faster by a single thread. 553is in the cache and could have been processed faster by a single thread.
495 554
496It is recommended to keep the number of threads relatively low, as some 555It is recommended to keep the number of threads relatively low, as some
501Under most circumstances you don't need to call this function, as the 560Under most circumstances you don't need to call this function, as the
502module selects a default that is suitable for low to moderate load. 561module selects a default that is suitable for low to moderate load.
503 562
504=item BDB::max_parallel $nthreads 563=item BDB::max_parallel $nthreads
505 564
506Sets the maximum number of AIO threads to C<$nthreads>. If more than the 565Sets the maximum number of BDB threads to C<$nthreads>. If more than the
507specified number of threads are currently running, this function kills 566specified number of threads are currently running, this function kills
508them. This function blocks until the limit is reached. 567them. This function blocks until the limit is reached.
509 568
510While C<$nthreads> are zero, aio requests get queued but not executed 569While C<$nthreads> are zero, aio requests get queued but not executed
511until the number of threads has been increased again. 570until the number of threads has been increased again.
614 673
615=head2 FORK BEHAVIOUR 674=head2 FORK BEHAVIOUR
616 675
617This module should do "the right thing" when the process using it forks: 676This module should do "the right thing" when the process using it forks:
618 677
619Before the fork, IO::AIO enters a quiescent state where no requests 678Before the fork, BDB enters a quiescent state where no requests
620can be added in other threads and no results will be processed. After 679can be added in other threads and no results will be processed. After
621the fork the parent simply leaves the quiescent state and continues 680the fork the parent simply leaves the quiescent state and continues
622request/result processing, while the child frees the request/result queue 681request/result processing, while the child frees the request/result queue
623(so that the requests started before the fork will only be handled in the 682(so that the requests started before the fork will only be handled in the
624parent). Threads will be started on demand until the limit set in the 683parent). Threads will be started on demand until the limit set in the
625parent process has been reached again. 684parent process has been reached again.
626 685
627In short: the parent will, after a short pause, continue as if fork had 686In short: the parent will, after a short pause, continue as if fork had
628not been called, while the child will act as if IO::AIO has not been used 687not been called, while the child will act as if BDB has not been used
629yet. 688yet.
689
690Win32 note: there is no fork on win32, and perls emulation of it is too
691broken to be supported, so do not use BDB in a windows pseudo-fork, better
692yet, switch to a more capable platform.
630 693
631=head2 MEMORY USAGE 694=head2 MEMORY USAGE
632 695
633Per-request usage: 696Per-request usage:
634 697
656 TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>, 719 TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
657 above. 720 above.
658 721
659=head1 SEE ALSO 722=head1 SEE ALSO
660 723
661L<Coro::AIO>. 724L<Coro::BDB>, L<IO::AIO>.
662 725
663=head1 AUTHOR 726=head1 AUTHOR
664 727
665 Marc Lehmann <schmorp@schmorp.de> 728 Marc Lehmann <schmorp@schmorp.de>
666 http://home.schmorp.de/ 729 http://home.schmorp.de/

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines