ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/BDB/BDB.pm
(Generate patch)

Comparing BDB/BDB.pm (file contents):
Revision 1.45 by root, Wed Jul 9 21:16:14 2008 UTC vs.
Revision 1.53 by root, Tue Oct 21 02:21:25 2008 UTC

108no warnings; 108no warnings;
109use strict 'vars'; 109use strict 'vars';
110 110
111use base 'Exporter'; 111use base 'Exporter';
112 112
113our $VERSION;
114
113BEGIN { 115BEGIN {
114 our $VERSION = '1.7'; 116 $VERSION = '1.81';
115 117
116 our @BDB_REQ = qw( 118 our @BDB_REQ = qw(
117 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect 119 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
118 db_env_memp_sync db_env_memp_trickle db_env_dbrename db_env_dbremove 120 db_env_memp_sync db_env_memp_trickle db_env_dbrename db_env_dbremove
121 db_env_log_archive
119 db_open db_close db_compact db_sync db_upgrade 122 db_open db_close db_compact db_sync db_upgrade
120 db_put db_exists db_get db_pget db_del db_key_range 123 db_put db_exists db_get db_pget db_del db_key_range
121 db_txn_commit db_txn_abort db_txn_finish 124 db_txn_commit db_txn_abort db_txn_finish
122 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del 125 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
123 db_sequence_open db_sequence_close 126 db_sequence_open db_sequence_close
133 136
134 require XSLoader; 137 require XSLoader;
135 XSLoader::load ("BDB", $VERSION); 138 XSLoader::load ("BDB", $VERSION);
136} 139}
137 140
138=head2 WIN32 FILENAMES/DATABASE NAME MESS
139
140Perl on Win32 supports only ASCII filenames (the reason is that it abuses
141an internal flag to store wether a filename is Unicode or ANSI, but that
142flag is used for somethign else in the perl core, so there is no way to
143detect wether a filename is ANSI or Unicode-encoded). The BDB module
144tries to work around this issue by assuming that the filename is an ANSI
145filename and BDB was built for unicode support.
146
147=head2 BERKELEYDB FUNCTIONS 141=head1 BERKELEYDB FUNCTIONS
148 142
149All of these are functions. The create functions simply return a new 143All of these are functions. The create functions simply return a new
150object and never block. All the remaining functions take an optional 144object and never block. All the remaining functions take an optional
151callback as last argument. If it is missing, then the function will be 145callback as last argument. If it is missing, then the function will be
152executed synchronously. In both cases, C<$!> will reflect the return value 146executed synchronously. In both cases, C<$!> will reflect the return value
191 185
192 db_env_txn_checkpoint $db_env, 0, 0, 0, undef; 186 db_env_txn_checkpoint $db_env, 0, 0, 0, undef;
193 db_env_txn_checkpoint $db_env, 0, 0, 0; 187 db_env_txn_checkpoint $db_env, 0, 0, 0;
194 db_env_txn_checkpoint $db_env, 0; 188 db_env_txn_checkpoint $db_env, 0;
195 189
196=head3 BDB functions 190=head2 BDB functions
197 191
198Functions in the BDB namespace, exported by default: 192Functions in the BDB namespace, exported by default:
199 193
200 $env = db_env_create (U32 env_flags = 0) 194 $env = db_env_create (U32 env_flags = 0)
201 flags: RPCCLIENT 195 flags: RPCCLIENT
209 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST 203 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST
210 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = &PL_sv_undef) 204 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = &PL_sv_undef)
211 db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = &PL_sv_undef) 205 db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = &PL_sv_undef)
212 db_env_dbremove (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, U32 flags = 0, SV *callback = &PL_sv_undef) 206 db_env_dbremove (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, U32 flags = 0, SV *callback = &PL_sv_undef)
213 db_env_dbrename (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, bdb_filename newname, U32 flags = 0, SV *callback = &PL_sv_undef) 207 db_env_dbrename (DB_ENV *env, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, bdb_filename newname, U32 flags = 0, SV *callback = &PL_sv_undef)
208 db_env_log_archive (DB_ENV *env, SV *listp, U32 flags = 0, SV *callback = &PL_sv_undef)
214 209
215 $db = db_create (DB_ENV *env = 0, U32 flags = 0) 210 $db = db_create (DB_ENV *env = 0, U32 flags = 0)
216 flags: XA_CREATE 211 flags: XA_CREATE
217 212
218 db_open (DB *db, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef) 213 db_open (DB *db, DB_TXN_ornull *txnid, bdb_filename file, bdb_filename database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
251 db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef) 246 db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
252 flags: TXN_NOSYNC 247 flags: TXN_NOSYNC
253 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef) 248 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
254 flags: TXN_NOSYNC 249 flags: TXN_NOSYNC
255 250
256=head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef) 251=head3 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
257 252
258This is not actually a Berkeley DB function but a BDB module 253This is not actually a Berkeley DB function but a BDB module
259extension. The background for this exytension is: It is very annoying to 254extension. The background for this exytension is: It is very annoying to
260have to check every single BDB function for error returns and provide a 255have to check every single BDB function for error returns and provide a
261codepath out of your transaction. While the BDB module still makes this 256codepath out of your transaction. While the BDB module still makes this
287 282
288You can use the C<< $txn->failed >> method to check wether a transaction 283You can use the C<< $txn->failed >> method to check wether a transaction
289has failed in this way and abort further processing (excluding 284has failed in this way and abort further processing (excluding
290C<db_txn_finish>). 285C<db_txn_finish>).
291 286
292=head3 DB_ENV/database environment methods 287=head2 DB_ENV/database environment methods
293 288
294Methods available on DB_ENV/$env handles: 289Methods available on DB_ENV/$env handles:
295 290
296 DESTROY (DB_ENV_ornull *env) 291 DESTROY (DB_ENV_ornull *env)
297 CODE: 292 CODE:
327 322
328 $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0) 323 $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
329 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC 324 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
330 $txn = $env->cdsgroup_begin; (v4.5) 325 $txn = $env->cdsgroup_begin; (v4.5)
331 326
332=head4 Example: 327=head3 Example:
333 328
334 use AnyEvent; 329 use AnyEvent;
335 use BDB; 330 use BDB;
336 331
337 our $FH; open $FH, "<&=" . BDB::poll_fileno; 332 our $FH; open $FH, "<&=" . BDB::poll_fileno;
349 0600; 344 0600;
350 345
351 $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1); 346 $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
352 347
353 348
354=head3 DB/database methods 349=head2 DB/database methods
355 350
356Methods available on DB/$db handles: 351Methods available on DB/$db handles:
357 352
358 DESTROY (DB_ornull *db) 353 DESTROY (DB_ornull *db)
359 CODE: 354 CODE:
385 380
386 $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0) 381 $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0)
387 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT 382 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT
388 $seq = $db->sequence (U32 flags = 0) 383 $seq = $db->sequence (U32 flags = 0)
389 384
390=head4 Example: 385=head3 Example:
391 386
392 my $db = db_create $env; 387 my $db = db_create $env;
393 db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600; 388 db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600;
394 389
395 for (1..1000) { 390 for (1..1000) {
402 db_del $db, undef, "key $_" for 1..1000; 397 db_del $db, undef, "key $_" for 1..1000;
403 398
404 db_sync $db; 399 db_sync $db;
405 400
406 401
407=head3 DB_TXN/transaction methods 402=head2 DB_TXN/transaction methods
408 403
409Methods available on DB_TXN/$txn handles: 404Methods available on DB_TXN/$txn handles:
410 405
411 DESTROY (DB_TXN_ornull *txn) 406 DESTROY (DB_TXN_ornull *txn)
412 CODE: 407 CODE:
418 413
419 $bool = $txn->failed 414 $bool = $txn->failed
420 # see db_txn_finish documentation, above 415 # see db_txn_finish documentation, above
421 416
422 417
423=head3 DBC/cursor methods 418=head2 DBC/cursor methods
424 419
425Methods available on DBC/$dbc handles: 420Methods available on DBC/$dbc handles:
426 421
427 DESTROY (DBC_ornull *dbc) 422 DESTROY (DBC_ornull *dbc)
428 CODE: 423 CODE:
429 if (dbc) 424 if (dbc)
430 dbc->c_close (dbc); 425 dbc->c_close (dbc);
431 426
432 $int = $cursor->set_priority ($priority = PRIORITY_*) (v4.6) 427 $int = $cursor->set_priority ($priority = PRIORITY_*) (v4.6)
433 428
434=head4 Example: 429=head3 Example:
435 430
436 my $c = $db->cursor; 431 my $c = $db->cursor;
437 432
438 for (;;) { 433 for (;;) {
439 db_c_get $c, my $key, my $data, BDB::NEXT; 434 db_c_get $c, my $key, my $data, BDB::NEXT;
442 } 437 }
443 438
444 db_c_close $c; 439 db_c_close $c;
445 440
446 441
447=head3 DB_SEQUENCE/sequence methods 442=head2 DB_SEQUENCE/sequence methods
448 443
449Methods available on DB_SEQUENCE/$seq handles: 444Methods available on DB_SEQUENCE/$seq handles:
450 445
451 DESTROY (DB_SEQUENCE_ornull *seq) 446 DESTROY (DB_SEQUENCE_ornull *seq)
452 CODE: 447 CODE:
457 $int = $seq->set_cachesize (U32 size) 452 $int = $seq->set_cachesize (U32 size)
458 $int = $seq->set_flags (U32 flags) 453 $int = $seq->set_flags (U32 flags)
459 flags: SEQ_DEC SEQ_INC SEQ_WRAP 454 flags: SEQ_DEC SEQ_INC SEQ_WRAP
460 $int = $seq->set_range (db_seq_t min, db_seq_t max) 455 $int = $seq->set_range (db_seq_t min, db_seq_t max)
461 456
462=head4 Example: 457=head3 Example:
463 458
464 my $seq = $db->sequence; 459 my $seq = $db->sequence;
465 460
466 db_sequence_open $seq, undef, "seq", BDB::CREATE; 461 db_sequence_open $seq, undef, "seq", BDB::CREATE;
467 db_sequence_get $seq, undef, 1, my $value; 462 db_sequence_get $seq, undef, 1, my $value;
468 463
469 464
470=head2 SUPPORT FUNCTIONS 465=head1 SUPPORT FUNCTIONS
471 466
472=head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION 467=head2 EVENT PROCESSING AND EVENT LOOP INTEGRATION
473 468
474=over 4 469=over 4
475 470
476=item $msg = BDB::strerror [$errno] 471=item $msg = BDB::strerror [$errno]
477 472
564 BDB::poll_wait, BDB::poll_cb 559 BDB::poll_wait, BDB::poll_cb
565 while BDB::nreqs; 560 while BDB::nreqs;
566 561
567=back 562=back
568 563
569=head3 VERSION CHECKING 564=head2 VERSION CHECKING
570 565
571BerkeleyDB comes in various versions, many of them have minor 566BerkeleyDB comes in various versions, many of them have minor
572incompatibilities. This means that traditional "at least version x.x" 567incompatibilities. This means that traditional "at least version x.x"
573checks are often not sufficient. 568checks are often not sufficient.
574 569
575Example: set the log_autoremove option in a way compatible with <v.47 and 570Example: set the log_autoremove option in a way compatible with <v4.7 and
576v4.7. Note the use of & on the constants to avoid triggering a compiletime 571v4.7. Note the use of & on the constants to avoid triggering a compiletime
577bug when the symbol isn't available. 572bug when the symbol isn't available.
578 573
579 $DB_ENV->set_flags (&BDB::LOG_AUTOREMOVE ) if BDB::VERSION v0, v4.7; 574 $DB_ENV->set_flags (&BDB::LOG_AUTOREMOVE ) if BDB::VERSION v0, v4.7;
580 $DB_ENV->log_set_config (&BDB::LOG_AUTO_REMOVE) if BDB::VERSION v4.7; 575 $DB_ENV->log_set_config (&BDB::LOG_AUTO_REMOVE) if BDB::VERSION v4.7;
613=back 608=back
614 609
615=cut 610=cut
616 611
617sub VERSION { 612sub VERSION {
613 # I was dumb enough to override the VERSION method here, so let's try
614 # to fix it up.
615
616 if ($_[0] eq __PACKAGE__) {
617 $VERSION
618 } else {
618 if (@_ > 0) { 619 if (@_ > 0) {
619 return undef if VERSION_v lt $_[0]; 620 return undef if VERSION_v lt $_[0];
620 if (@_ > 1) { 621 if (@_ > 1) {
621 return undef if VERSION_v ge $_[1]; 622 return undef if VERSION_v ge $_[1];
623 }
622 } 624 }
625
626 VERSION_v
623 } 627 }
624
625 VERSION_v
626} 628}
627 629
628=head3 CONTROLLING THE NUMBER OF THREADS 630=head2 CONTROLLING THE NUMBER OF THREADS
629 631
630=over 4 632=over 4
631 633
632=item BDB::min_parallel $nthreads 634=item BDB::min_parallel $nthreads
633 635
694 696
695You can still queue as many requests as you want. Therefore, 697You can still queue as many requests as you want. Therefore,
696C<max_oustsanding> is mainly useful in simple scripts (with low values) or 698C<max_oustsanding> is mainly useful in simple scripts (with low values) or
697as a stop gap to shield against fatal memory overflow (with large values). 699as a stop gap to shield against fatal memory overflow (with large values).
698 700
699=item BDB::set_sync_prepare $cb 701=item $old_cb = BDB::set_sync_prepare $cb
700 702
701Sets a callback that is called whenever a request is created without an 703Sets a callback that is called whenever a request is created without an
702explicit callback. It has to return two code references. The first is used 704explicit callback. It has to return two code references. The first is used
703as the request callback (it should save the return status), and the second 705as the request callback (it should save the return status), and the second
704is called to wait until the first callback has been called (it must set 706is called to wait until the first callback has been called (it must set
705C<$!> to the return status). 707C<$!> to the return status).
706 708
707This mechanism can be used to include BDB into other event mechanisms, 709This mechanism can be used to include BDB into other event mechanisms,
708such as L<AnyEvent::BDB> or L<Coro::BDB>. 710such as L<Coro::BDB>.
709 711
710The default implementation works like this: 712To allow other, callback-based, events to be executed while callback-less
713ones are run, you could use this sync prepare function:
711 714
712 sub { 715 sub {
713 my $status; 716 my $status;
714 ( 717 (
715 sub { $status = $! }, 718 sub { $status = $! },
716 sub { BDB::poll while !defined $status; $! = $status }, 719 sub { BDB::poll while !defined $status; $! = $status },
717 ) 720 )
718 } 721 }
719 722
720It simply blocks the process till the request has finished and then sets 723It works by polling for results till the request has finished and then
721C<$!> to the return value. This means that if you don't use a callback, 724sets C<$!> to the return value. This means that if you don't use a
722BDB will simply fall back to synchronous operations. 725callback, BDB would simply fall back to synchronous operations.
726
727By default, or if the sync prepare function is set to C<undef>, is to
728execute callback-less BDB requests in the foreground thread, setting C<$!>
729to the return value, without polling for other events.
723 730
724=back 731=back
725 732
726=head3 STATISTICAL INFORMATION 733=head2 STATISTICAL INFORMATION
727 734
728=over 4 735=over 4
729 736
730=item BDB::nreqs 737=item BDB::nreqs
731 738
749 756
750=back 757=back
751 758
752=cut 759=cut
753 760
754set_sync_prepare { 761set_sync_prepare (undef);
755 my $status;
756 (
757 sub {
758 $status = $!;
759 },
760 sub {
761 BDB::poll while !defined $status;
762 $! = $status;
763 },
764 )
765};
766 762
767min_parallel 8; 763min_parallel 8;
768 764
769END { flush } 765END { flush }
770 766
7711; 7671;
772 768
769=head1 COMMON PITFALLS
770
771=head2 Unexpected Crashes
772
773Remember that, by default, BDB will execute requests in parallel, in
774somewhat random order. That means that it is easy to run a C<db_get>
775request on thesa me database as a concurrent C<db_close> request, leading
776to a crash, silent data corruption, eventually the next world war on
777terrorism.
778
779If you only ever use foreground requests (without a callback), this will
780not be an issue.
781
782=head2 Unexpected Freezes or Deadlocks
783
784Remember that, by default, BDB will execute requests in parallel, which
785easily leads to deadlocks (even concurrent put's on the same database can
786deadlock).
787
788You either need to run deadlock detection (and handle the resulting
789errors), or make sure only one process ever updates the database, ine one
790thread, e.g. by using only foreground requests (without a callback).
791
773=head2 FORK BEHAVIOUR 792=head1 FORK BEHAVIOUR
774 793
775This module should do "the right thing" when the process using it forks: 794This module should do "the right thing" when the process using it forks:
776 795
777Before the fork, BDB enters a quiescent state where no requests 796Before the fork, BDB enters a quiescent state where no requests
778can be added in other threads and no results will be processed. After 797can be added in other threads and no results will be processed. After
788 807
789Win32 note: there is no fork on win32, and perls emulation of it is too 808Win32 note: there is no fork on win32, and perls emulation of it is too
790broken to be supported, so do not use BDB in a windows pseudo-fork, better 809broken to be supported, so do not use BDB in a windows pseudo-fork, better
791yet, switch to a more capable platform. 810yet, switch to a more capable platform.
792 811
793=head2 MEMORY USAGE 812=head1 MEMORY USAGE
794 813
795Per-request usage: 814Per-request usage:
796 815
797Each aio request uses - depending on your architecture - around 100-200 816Each aio request uses - depending on your architecture - around 100-200
798bytes of memory. In addition, stat requests need a stat buffer (possibly 817bytes of memory. In addition, stat requests need a stat buffer (possibly
807 826
808In the execution phase, some aio requests require more memory for 827In the execution phase, some aio requests require more memory for
809temporary buffers, and each thread requires a stack and other data 828temporary buffers, and each thread requires a stack and other data
810structures (usually around 16k-128k, depending on the OS). 829structures (usually around 16k-128k, depending on the OS).
811 830
831=head1 WIN32 FILENAMES/DATABASE NAME MESS
832
833Perl on Win32 supports only ASCII filenames (the reason is that it abuses
834an internal flag to store wether a filename is Unicode or ANSI, but that
835flag is used for somethign else in the perl core, so there is no way to
836detect wether a filename is ANSI or Unicode-encoded). The BDB module
837tries to work around this issue by assuming that the filename is an ANSI
838filename and BDB was built for unicode support.
839
812=head1 KNOWN BUGS 840=head1 KNOWN BUGS
813 841
814Known bugs will be fixed in the next release, except: 842Known bugs will be fixed in the next release, except:
815 843
816 If you use a transaction in any request, and the request returns 844 If you use a transaction in any request, and the request returns

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines