
Comparing BDB/BDB.pm (file contents):
Revision 1.6 by root, Sun Feb 11 22:07:23 2007 UTC vs.
Revision 1.15 by root, Thu Sep 13 21:34:00 2007 UTC

6 6
7 use BDB; 7 use BDB;
8 8
9=head1 DESCRIPTION 9=head1 DESCRIPTION
10 10
11=head2 EXAMPLE 11See the BerkeleyDB documentation (L<http://www.oracle.com/technology/documentation/berkeley-db/db/index.html>).
12The BDB API is very similar to the C API (the translation has been very faithful).
13
14See also the example sections in the document below and possibly the eg/
15subdirectory of the BDB distribution. Last but not least, see the IO::AIO
16documentation, as that module uses almost the same asynchronous request
17model as this module.
18
19I know this is woefully inadequate documentation. Send a patch!
20
12 21
13=head1 REQUEST ANATOMY AND LIFETIME 22=head1 REQUEST ANATOMY AND LIFETIME
14 23
15Every request method creates a request, which is a C data structure not 24Every request method creates a request, which is a C data structure not
16directly visible to Perl. 25directly visible to Perl.
63use strict 'vars'; 72use strict 'vars';
64 73
65use base 'Exporter'; 74use base 'Exporter';
66 75
67BEGIN { 76BEGIN {
68 our $VERSION = '0.1'; 77 our $VERSION = '1.1';
69 78
70 our @BDB_REQ = qw( 79 our @BDB_REQ = qw(
71 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect 80 db_env_open db_env_close db_env_txn_checkpoint db_env_lock_detect
72 db_env_memp_sync db_env_memp_trickle 81 db_env_memp_sync db_env_memp_trickle
73 db_open db_close db_compact db_sync db_put db_get db_pget db_del db_key_range 82 db_open db_close db_compact db_sync db_put db_get db_pget db_del db_key_range
74 db_txn_commit db_txn_abort 83 db_txn_commit db_txn_abort db_txn_finish
75 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del 84 db_c_close db_c_count db_c_put db_c_get db_c_pget db_c_del
76 db_sequence_open db_sequence_close 85 db_sequence_open db_sequence_close
77 db_sequence_get db_sequence_remove 86 db_sequence_get db_sequence_remove
78 ); 87 );
79 our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create)); 88 our @EXPORT = (@BDB_REQ, qw(dbreq_pri dbreq_nice db_env_create db_create));
86 95
87 require XSLoader; 96 require XSLoader;
88 XSLoader::load ("BDB", $VERSION); 97 XSLoader::load ("BDB", $VERSION);
89} 98}
90 99
100=head2 BERKELEYDB FUNCTIONS
101
102All of these are functions. The create functions simply return a new
103object and never block. All the remaining functions take an optional
104callback as last argument. If it is missing, then the function will be
105executed synchronously.
106
107BDB functions that cannot block (mostly functions that manipulate
108settings) are method calls on the relevant objects, so the rule of thumb
109is: if it's a method, it's not blocking; if it's a function, it takes a
110callback as last argument.
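
The optional-trailing-callback convention described above can be sketched in plain Perl. This is a toy model only: C<toy_request>, C<toy_poll> and C<@pending> are hypothetical names, not part of the BDB module, and the real requests report results through their arguments and C<$!> rather than through a return value.

```perl
use strict;
use warnings;

my @pending;    # queued completions in this toy model (not BDB's real queue)

# toy_request is a hypothetical stand-in for a request function: it takes
# a value and an optional callback as last argument.
sub toy_request {
    my $cb = (@_ && ref $_[-1] eq 'CODE') ? pop @_ : undef;
    my ($value) = @_;

    my $work = sub { $value * 2 };

    # No callback: run the work right away and hand back the result.
    return $work->() unless $cb;

    # Callback given: defer the work and deliver its result later.
    push @pending, sub { $cb->($work->()) };
    return;
}

# toy_poll runs all deferred requests, loosely playing the role of one
# event loop iteration.
sub toy_poll {
    while (my $job = shift @pending) {
        $job->();
    }
    return;
}

my $sync = toy_request (21);                    # synchronous form
my $async;
toy_request (21, sub { $async = shift });       # asynchronous form
my $before = defined $async ? "set" : "unset";  # nothing delivered yet
toy_poll ();

print "sync=$sync before=$before async=$async\n";
```

The point of the sketch is the calling shape: the same function serves both modes, and the caller chooses between them simply by supplying or omitting the final code reference.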
111
112In the following, C<$int> signifies an integer return value,
113C<octetstring> is a "binary string" (i.e. a perl string with no characters
114above code point 255), C<U32> is an unsigned 32 bit integer, C<int> is some
115integer, C<NV> is a floating point value.
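
As an illustration of the C<octetstring> requirement, the following sketch checks whether a Perl string qualifies and shows one way to turn text into octets first. C<is_octetstring> is a hypothetical helper name chosen for this example, not a function provided by BDB; C<utf8::encode> is the core Perl function.

```perl
use strict;
use warnings;

# is_octetstring is a hypothetical helper name, not a BDB function.
# It returns 1 when every character of the string is in the range 0..255.
sub is_octetstring {
    my ($string) = @_;
    return $string =~ /[^\x00-\xff]/ ? 0 : 1;
}

my $text  = "caf\x{e9} \x{263a}";   # contains U+263A, which is above 255
my $bytes = $text;
utf8::encode ($bytes);              # in-place conversion to UTF-8 octets

printf "text: %d, bytes: %d\n", is_octetstring ($text), is_octetstring ($bytes);
```

Encoding to UTF-8 is only one possible choice; any encoding that yields octets would satisfy the same check.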
116
117The C<SV *> types are generic perl scalars (for input and output of data
118values), and the C<SV *callback> is the optional callback function to call
119when the request is completed.
120
121The various C<DB_ENV> etc. arguments are handles returned by
122C<db_env_create>, C<db_create>, C<txn_begin> and so on. If they have an
123appended C<_ornull> this means they are optional and you can pass C<undef>
124for them, resulting in a NULL pointer on the C level.
125
126=head3 BDB functions
127
128Functions in the BDB namespace, exported by default:
129
130 $env = db_env_create (U32 env_flags = 0)
131 flags: RPCCLIENT
132
133 db_env_open (DB_ENV *env, octetstring db_home, U32 open_flags, int mode, SV *callback = &PL_sv_undef)
134 open_flags: INIT_CDB INIT_LOCK INIT_LOG INIT_MPOOL INIT_REP INIT_TXN RECOVER RECOVER_FATAL USE_ENVIRON USE_ENVIRON_ROOT CREATE LOCKDOWN PRIVATE REGISTER SYSTEM_MEM
135 db_env_close (DB_ENV *env, U32 flags = 0, SV *callback = &PL_sv_undef)
136 db_env_txn_checkpoint (DB_ENV *env, U32 kbyte = 0, U32 min = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
137 flags: FORCE
138 db_env_lock_detect (DB_ENV *env, U32 flags = 0, U32 atype = DB_LOCK_DEFAULT, SV *dummy = 0, SV *callback = &PL_sv_undef)
139 atype: LOCK_DEFAULT LOCK_EXPIRE LOCK_MAXLOCKS LOCK_MAXWRITE LOCK_MINLOCKS LOCK_MINWRITE LOCK_OLDEST LOCK_RANDOM LOCK_YOUNGEST
140 db_env_memp_sync (DB_ENV *env, SV *dummy = 0, SV *callback = &PL_sv_undef)
141 db_env_memp_trickle (DB_ENV *env, int percent, SV *dummy = 0, SV *callback = &PL_sv_undef)
142
143 $db = db_create (DB_ENV *env = 0, U32 flags = 0)
144 flags: XA_CREATE
145
146 db_open (DB *db, DB_TXN_ornull *txnid, octetstring file, octetstring database, int type, U32 flags, int mode, SV *callback = &PL_sv_undef)
147 flags: AUTO_COMMIT CREATE EXCL MULTIVERSION NOMMAP RDONLY READ_UNCOMMITTED THREAD TRUNCATE
148 db_close (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
149 flags: DB_NOSYNC
150 db_compact (DB *db, DB_TXN_ornull *txn = 0, SV *start = 0, SV *stop = 0, SV *unused1 = 0, U32 flags = DB_FREE_SPACE, SV *unused2 = 0, SV *callback = &PL_sv_undef)
151 flags: FREELIST_ONLY FREE_SPACE
152 db_sync (DB *db, U32 flags = 0, SV *callback = &PL_sv_undef)
153 db_key_range (DB *db, DB_TXN_ornull *txn, SV *key, SV *key_range, U32 flags = 0, SV *callback = &PL_sv_undef)
154 db_put (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
155 flags: APPEND NODUPDATA NOOVERWRITE
156 db_get (DB *db, DB_TXN_ornull *txn, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
157 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
158 db_pget (DB *db, DB_TXN_ornull *txn, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
159 flags: CONSUME CONSUME_WAIT GET_BOTH SET_RECNO MULTIPLE READ_COMMITTED READ_UNCOMMITTED RMW
160 db_del (DB *db, DB_TXN_ornull *txn, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
161 db_txn_commit (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
162 flags: TXN_NOSYNC TXN_SYNC
163 db_txn_abort (DB_TXN *txn, SV *callback = &PL_sv_undef)
164
165 db_c_close (DBC *dbc, SV *callback = &PL_sv_undef)
166 db_c_count (DBC *dbc, SV *count, U32 flags = 0, SV *callback = &PL_sv_undef)
167 db_c_put (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
168 flags: AFTER BEFORE CURRENT KEYFIRST KEYLAST NODUPDATA
169 db_c_get (DBC *dbc, SV *key, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
170 flags: CURRENT FIRST GET_BOTH GET_BOTH_RANGE GET_RECNO JOIN_ITEM LAST NEXT NEXT_DUP NEXT_NODUP PREV PREV_DUP PREV_NODUP SET SET_RANGE SET_RECNO READ_UNCOMMITTED MULTIPLE MULTIPLE_KEY RMW
171 db_c_pget (DBC *dbc, SV *key, SV *pkey, SV *data, U32 flags = 0, SV *callback = &PL_sv_undef)
172 db_c_del (DBC *dbc, U32 flags = 0, SV *callback = &PL_sv_undef)
173
174 db_sequence_open (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, SV *key, U32 flags = 0, SV *callback = &PL_sv_undef)
175 flags: CREATE EXCL
176 db_sequence_close (DB_SEQUENCE *seq, U32 flags = 0, SV *callback = &PL_sv_undef)
177 db_sequence_get (DB_SEQUENCE *seq, DB_TXN_ornull *txnid, int delta, SV *seq_value, U32 flags = DB_TXN_NOSYNC, SV *callback = &PL_sv_undef)
178 flags: TXN_NOSYNC
179 db_sequence_remove (DB_SEQUENCE *seq, DB_TXN_ornull *txnid = 0, U32 flags = 0, SV *callback = &PL_sv_undef)
180 flags: TXN_NOSYNC
181
182=head4 db_txn_finish (DB_TXN *txn, U32 flags = 0, SV *callback = &PL_sv_undef)
183
184This is not a Berkeley DB function but a BDB module extension. It is very
185annoying to have to check every single BDB function for error returns and
186provide a codepath out of your transaction. While the BDB module still
187makes this possible, it contains the following extensions:
188
189When a transaction-protected function returns any operating system
190error (errno > 0), BDB will set the C<TXN_DEADLOCK> flag on the
191transaction. This flag is also set by Berkeley DB functions externally
192when an operation fails with LOCK_DEADLOCK, and it causes all further
193operations on that transaction (including C<db_txn_commit>) to fail.
194
195The C<db_txn_finish> request will look at this flag, and, if it is set,
196will automatically call C<db_txn_abort> (setting errno to C<LOCK_DEADLOCK>
197if it isn't set). If it isn't set, it will call C<db_txn_commit> and
198return the error normally.
199
200How to use this? Easy: just write your transaction normally:
201
202 my $txn = $db_env->txn_begin;
203 db_get $db, $txn, "key", my $data;
204 db_put $db, $txn, "key", $data + 1 unless $! == BDB::NOTFOUND;
205 db_txn_finish $txn;
206 die "transaction failed" if $!;
207
208That is, handle only the expected errors. If something unexpected happens
209(EIO, LOCK_NOTGRANTED or a deadlock in either db_get or db_put), then the remaining
210requests (db_put in this case) will simply be skipped (they will fail with
211LOCK_DEADLOCK) and the transaction will be aborted.
212
213You can use the C<< $txn->failed >> method to check whether a transaction
214has failed in this way and abort further processing (excluding
215C<db_txn_finish>).
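
The failed-flag behaviour described in this section can be modelled in a few lines of plain Perl. This is a sketch under stated assumptions, not the module's implementation: C<toy_txn>, C<toy_txn_request>, C<toy_finish> and C<%FATAL> are hypothetical names, the transaction is represented by a hash instead of the real C structure, and statuses are represented by strings instead of C<$!> values.

```perl
use strict;
use warnings;

# Statuses that mark the transaction as failed in this model, following
# the examples named in the text (EIO, LOCK_NOTGRANTED, a deadlock).
my %FATAL = map { $_ => 1 } qw(EIO LOCK_NOTGRANTED LOCK_DEADLOCK);

# Hypothetical stand-in for a transaction handle. The real DB_TXN is a C
# structure managed by the module, not a Perl hash.
sub toy_txn { return { failed => 0 } }

# Model of a transaction-protected request. $op returns "ok" or a status
# name. Once the flag is set, later requests are skipped and report
# LOCK_DEADLOCK without running.
sub toy_txn_request {
    my ($txn, $op) = @_;
    return "LOCK_DEADLOCK" if $txn->{failed};
    my $status = $op->();
    $txn->{failed} = 1 if $FATAL{$status};
    return $status;
}

# Model of the finish step: abort when the flag is set, commit otherwise.
sub toy_finish {
    my ($txn) = @_;
    return $txn->{failed} ? "abort" : "commit";
}

# An expected status such as NOTFOUND does not fail the transaction.
my $good = toy_txn ();
toy_txn_request ($good, sub { "NOTFOUND" });
toy_txn_request ($good, sub { "ok" });
print "good: ", toy_finish ($good), "\n";

# An unexpected error fails it, and the remaining request is skipped.
my $bad = toy_txn ();
my @seen;
push @seen, toy_txn_request ($bad, $_)
    for (sub { "ok" }, sub { "EIO" }, sub { "ok" });
print "bad: @seen -> ", toy_finish ($bad), "\n";
```

The model shows why the caller only has to handle the expected statuses: anything unexpected is remembered on the transaction, and the single finish step decides between commit and abort.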
216
217=head3 DB_ENV/database environment methods
218
219Methods available on DB_ENV/$env handles:
220
221 DESTROY (DB_ENV_ornull *env)
222 CODE:
223 if (env)
224 env->close (env, 0);
225
226 $int = $env->set_data_dir (const char *dir)
227 $int = $env->set_tmp_dir (const char *dir)
228 $int = $env->set_lg_dir (const char *dir)
229 $int = $env->set_shm_key (long shm_key)
230 $int = $env->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
231 $int = $env->set_flags (U32 flags, int onoff)
232 $env->set_errfile (FILE *errfile = 0)
233 $env->set_msgfile (FILE *msgfile = 0)
234 $int = $env->set_verbose (U32 which, int onoff = 1)
235 $int = $env->set_encrypt (const char *password, U32 flags = 0)
236 $int = $env->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
237 $int = $env->set_mp_max_openfd (int maxopenfd);
238 $int = $env->set_mp_max_write (int maxwrite, int maxwrite_sleep);
239 $int = $env->set_mp_mmapsize (int mmapsize_mb)
240 $int = $env->set_lk_detect (U32 detect = DB_LOCK_DEFAULT)
241 $int = $env->set_lk_max_lockers (U32 max)
242 $int = $env->set_lk_max_locks (U32 max)
243 $int = $env->set_lk_max_objects (U32 max)
244 $int = $env->set_lg_bsize (U32 max)
245 $int = $env->set_lg_max (U32 max)
246
247 $txn = $env->txn_begin (DB_TXN_ornull *parent = 0, U32 flags = 0)
248 flags: READ_COMMITTED READ_UNCOMMITTED TXN_NOSYNC TXN_NOWAIT TXN_SNAPSHOT TXN_SYNC TXN_WAIT TXN_WRITE_NOSYNC
249
250=head4 Example:
251
252 use AnyEvent;
253 use BDB;
254
255 our $FH; open $FH, "<&=" . BDB::poll_fileno;
256 our $WATCHER = AnyEvent->io (fh => $FH, poll => 'r', cb => \&BDB::poll_cb);
257
258 BDB::min_parallel 8;
259
260 my $env = db_env_create;
261
262 mkdir "bdtest", 0700;
263 db_env_open
264 $env,
265 "bdtest",
266 BDB::INIT_LOCK | BDB::INIT_LOG | BDB::INIT_MPOOL | BDB::INIT_TXN | BDB::RECOVER | BDB::USE_ENVIRON | BDB::CREATE,
267 0600;
268
269 $env->set_flags (BDB::AUTO_COMMIT | BDB::TXN_NOSYNC, 1);
270
271
272=head3 DB/database methods
273
274Methods available on DB/$db handles:
275
276 DESTROY (DB_ornull *db)
277 CODE:
278 if (db)
279 {
280 SV *env = (SV *)db->app_private;
281 db->close (db, 0);
282 SvREFCNT_dec (env);
283 }
284
285 $int = $db->set_cachesize (U32 gbytes, U32 bytes, int ncache = 0)
286 $int = $db->set_flags (U32 flags)
287 flags: CHKSUM ENCRYPT TXN_NOT_DURABLE
288 Btree: DUP DUPSORT RECNUM REVSPLITOFF
289 Hash: DUP DUPSORT
290 Queue: INORDER
291 Recno: RENUMBER SNAPSHOT
292
293 $int = $db->set_encrypt (const char *password, U32 flags)
294 $int = $db->set_lorder (int lorder)
295 $int = $db->set_bt_minkey (U32 minkey)
296 $int = $db->set_re_delim (int delim)
297 $int = $db->set_re_pad (int re_pad)
298 $int = $db->set_re_source (char *source)
299 $int = $db->set_re_len (U32 re_len)
300 $int = $db->set_h_ffactor (U32 h_ffactor)
301 $int = $db->set_h_nelem (U32 h_nelem)
302 $int = $db->set_q_extentsize (U32 extentsize)
303
304 $dbc = $db->cursor (DB_TXN_ornull *txn = 0, U32 flags = 0)
305 flags: READ_COMMITTED READ_UNCOMMITTED WRITECURSOR TXN_SNAPSHOT
306 $seq = $db->sequence (U32 flags = 0)
307
308=head4 Example:
309
310 my $db = db_create $env;
311 db_open $db, undef, "table", undef, BDB::BTREE, BDB::AUTO_COMMIT | BDB::CREATE | BDB::READ_UNCOMMITTED, 0600;
312
313 for (1..1000) {
314 db_put $db, undef, "key $_", "data $_";
315
316 db_key_range $db, undef, "key $_", my $keyrange;
317 my ($lt, $eq, $gt) = @$keyrange;
318 }
319
320 db_del $db, undef, "key $_" for 1..1000;
321
322 db_sync $db;
323
324
325=head3 DB_TXN/transaction methods
326
327Methods available on DB_TXN/$txn handles:
328
329 DESTROY (DB_TXN_ornull *txn)
330 CODE:
331 if (txn)
332 txn->abort (txn);
333
334 $int = $txn->set_timeout (NV timeout_seconds, U32 flags = SET_TXN_TIMEOUT)
335 flags: SET_LOCK_TIMEOUT SET_TXN_TIMEOUT
336
337 $bool = $txn->failed
338 # see db_txn_finish documentation, above
339
340
341=head3 DBC/cursor methods
342
343Methods available on DBC/$dbc handles:
344
345 DESTROY (DBC_ornull *dbc)
346 CODE:
347 if (dbc)
348 dbc->c_close (dbc);
349
350=head4 Example:
351
352 my $c = $db->cursor;
353
354 for (;;) {
355 db_c_get $c, my $key, my $data, BDB::NEXT;
356 warn "<$!,$key,$data>";
357 last if $!;
358 }
359
360 db_c_close $c;
361
362
363=head3 DB_SEQUENCE/sequence methods
364
365Methods available on DB_SEQUENCE/$seq handles:
366
367 DESTROY (DB_SEQUENCE_ornull *seq)
368 CODE:
369 if (seq)
370 seq->close (seq, 0);
371
372 $int = $seq->initial_value (db_seq_t value)
373 $int = $seq->set_cachesize (U32 size)
374 $int = $seq->set_flags (U32 flags)
375 flags: SEQ_DEC SEQ_INC SEQ_WRAP
376 $int = $seq->set_range (db_seq_t min, db_seq_t max)
377
378=head4 Example:
379
380 my $seq = $db->sequence;
381
382 db_sequence_open $seq, undef, "seq", BDB::CREATE;
383 db_sequence_get $seq, undef, 1, my $value;
384
385
91=head2 SUPPORT FUNCTIONS 386=head2 SUPPORT FUNCTIONS
92 387
93=head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION 388=head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION
94 389
95=over 4 390=over 4
178Strictly equivalent to: 473Strictly equivalent to:
179 474
180 BDB::poll_wait, BDB::poll_cb 475 BDB::poll_wait, BDB::poll_cb
181 while BDB::nreqs; 476 while BDB::nreqs;
182 477
478=back
479
183=head3 CONTROLLING THE NUMBER OF THREADS 480=head3 CONTROLLING THE NUMBER OF THREADS
481
482=over 4
184 483
185=item BDB::min_parallel $nthreads 484=item BDB::min_parallel $nthreads
186 485
187Set the minimum number of AIO threads to C<$nthreads>. The current 486Set the minimum number of AIO threads to C<$nthreads>. The current
188default is C<8>, which means eight asynchronous operations can execute 487default is C<8>, which means eight asynchronous operations can execute
337bytes of memory. In addition, stat requests need a stat buffer (possibly 636bytes of memory. In addition, stat requests need a stat buffer (possibly
338a few hundred bytes), readdir requires a result buffer and so on. Perl 637a few hundred bytes), readdir requires a result buffer and so on. Perl
339scalars and other data passed into aio requests will also be locked and 638scalars and other data passed into aio requests will also be locked and
340will consume memory till the request has entered the done state. 639will consume memory till the request has entered the done state.
341 640
342This is now awfully much, so queuing lots of requests is not usually a 641This is not awfully much, so queuing lots of requests is not usually a
343problem. 642problem.
344 643
345Per-thread usage: 644Per-thread usage:
346 645
347In the execution phase, some aio requests require more memory for 646In the execution phase, some aio requests require more memory for
348temporary buffers, and each thread requires a stack and other data 647temporary buffers, and each thread requires a stack and other data
349structures (usually around 16k-128k, depending on the OS). 648structures (usually around 16k-128k, depending on the OS).
350 649
351=head1 KNOWN BUGS 650=head1 KNOWN BUGS
352 651
353Known bugs will be fixed in the next release. 652Known bugs will be fixed in the next release, except:
653
654 If you use a transaction in any request, and the request returns
655 with an operating system error or DB_LOCK_NOTGRANTED, the internal
656 TXN_DEADLOCK flag will be set on the transaction. See C<db_txn_finish>,
657 above.
354 658
355=head1 SEE ALSO 659=head1 SEE ALSO
356 660
357L<Coro::AIO>. 661L<Coro::AIO>.
358 662
