ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.101 by root, Sun Jan 7 22:59:57 2007 UTC vs.
Revision 1.116 by root, Wed Oct 3 21:27:51 2007 UTC

62etc.), but can also be used to easily do operations in parallel that are 62etc.), but can also be used to easily do operations in parallel that are
63normally done sequentially, e.g. stat'ing many files, which is much faster 63normally done sequentially, e.g. stat'ing many files, which is much faster
64on a RAID volume or over NFS when you do a number of stat operations 64on a RAID volume or over NFS when you do a number of stat operations
65concurrently. 65concurrently.
66 66
67While most of this works on all types of file descriptors (for example 67While most of this works on all types of file descriptors (for
68sockets), using these functions on file descriptors that support 68example sockets), using these functions on file descriptors that
69nonblocking operation (again, sockets, pipes etc.) is very inefficient or 69support nonblocking operation (again, sockets, pipes etc.) is very
70might not work (aio_read fails on sockets/pipes/fifos). Use an event loop 70inefficient. Use an event loop for that (such as the L<Event|Event>
71for that (such as the L<Event|Event> module): IO::AIO will naturally fit 71module): IO::AIO will naturally fit into such an event loop itself.
72into such an event loop itself.
73 72
74In this version, a number of threads are started that execute your 73In this version, a number of threads are started that execute your
75requests and signal their completion. You don't need thread support 74requests and signal their completion. You don't need thread support
76in perl, and the threads created by this module will not be visible 75in perl, and the threads created by this module will not be visible
77to perl. In the future, this module might make use of the native aio 76to perl. In the future, this module might make use of the native aio
79not well-supported or restricted (GNU/Linux doesn't allow them on normal 78not well-supported or restricted (GNU/Linux doesn't allow them on normal
80files currently, for example), and they would only support aio_read and 79files currently, for example), and they would only support aio_read and
81aio_write, so the remaining functionality would have to be implemented 80aio_write, so the remaining functionality would have to be implemented
82using threads anyway. 81using threads anyway.
83 82
84Although the module will work with in the presence of other (Perl-) 83Although the module will work in the presence of other (Perl-) threads,
85threads, it is currently not reentrant in any way, so use appropriate 84it is currently not reentrant in any way, so use appropriate locking
86locking yourself, always call C<poll_cb> from within the same thread, or 85yourself, always call C<poll_cb> from within the same thread, or never
87never call C<poll_cb> (or other C<aio_> functions) recursively. 86call C<poll_cb> (or other C<aio_> functions) recursively.
88 87
89=head2 EXAMPLE 88=head2 EXAMPLE
90 89
91This is a simple example that uses the Event module and loads 90This is a simple example that uses the Event module and loads
92F</etc/passwd> asynchronously: 91F</etc/passwd> asynchronously:
190use strict 'vars'; 189use strict 'vars';
191 190
192use base 'Exporter'; 191use base 'Exporter';
193 192
194BEGIN { 193BEGIN {
195 our $VERSION = '2.32'; 194 our $VERSION = '2.5';
196 195
197 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat 196 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat
198 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink 197 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink
199 aio_readlink aio_fsync aio_fdatasync aio_readahead aio_rename aio_link 198 aio_readlink aio_fsync aio_fdatasync aio_readahead aio_rename aio_link
200 aio_move aio_copy aio_group aio_nop aio_mknod aio_load aio_rmtree aio_mkdir); 199 aio_move aio_copy aio_group aio_nop aio_mknod aio_load aio_rmtree aio_mkdir
200 aio_chown aio_chmod aio_utime aio_truncate);
201 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice aio_block)); 201 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice aio_block));
202 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush 202 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
203 min_parallel max_parallel max_idle 203 min_parallel max_parallel max_idle
204 nreqs nready npending nthreads 204 nreqs nready npending nthreads
205 max_poll_time max_poll_reqs); 205 max_poll_time max_poll_reqs);
271 aio_read $_[0], ..., sub { 271 aio_read $_[0], ..., sub {
272 ... 272 ...
273 }; 273 };
274 }; 274 };
275 275
276
276=item aioreq_nice $pri_adjust 277=item aioreq_nice $pri_adjust
277 278
278Similar to C<aioreq_pri>, but subtracts the given value from the current 279Similar to C<aioreq_pri>, but subtracts the given value from the current
279priority, so the effect is cumulative. 280priority, so the effect is cumulative.
281
280 282
281=item aio_open $pathname, $flags, $mode, $callback->($fh) 283=item aio_open $pathname, $flags, $mode, $callback->($fh)
282 284
283Asynchronously open or create a file and call the callback with a newly 285Asynchronously open or create a file and call the callback with a newly
284created filehandle for the file. 286created filehandle for the file.
305 } else { 307 } else {
306 die "open failed: $!\n"; 308 die "open failed: $!\n";
307 } 309 }
308 }; 310 };
309 311
312
310=item aio_close $fh, $callback->($status) 313=item aio_close $fh, $callback->($status)
311 314
312Asynchronously close a file and call the callback with the result 315Asynchronously close a file and call the callback with the result
313code. I<WARNING:> although accepted, you should not pass in a perl 316code.
314filehandle here, as perl will likely close the file descriptor another
315time when the filehandle is destroyed. Normally, you can safely call perls
316C<close> or just let filehandles go out of scope.
317 317
318This is supposed to be a bug in the API, so that might change. It's 318Unlike the other functions operating on files, this function uses the
319therefore best to avoid this function. 319PerlIO layer to close the filehandle. The reason is that the PerlIO API
320insists on closing the underlying fd itself, no matter what, and doesn't
321allow modifications to the fd. Unfortunately, it is not clear that you can
322call PerlIO from different threads (actually, its quite clear that this
323won't work in some cases), so while it likely works perfectly with simple
324file handles (such as the ones created by C<aio_open>) it might fail in
325interesting ways for others.
326
327Having said that, aio_close tries to clean up the filehandle as much as
328possible before handing it to an io thread, and generally does work.
329
320 330
321=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 331=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
322 332
323=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 333=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
324 334
325Reads or writes C<length> bytes from the specified C<fh> and C<offset> 335Reads or writes C<$length> bytes from the specified C<$fh> and C<$offset>
326into the scalar given by C<data> and offset C<dataoffset> and calls the 336into the scalar given by C<$data> and offset C<$dataoffset> and calls the
327callback without the actual number of bytes read (or -1 on error, just 337callback without the actual number of bytes read (or -1 on error, just
328like the syscall). 338like the syscall).
329 339
340If C<$offset> is undefined, then the current file descriptor offset will
341be used (and updated), otherwise the file descriptor offset will not be
342changed by these calls.
343
344If C<$length> is undefined in C<aio_write>, use the remaining length of C<$data>.
345
346If C<$dataoffset> is less than zero, it will be counted from the end of
347C<$data>.
348
330The C<$data> scalar I<MUST NOT> be modified in any way while the request 349The C<$data> scalar I<MUST NOT> be modified in any way while the request
331is outstanding. Modifying it can result in segfaults or WW3 (if the 350is outstanding. Modifying it can result in segfaults or World War III (if
332necessary/optional hardware is installed). 351the necessary/optional hardware is installed).
333 352
334Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at 353Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at
335offset C<0> within the scalar: 354offset C<0> within the scalar:
336 355
337 aio_read $fh, 7, 15, $buffer, 0, sub { 356 aio_read $fh, 7, 15, $buffer, 0, sub {
338 $_[0] > 0 or die "read error: $!"; 357 $_[0] > 0 or die "read error: $!";
339 print "read $_[0] bytes: <$buffer>\n"; 358 print "read $_[0] bytes: <$buffer>\n";
340 }; 359 };
360
341 361
342=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval) 362=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval)
343 363
344Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts 364Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts
345reading at byte offset C<$in_offset>, and starts writing at the current 365reading at byte offset C<$in_offset>, and starts writing at the current
359C<$in_fh> than are written, and there is no way to find out how many 379C<$in_fh> than are written, and there is no way to find out how many
360bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only 380bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only
361provides the number of bytes written to C<$out_fh>. Only if the result 381provides the number of bytes written to C<$out_fh>. Only if the result
362value equals C<$length> one can assume that C<$length> bytes have been 382value equals C<$length> one can assume that C<$length> bytes have been
363read. 383read.
384
364 385
365=item aio_readahead $fh,$offset,$length, $callback->($retval) 386=item aio_readahead $fh,$offset,$length, $callback->($retval)
366 387
367C<aio_readahead> populates the page cache with data from a file so that 388C<aio_readahead> populates the page cache with data from a file so that
368subsequent reads from that file will not block on disk I/O. The C<$offset> 389subsequent reads from that file will not block on disk I/O. The C<$offset>
374file. The current file offset of the file is left unchanged. 395file. The current file offset of the file is left unchanged.
375 396
376If that syscall doesn't exist (likely if your OS isn't Linux) it will be 397If that syscall doesn't exist (likely if your OS isn't Linux) it will be
377emulated by simply reading the data, which would have a similar effect. 398emulated by simply reading the data, which would have a similar effect.
378 399
400
379=item aio_stat $fh_or_path, $callback->($status) 401=item aio_stat $fh_or_path, $callback->($status)
380 402
381=item aio_lstat $fh, $callback->($status) 403=item aio_lstat $fh, $callback->($status)
382 404
383Works like perl's C<stat> or C<lstat> in void context. The callback will 405Works like perl's C<stat> or C<lstat> in void context. The callback will
396 aio_stat "/etc/passwd", sub { 418 aio_stat "/etc/passwd", sub {
397 $_[0] and die "stat failed: $!"; 419 $_[0] and die "stat failed: $!";
398 print "size is ", -s _, "\n"; 420 print "size is ", -s _, "\n";
399 }; 421 };
400 422
423
424=item aio_utime $fh_or_path, $atime, $mtime, $callback->($status)
425
426Works like perl's C<utime> function (including the special case of $atime
427and $mtime being undef). Fractional times are supported if the underlying
428syscalls support them.
429
430When called with a pathname, uses utimes(2) if available, otherwise
431utime(2). If called on a file descriptor, uses futimes(2) if available,
432otherwise returns ENOSYS, so this is not portable.
433
434Examples:
435
436 # set atime and mtime to current time (basically touch(1)):
437 aio_utime "path", undef, undef;
438 # set atime to current time and mtime to beginning of the epoch:
439 aio_utime "path", time, undef; # undef==0
440
441
442=item aio_chown $fh_or_path, $uid, $gid, $callback->($status)
443
444Works like perl's C<chown> function, except that C<undef> for either $uid
445or $gid is being interpreted as "do not change" (but -1 can also be used).
446
447Examples:
448
449 # same as "chown root path" in the shell:
450 aio_chown "path", 0, -1;
451 # same as above:
452 aio_chown "path", 0, undef;
453
454
455=item aio_truncate $fh_or_path, $offset, $callback->($status)
456
457Works like truncate(2) or ftruncate(2).
458
459
460=item aio_chmod $fh_or_path, $mode, $callback->($status)
461
462Works like perl's C<chmod> function.
463
464
401=item aio_unlink $pathname, $callback->($status) 465=item aio_unlink $pathname, $callback->($status)
402 466
403Asynchronously unlink (delete) a file and call the callback with the 467Asynchronously unlink (delete) a file and call the callback with the
404result code. 468result code.
405 469
470
406=item aio_mknod $path, $mode, $dev, $callback->($status) 471=item aio_mknod $path, $mode, $dev, $callback->($status)
407 472
408[EXPERIMENTAL] 473[EXPERIMENTAL]
409 474
410Asynchronously create a device node (or fifo). See mknod(2). 475Asynchronously create a device node (or fifo). See mknod(2).
411 476
412The only (POSIX-) portable way of calling this function is: 477The only (POSIX-) portable way of calling this function is:
413 478
414 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ... 479 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ...
480
415 481
416=item aio_link $srcpath, $dstpath, $callback->($status) 482=item aio_link $srcpath, $dstpath, $callback->($status)
417 483
418Asynchronously create a new link to the existing object at C<$srcpath> at 484Asynchronously create a new link to the existing object at C<$srcpath> at
419the path C<$dstpath> and call the callback with the result code. 485the path C<$dstpath> and call the callback with the result code.
420 486
487
421=item aio_symlink $srcpath, $dstpath, $callback->($status) 488=item aio_symlink $srcpath, $dstpath, $callback->($status)
422 489
423Asynchronously create a new symbolic link to the existing object at C<$srcpath> at 490Asynchronously create a new symbolic link to the existing object at C<$srcpath> at
424the path C<$dstpath> and call the callback with the result code. 491the path C<$dstpath> and call the callback with the result code.
492
425 493
426=item aio_readlink $path, $callback->($link) 494=item aio_readlink $path, $callback->($link)
427 495
428Asynchronously read the symlink specified by C<$path> and pass it to 496Asynchronously read the symlink specified by C<$path> and pass it to
429the callback. If an error occurs, nothing or undef gets passed to the 497the callback. If an error occurs, nothing or undef gets passed to the
430callback. 498callback.
431 499
500
432=item aio_rename $srcpath, $dstpath, $callback->($status) 501=item aio_rename $srcpath, $dstpath, $callback->($status)
433 502
434Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as 503Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as
435rename(2) and call the callback with the result code. 504rename(2) and call the callback with the result code.
505
436 506
437=item aio_mkdir $pathname, $mode, $callback->($status) 507=item aio_mkdir $pathname, $mode, $callback->($status)
438 508
439Asynchronously mkdir (create) a directory and call the callback with 509Asynchronously mkdir (create) a directory and call the callback with
440the result code. C<$mode> will be modified by the umask at the time the 510the result code. C<$mode> will be modified by the umask at the time the
441request is executed, so do not change your umask. 511request is executed, so do not change your umask.
442 512
513
443=item aio_rmdir $pathname, $callback->($status) 514=item aio_rmdir $pathname, $callback->($status)
444 515
445Asynchronously rmdir (delete) a directory and call the callback with the 516Asynchronously rmdir (delete) a directory and call the callback with the
446result code. 517result code.
518
447 519
448=item aio_readdir $pathname, $callback->($entries) 520=item aio_readdir $pathname, $callback->($entries)
449 521
450Unlike the POSIX call of the same name, C<aio_readdir> reads an entire 522Unlike the POSIX call of the same name, C<aio_readdir> reads an entire
451directory (i.e. opendir + readdir + closedir). The entries will not be 523directory (i.e. opendir + readdir + closedir). The entries will not be
452sorted, and will B<NOT> include the C<.> and C<..> entries. 524sorted, and will B<NOT> include the C<.> and C<..> entries.
453 525
454The callback a single argument which is either C<undef> or an array-ref 526The callback a single argument which is either C<undef> or an array-ref
455with the filenames. 527with the filenames.
528
456 529
457=item aio_load $path, $data, $callback->($status) 530=item aio_load $path, $data, $callback->($status)
458 531
459This is a composite request that tries to fully load the given file into 532This is a composite request that tries to fully load the given file into
460memory. Status is the same as with aio_read. 533memory. Status is the same as with aio_read.
469 my $pri = aioreq_pri; 542 my $pri = aioreq_pri;
470 my $grp = aio_group $cb; 543 my $grp = aio_group $cb;
471 544
472 aioreq_pri $pri; 545 aioreq_pri $pri;
473 add $grp aio_open $path, O_RDONLY, 0, sub { 546 add $grp aio_open $path, O_RDONLY, 0, sub {
474 my ($fh) = @_ 547 my $fh = shift
475 or return $grp->result (-1); 548 or return $grp->result (-1);
476 549
477 aioreq_pri $pri; 550 aioreq_pri $pri;
478 add $grp aio_read $fh, 0, (-s $fh), $$data, 0, sub { 551 add $grp aio_read $fh, 0, (-s $fh), $$data, 0, sub {
479 $grp->result ($_[0]); 552 $grp->result ($_[0]);
1078Strictly equivalent to: 1151Strictly equivalent to:
1079 1152
1080 IO::AIO::poll_wait, IO::AIO::poll_cb 1153 IO::AIO::poll_wait, IO::AIO::poll_cb
1081 while IO::AIO::nreqs; 1154 while IO::AIO::nreqs;
1082 1155
1156=back
1157
1083=head3 CONTROLLING THE NUMBER OF THREADS 1158=head3 CONTROLLING THE NUMBER OF THREADS
1159
1160=over
1084 1161
1085=item IO::AIO::min_parallel $nthreads 1162=item IO::AIO::min_parallel $nthreads
1086 1163
1087Set the minimum number of AIO threads to C<$nthreads>. The current 1164Set the minimum number of AIO threads to C<$nthreads>. The current
1088default is C<8>, which means eight asynchronous operations can execute 1165default is C<8>, which means eight asynchronous operations can execute
1136This is a very bad function to use in interactive programs because it 1213This is a very bad function to use in interactive programs because it
1137blocks, and a bad way to reduce concurrency because it is inexact: Better 1214blocks, and a bad way to reduce concurrency because it is inexact: Better
1138use an C<aio_group> together with a feed callback. 1215use an C<aio_group> together with a feed callback.
1139 1216
1140Sets the maximum number of outstanding requests to C<$nreqs>. If you 1217Sets the maximum number of outstanding requests to C<$nreqs>. If you
1141to queue up more than this number of requests, the next call to the 1218do queue up more than this number of requests, the next call to the
1142C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>) 1219C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
1143function will block until the limit is no longer exceeded. 1220function will block until the limit is no longer exceeded.
1144 1221
1145The default value is very large, so there is no practical limit on the 1222The default value is very large, so there is no practical limit on the
1146number of outstanding requests. 1223number of outstanding requests.
1147 1224
1148You can still queue as many requests as you want. Therefore, 1225You can still queue as many requests as you want. Therefore,
1149C<max_oustsanding> is mainly useful in simple scripts (with low values) or 1226C<max_oustsanding> is mainly useful in simple scripts (with low values) or
1150as a stop gap to shield against fatal memory overflow (with large values). 1227as a stop gap to shield against fatal memory overflow (with large values).
1151 1228
1229=back
1230
1152=head3 STATISTICAL INFORMATION 1231=head3 STATISTICAL INFORMATION
1232
1233=over
1153 1234
1154=item IO::AIO::nreqs 1235=item IO::AIO::nreqs
1155 1236
1156Returns the number of requests currently in the ready, execute or pending 1237Returns the number of requests currently in the ready, execute or pending
1157states (i.e. for which their callback has not been invoked yet). 1238states (i.e. for which their callback has not been invoked yet).
1172but not yet processed by poll_cb). 1253but not yet processed by poll_cb).
1173 1254
1174=back 1255=back
1175 1256
1176=cut 1257=cut
1177
1178# support function to convert a fd into a perl filehandle
1179sub _fd2fh {
1180 return undef if $_[0] < 0;
1181
1182 # try to generate nice filehandles
1183 my $sym = "IO::AIO::fd#$_[0]";
1184 local *$sym;
1185
1186 open *$sym, "+<&=$_[0]" # usually works under any unix
1187 or open *$sym, "<&=$_[0]" # cygwin needs this
1188 or open *$sym, ">&=$_[0]" # or this
1189 or return undef;
1190
1191 *$sym
1192}
1193 1258
1194min_parallel 8; 1259min_parallel 8;
1195 1260
1196END { flush } 1261END { flush }
1197 1262
1221bytes of memory. In addition, stat requests need a stat buffer (possibly 1286bytes of memory. In addition, stat requests need a stat buffer (possibly
1222a few hundred bytes), readdir requires a result buffer and so on. Perl 1287a few hundred bytes), readdir requires a result buffer and so on. Perl
1223scalars and other data passed into aio requests will also be locked and 1288scalars and other data passed into aio requests will also be locked and
1224will consume memory till the request has entered the done state. 1289will consume memory till the request has entered the done state.
1225 1290
1226This is now awfully much, so queuing lots of requests is not usually a 1291This is not awfully much, so queuing lots of requests is not usually a
1227problem. 1292problem.
1228 1293
1229Per-thread usage: 1294Per-thread usage:
1230 1295
1231In the execution phase, some aio requests require more memory for 1296In the execution phase, some aio requests require more memory for

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines