ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.103 by root, Tue Jan 23 22:57:34 2007 UTC vs.
Revision 1.119 by root, Sun Dec 2 20:54:33 2007 UTC

26 $req->cancel; # cancel request if still in queue 26 $req->cancel; # cancel request if still in queue
27 27
28 my $grp = aio_group sub { print "all stats done\n" }; 28 my $grp = aio_group sub { print "all stats done\n" };
29 add $grp aio_stat "..." for ...; 29 add $grp aio_stat "..." for ...;
30 30
31 # AnyEvent integration 31 # AnyEvent integration (EV, Event, Glib, Tk, urxvt, pureperl...)
32 open my $fh, "<&=" . IO::AIO::poll_fileno or die "$!"; 32 open my $fh, "<&=" . IO::AIO::poll_fileno or die "$!";
33 my $w = AnyEvent->io (fh => $fh, poll => 'r', cb => sub { IO::AIO::poll_cb }); 33 my $w = AnyEvent->io (fh => $fh, poll => 'r', cb => sub { IO::AIO::poll_cb });
34
35 # EV integration
36 my $w = EV::io IO::AIO::poll_fileno, EV::READ, \&IO::AIO::poll_cb;
34 37
35 # Event integration 38 # Event integration
36 Event->io (fd => IO::AIO::poll_fileno, 39 Event->io (fd => IO::AIO::poll_fileno,
37 poll => 'r', 40 poll => 'r',
38 cb => \&IO::AIO::poll_cb); 41 cb => \&IO::AIO::poll_cb);
62etc.), but can also be used to easily do operations in parallel that are 65etc.), but can also be used to easily do operations in parallel that are
63normally done sequentially, e.g. stat'ing many files, which is much faster 66normally done sequentially, e.g. stat'ing many files, which is much faster
64on a RAID volume or over NFS when you do a number of stat operations 67on a RAID volume or over NFS when you do a number of stat operations
65concurrently. 68concurrently.
66 69
67While most of this works on all types of file descriptors (for example 70While most of this works on all types of file descriptors (for
68sockets), using these functions on file descriptors that support 71example sockets), using these functions on file descriptors that
69nonblocking operation (again, sockets, pipes etc.) is very inefficient or 72support nonblocking operation (again, sockets, pipes etc.) is very
70might not work (aio_read fails on sockets/pipes/fifos). Use an event loop 73inefficient. Use an event loop for that (such as the L<Event|Event>
71for that (such as the L<Event|Event> module): IO::AIO will naturally fit 74module): IO::AIO will naturally fit into such an event loop itself.
72into such an event loop itself.
73 75
74In this version, a number of threads are started that execute your 76In this version, a number of threads are started that execute your
75requests and signal their completion. You don't need thread support 77requests and signal their completion. You don't need thread support
76in perl, and the threads created by this module will not be visible 78in perl, and the threads created by this module will not be visible
77to perl. In the future, this module might make use of the native aio 79to perl. In the future, this module might make use of the native aio
79not well-supported or restricted (GNU/Linux doesn't allow them on normal 81not well-supported or restricted (GNU/Linux doesn't allow them on normal
80files currently, for example), and they would only support aio_read and 82files currently, for example), and they would only support aio_read and
81aio_write, so the remaining functionality would have to be implemented 83aio_write, so the remaining functionality would have to be implemented
82using threads anyway. 84using threads anyway.
83 85
84Although the module will work with in the presence of other (Perl-) 86Although the module will work in the presence of other (Perl-) threads,
85threads, it is currently not reentrant in any way, so use appropriate 87it is currently not reentrant in any way, so use appropriate locking
86locking yourself, always call C<poll_cb> from within the same thread, or 88yourself, always call C<poll_cb> from within the same thread, or never
87never call C<poll_cb> (or other C<aio_> functions) recursively. 89call C<poll_cb> (or other C<aio_> functions) recursively.
88 90
89=head2 EXAMPLE 91=head2 EXAMPLE
90 92
91This is a simple example that uses the Event module and loads 93This is a simple example that uses the Event module and loads
92F</etc/passwd> asynchronously: 94F</etc/passwd> asynchronously:
184 186
185=cut 187=cut
186 188
187package IO::AIO; 189package IO::AIO;
188 190
191use Carp ();
192
189no warnings; 193no warnings;
190use strict 'vars'; 194use strict 'vars';
191 195
192use base 'Exporter'; 196use base 'Exporter';
193 197
194BEGIN { 198BEGIN {
195 our $VERSION = '2.33'; 199 our $VERSION = '2.6';
196 200
197 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat 201 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat
198 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink 202 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink
199 aio_readlink aio_fsync aio_fdatasync aio_readahead aio_rename aio_link 203 aio_readlink aio_sync aio_fsync aio_fdatasync aio_readahead aio_rename aio_link
200 aio_move aio_copy aio_group aio_nop aio_mknod aio_load aio_rmtree aio_mkdir); 204 aio_move aio_copy aio_group aio_nop aio_mknod aio_load aio_rmtree aio_mkdir
205 aio_chown aio_chmod aio_utime aio_truncate);
201 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice aio_block)); 206 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice aio_block));
202 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush 207 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
203 min_parallel max_parallel max_idle 208 min_parallel max_parallel max_idle
204 nreqs nready npending nthreads 209 nreqs nready npending nthreads
205 max_poll_time max_poll_reqs); 210 max_poll_time max_poll_reqs);
271 aio_read $_[0], ..., sub { 276 aio_read $_[0], ..., sub {
272 ... 277 ...
273 }; 278 };
274 }; 279 };
275 280
281
276=item aioreq_nice $pri_adjust 282=item aioreq_nice $pri_adjust
277 283
278Similar to C<aioreq_pri>, but subtracts the given value from the current 284Similar to C<aioreq_pri>, but subtracts the given value from the current
279priority, so the effect is cumulative. 285priority, so the effect is cumulative.
286
280 287
281=item aio_open $pathname, $flags, $mode, $callback->($fh) 288=item aio_open $pathname, $flags, $mode, $callback->($fh)
282 289
283Asynchronously open or create a file and call the callback with a newly 290Asynchronously open or create a file and call the callback with a newly
284created filehandle for the file. 291created filehandle for the file.
305 } else { 312 } else {
306 die "open failed: $!\n"; 313 die "open failed: $!\n";
307 } 314 }
308 }; 315 };
309 316
317
310=item aio_close $fh, $callback->($status) 318=item aio_close $fh, $callback->($status)
311 319
312Asynchronously close a file and call the callback with the result 320Asynchronously close a file and call the callback with the result
313code. I<WARNING:> although accepted, you should not pass in a perl 321code.
314filehandle here, as perl will likely close the file descriptor another
315time when the filehandle is destroyed. Normally, you can safely call perls
316C<close> or just let filehandles go out of scope.
317 322
318This is supposed to be a bug in the API, so that might change. It's 323Unfortunately, you can't do this to perl. Perl I<insists> very strongly on
319therefore best to avoid this function. 324closing the file descriptor associated with the filehandle itself. Here is
325what aio_close will try:
326
327 1. dup()licate the fd
328 2. asynchronously close() the duplicated fd
329 3. dup()licate the fd once more
330 4. let perl close() the filehandle
331 5. asynchronously close the duplicated fd
332
333The idea is that the first close() flushes stuff to disk that closing an
334fd will flush, so when perl closes the fd, nothing much will need to be
335flushed. The second async. close() will then flush stuff to disk that
336closing the last fd to the file will flush.
337
338Just FYI, SuSv3 has this to say on close:
339
340 All outstanding record locks owned by the process on the file
341 associated with the file descriptor shall be removed.
342
343 If fildes refers to a socket, close() shall cause the socket to be
344 destroyed. ... close() shall block for up to the current linger
345 interval until all data is transmitted.
346 [this actually sounds like a specification bug, but who knows]
347
348And at least Linux additionally actually flushes stuff on every close,
349even when the file itself is still open.
350
351Sounds enourmously inefficient and complicated? Yes... please show me how
352to nuke perl's fd out of existence...
353
354=cut
355
356sub aio_close($;$) {
357 aio_block {
358 my ($fh, $cb) = @_;
359
360 my $pri = aioreq_pri;
361 my $grp = aio_group $cb;
362
363 my $fd = fileno $fh;
364
365 defined $fd or Carp::croak "aio_close called with fd-less filehandle";
366
367 # if the dups fail we will simply get EBADF
368 my $fd2 = _dup $fd;
369 aioreq_pri $pri;
370 add $grp _aio_close $fd2, sub {
371 my $fd2 = _dup $fd;
372 close $fh;
373 aioreq_pri $pri;
374 add $grp _aio_close $fd2, sub {
375 $grp->result ($_[0]);
376 };
377 };
378
379 $grp
380 }
381}
382
320 383
321=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 384=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
322 385
323=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 386=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
324 387
325Reads or writes C<length> bytes from the specified C<fh> and C<offset> 388Reads or writes C<$length> bytes from the specified C<$fh> and C<$offset>
326into the scalar given by C<data> and offset C<dataoffset> and calls the 389into the scalar given by C<$data> and offset C<$dataoffset> and calls the
327callback without the actual number of bytes read (or -1 on error, just 390callback without the actual number of bytes read (or -1 on error, just
328like the syscall). 391like the syscall).
329 392
393If C<$offset> is undefined, then the current file descriptor offset will
394be used (and updated), otherwise the file descriptor offset will not be
395changed by these calls.
396
397If C<$length> is undefined in C<aio_write>, use the remaining length of C<$data>.
398
399If C<$dataoffset> is less than zero, it will be counted from the end of
400C<$data>.
401
330The C<$data> scalar I<MUST NOT> be modified in any way while the request 402The C<$data> scalar I<MUST NOT> be modified in any way while the request
331is outstanding. Modifying it can result in segfaults or WW3 (if the 403is outstanding. Modifying it can result in segfaults or World War III (if
332necessary/optional hardware is installed). 404the necessary/optional hardware is installed).
333 405
334Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at 406Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at
335offset C<0> within the scalar: 407offset C<0> within the scalar:
336 408
337 aio_read $fh, 7, 15, $buffer, 0, sub { 409 aio_read $fh, 7, 15, $buffer, 0, sub {
338 $_[0] > 0 or die "read error: $!"; 410 $_[0] > 0 or die "read error: $!";
339 print "read $_[0] bytes: <$buffer>\n"; 411 print "read $_[0] bytes: <$buffer>\n";
340 }; 412 };
413
341 414
342=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval) 415=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval)
343 416
344Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts 417Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts
345reading at byte offset C<$in_offset>, and starts writing at the current 418reading at byte offset C<$in_offset>, and starts writing at the current
359C<$in_fh> than are written, and there is no way to find out how many 432C<$in_fh> than are written, and there is no way to find out how many
360bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only 433bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only
361provides the number of bytes written to C<$out_fh>. Only if the result 434provides the number of bytes written to C<$out_fh>. Only if the result
362value equals C<$length> one can assume that C<$length> bytes have been 435value equals C<$length> one can assume that C<$length> bytes have been
363read. 436read.
437
364 438
365=item aio_readahead $fh,$offset,$length, $callback->($retval) 439=item aio_readahead $fh,$offset,$length, $callback->($retval)
366 440
367C<aio_readahead> populates the page cache with data from a file so that 441C<aio_readahead> populates the page cache with data from a file so that
368subsequent reads from that file will not block on disk I/O. The C<$offset> 442subsequent reads from that file will not block on disk I/O. The C<$offset>
374file. The current file offset of the file is left unchanged. 448file. The current file offset of the file is left unchanged.
375 449
376If that syscall doesn't exist (likely if your OS isn't Linux) it will be 450If that syscall doesn't exist (likely if your OS isn't Linux) it will be
377emulated by simply reading the data, which would have a similar effect. 451emulated by simply reading the data, which would have a similar effect.
378 452
453
379=item aio_stat $fh_or_path, $callback->($status) 454=item aio_stat $fh_or_path, $callback->($status)
380 455
381=item aio_lstat $fh, $callback->($status) 456=item aio_lstat $fh, $callback->($status)
382 457
383Works like perl's C<stat> or C<lstat> in void context. The callback will 458Works like perl's C<stat> or C<lstat> in void context. The callback will
396 aio_stat "/etc/passwd", sub { 471 aio_stat "/etc/passwd", sub {
397 $_[0] and die "stat failed: $!"; 472 $_[0] and die "stat failed: $!";
398 print "size is ", -s _, "\n"; 473 print "size is ", -s _, "\n";
399 }; 474 };
400 475
476
477=item aio_utime $fh_or_path, $atime, $mtime, $callback->($status)
478
479Works like perl's C<utime> function (including the special case of $atime
480and $mtime being undef). Fractional times are supported if the underlying
481syscalls support them.
482
483When called with a pathname, uses utimes(2) if available, otherwise
484utime(2). If called on a file descriptor, uses futimes(2) if available,
485otherwise returns ENOSYS, so this is not portable.
486
487Examples:
488
489 # set atime and mtime to current time (basically touch(1)):
490 aio_utime "path", undef, undef;
491 # set atime to current time and mtime to beginning of the epoch:
492 aio_utime "path", time, undef; # undef==0
493
494
495=item aio_chown $fh_or_path, $uid, $gid, $callback->($status)
496
497Works like perl's C<chown> function, except that C<undef> for either $uid
498or $gid is being interpreted as "do not change" (but -1 can also be used).
499
500Examples:
501
502 # same as "chown root path" in the shell:
503 aio_chown "path", 0, -1;
504 # same as above:
505 aio_chown "path", 0, undef;
506
507
508=item aio_truncate $fh_or_path, $offset, $callback->($status)
509
510Works like truncate(2) or ftruncate(2).
511
512
513=item aio_chmod $fh_or_path, $mode, $callback->($status)
514
515Works like perl's C<chmod> function.
516
517
401=item aio_unlink $pathname, $callback->($status) 518=item aio_unlink $pathname, $callback->($status)
402 519
403Asynchronously unlink (delete) a file and call the callback with the 520Asynchronously unlink (delete) a file and call the callback with the
404result code. 521result code.
405 522
523
406=item aio_mknod $path, $mode, $dev, $callback->($status) 524=item aio_mknod $path, $mode, $dev, $callback->($status)
407 525
408[EXPERIMENTAL] 526[EXPERIMENTAL]
409 527
410Asynchronously create a device node (or fifo). See mknod(2). 528Asynchronously create a device node (or fifo). See mknod(2).
411 529
412The only (POSIX-) portable way of calling this function is: 530The only (POSIX-) portable way of calling this function is:
413 531
414 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ... 532 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ...
533
415 534
416=item aio_link $srcpath, $dstpath, $callback->($status) 535=item aio_link $srcpath, $dstpath, $callback->($status)
417 536
418Asynchronously create a new link to the existing object at C<$srcpath> at 537Asynchronously create a new link to the existing object at C<$srcpath> at
419the path C<$dstpath> and call the callback with the result code. 538the path C<$dstpath> and call the callback with the result code.
420 539
540
421=item aio_symlink $srcpath, $dstpath, $callback->($status) 541=item aio_symlink $srcpath, $dstpath, $callback->($status)
422 542
423Asynchronously create a new symbolic link to the existing object at C<$srcpath> at 543Asynchronously create a new symbolic link to the existing object at C<$srcpath> at
424the path C<$dstpath> and call the callback with the result code. 544the path C<$dstpath> and call the callback with the result code.
545
425 546
426=item aio_readlink $path, $callback->($link) 547=item aio_readlink $path, $callback->($link)
427 548
428Asynchronously read the symlink specified by C<$path> and pass it to 549Asynchronously read the symlink specified by C<$path> and pass it to
429the callback. If an error occurs, nothing or undef gets passed to the 550the callback. If an error occurs, nothing or undef gets passed to the
430callback. 551callback.
431 552
553
432=item aio_rename $srcpath, $dstpath, $callback->($status) 554=item aio_rename $srcpath, $dstpath, $callback->($status)
433 555
434Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as 556Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as
435rename(2) and call the callback with the result code. 557rename(2) and call the callback with the result code.
558
436 559
437=item aio_mkdir $pathname, $mode, $callback->($status) 560=item aio_mkdir $pathname, $mode, $callback->($status)
438 561
439Asynchronously mkdir (create) a directory and call the callback with 562Asynchronously mkdir (create) a directory and call the callback with
440the result code. C<$mode> will be modified by the umask at the time the 563the result code. C<$mode> will be modified by the umask at the time the
441request is executed, so do not change your umask. 564request is executed, so do not change your umask.
442 565
566
443=item aio_rmdir $pathname, $callback->($status) 567=item aio_rmdir $pathname, $callback->($status)
444 568
445Asynchronously rmdir (delete) a directory and call the callback with the 569Asynchronously rmdir (delete) a directory and call the callback with the
446result code. 570result code.
571
447 572
448=item aio_readdir $pathname, $callback->($entries) 573=item aio_readdir $pathname, $callback->($entries)
449 574
450Unlike the POSIX call of the same name, C<aio_readdir> reads an entire 575Unlike the POSIX call of the same name, C<aio_readdir> reads an entire
451directory (i.e. opendir + readdir + closedir). The entries will not be 576directory (i.e. opendir + readdir + closedir). The entries will not be
452sorted, and will B<NOT> include the C<.> and C<..> entries. 577sorted, and will B<NOT> include the C<.> and C<..> entries.
453 578
454The callback a single argument which is either C<undef> or an array-ref 579The callback a single argument which is either C<undef> or an array-ref
455with the filenames. 580with the filenames.
581
456 582
457=item aio_load $path, $data, $callback->($status) 583=item aio_load $path, $data, $callback->($status)
458 584
459This is a composite request that tries to fully load the given file into 585This is a composite request that tries to fully load the given file into
460memory. Status is the same as with aio_read. 586memory. Status is the same as with aio_read.
767 893
768 $grp 894 $grp
769 } 895 }
770} 896}
771 897
898=item aio_sync $callback->($status)
899
900Asynchronously call sync and call the callback when finished.
901
772=item aio_fsync $fh, $callback->($status) 902=item aio_fsync $fh, $callback->($status)
773 903
774Asynchronously call fsync on the given filehandle and call the callback 904Asynchronously call fsync on the given filehandle and call the callback
775with the fsync result code. 905with the fsync result code.
776 906
1078Strictly equivalent to: 1208Strictly equivalent to:
1079 1209
1080 IO::AIO::poll_wait, IO::AIO::poll_cb 1210 IO::AIO::poll_wait, IO::AIO::poll_cb
1081 while IO::AIO::nreqs; 1211 while IO::AIO::nreqs;
1082 1212
1213=back
1214
1083=head3 CONTROLLING THE NUMBER OF THREADS 1215=head3 CONTROLLING THE NUMBER OF THREADS
1216
1217=over
1084 1218
1085=item IO::AIO::min_parallel $nthreads 1219=item IO::AIO::min_parallel $nthreads
1086 1220
1087Set the minimum number of AIO threads to C<$nthreads>. The current 1221Set the minimum number of AIO threads to C<$nthreads>. The current
1088default is C<8>, which means eight asynchronous operations can execute 1222default is C<8>, which means eight asynchronous operations can execute
1136This is a very bad function to use in interactive programs because it 1270This is a very bad function to use in interactive programs because it
1137blocks, and a bad way to reduce concurrency because it is inexact: Better 1271blocks, and a bad way to reduce concurrency because it is inexact: Better
1138use an C<aio_group> together with a feed callback. 1272use an C<aio_group> together with a feed callback.
1139 1273
1140Sets the maximum number of outstanding requests to C<$nreqs>. If you 1274Sets the maximum number of outstanding requests to C<$nreqs>. If you
1141to queue up more than this number of requests, the next call to the 1275do queue up more than this number of requests, the next call to the
1142C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>) 1276C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
1143function will block until the limit is no longer exceeded. 1277function will block until the limit is no longer exceeded.
1144 1278
1145The default value is very large, so there is no practical limit on the 1279The default value is very large, so there is no practical limit on the
1146number of outstanding requests. 1280number of outstanding requests.
1147 1281
1148You can still queue as many requests as you want. Therefore, 1282You can still queue as many requests as you want. Therefore,
1149C<max_oustsanding> is mainly useful in simple scripts (with low values) or 1283C<max_oustsanding> is mainly useful in simple scripts (with low values) or
1150as a stop gap to shield against fatal memory overflow (with large values). 1284as a stop gap to shield against fatal memory overflow (with large values).
1151 1285
1286=back
1287
1152=head3 STATISTICAL INFORMATION 1288=head3 STATISTICAL INFORMATION
1289
1290=over
1153 1291
1154=item IO::AIO::nreqs 1292=item IO::AIO::nreqs
1155 1293
1156Returns the number of requests currently in the ready, execute or pending 1294Returns the number of requests currently in the ready, execute or pending
1157states (i.e. for which their callback has not been invoked yet). 1295states (i.e. for which their callback has not been invoked yet).
1172but not yet processed by poll_cb). 1310but not yet processed by poll_cb).
1173 1311
1174=back 1312=back
1175 1313
1176=cut 1314=cut
1177
1178# support function to convert a fd into a perl filehandle
1179sub _fd2fh {
1180 return undef if $_[0] < 0;
1181
1182 # try to generate nice filehandles
1183 my $sym = "IO::AIO::fd#$_[0]";
1184 local *$sym;
1185
1186 open *$sym, "+<&=$_[0]" # usually works under any unix
1187 or open *$sym, "<&=$_[0]" # cygwin needs this
1188 or open *$sym, ">&=$_[0]" # or this
1189 or return undef;
1190
1191 *$sym
1192}
1193 1315
1194min_parallel 8; 1316min_parallel 8;
1195 1317
1196END { flush } 1318END { flush }
1197 1319
1221bytes of memory. In addition, stat requests need a stat buffer (possibly 1343bytes of memory. In addition, stat requests need a stat buffer (possibly
1222a few hundred bytes), readdir requires a result buffer and so on. Perl 1344a few hundred bytes), readdir requires a result buffer and so on. Perl
1223scalars and other data passed into aio requests will also be locked and 1345scalars and other data passed into aio requests will also be locked and
1224will consume memory till the request has entered the done state. 1346will consume memory till the request has entered the done state.
1225 1347
1226This is now awfully much, so queuing lots of requests is not usually a 1348This is not awfully much, so queuing lots of requests is not usually a
1227problem. 1349problem.
1228 1350
1229Per-thread usage: 1351Per-thread usage:
1230 1352
1231In the execution phase, some aio requests require more memory for 1353In the execution phase, some aio requests require more memory for

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines