ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.89 by root, Sun Oct 29 11:03:18 2006 UTC vs.
Revision 1.119 by root, Sun Dec 2 20:54:33 2007 UTC

5=head1 SYNOPSIS 5=head1 SYNOPSIS
6 6
7 use IO::AIO; 7 use IO::AIO;
8 8
9 aio_open "/etc/passwd", O_RDONLY, 0, sub { 9 aio_open "/etc/passwd", O_RDONLY, 0, sub {
10 my ($fh) = @_; 10 my $fh = shift
11 or die "/etc/passwd: $!";
11 ... 12 ...
12 }; 13 };
13 14
14 aio_unlink "/tmp/file", sub { }; 15 aio_unlink "/tmp/file", sub { };
15 16
25 $req->cancel; # cancel request if still in queue 26 $req->cancel; # cancel request if still in queue
26 27
27 my $grp = aio_group sub { print "all stats done\n" }; 28 my $grp = aio_group sub { print "all stats done\n" };
28 add $grp aio_stat "..." for ...; 29 add $grp aio_stat "..." for ...;
29 30
30 # AnyEvent integration 31 # AnyEvent integration (EV, Event, Glib, Tk, urxvt, pureperl...)
31 open my $fh, "<&=" . IO::AIO::poll_fileno or die "$!"; 32 open my $fh, "<&=" . IO::AIO::poll_fileno or die "$!";
32 my $w = AnyEvent->io (fh => $fh, poll => 'r', cb => sub { IO::AIO::poll_cb }); 33 my $w = AnyEvent->io (fh => $fh, poll => 'r', cb => sub { IO::AIO::poll_cb });
34
35 # EV integration
36 my $w = EV::io IO::AIO::poll_fileno, EV::READ, \&IO::AIO::poll_cb;
33 37
34 # Event integration 38 # Event integration
35 Event->io (fd => IO::AIO::poll_fileno, 39 Event->io (fd => IO::AIO::poll_fileno,
36 poll => 'r', 40 poll => 'r',
37 cb => \&IO::AIO::poll_cb); 41 cb => \&IO::AIO::poll_cb);
61etc.), but can also be used to easily do operations in parallel that are 65etc.), but can also be used to easily do operations in parallel that are
62normally done sequentially, e.g. stat'ing many files, which is much faster 66normally done sequentially, e.g. stat'ing many files, which is much faster
63on a RAID volume or over NFS when you do a number of stat operations 67on a RAID volume or over NFS when you do a number of stat operations
64concurrently. 68concurrently.
65 69
66While this works on all types of file descriptors (for example sockets), 70While most of this works on all types of file descriptors (for
67using these functions on file descriptors that support nonblocking 71example sockets), using these functions on file descriptors that
68operation (again, sockets, pipes etc.) is very inefficient. Use an event 72support nonblocking operation (again, sockets, pipes etc.) is very
69loop for that (such as the L<Event|Event> module): IO::AIO will naturally 73inefficient. Use an event loop for that (such as the L<Event|Event>
70fit into such an event loop itself. 74module): IO::AIO will naturally fit into such an event loop itself.
71 75
72In this version, a number of threads are started that execute your 76In this version, a number of threads are started that execute your
73requests and signal their completion. You don't need thread support 77requests and signal their completion. You don't need thread support
74in perl, and the threads created by this module will not be visible 78in perl, and the threads created by this module will not be visible
75to perl. In the future, this module might make use of the native aio 79to perl. In the future, this module might make use of the native aio
77not well-supported or restricted (GNU/Linux doesn't allow them on normal 81not well-supported or restricted (GNU/Linux doesn't allow them on normal
78files currently, for example), and they would only support aio_read and 82files currently, for example), and they would only support aio_read and
79aio_write, so the remaining functionality would have to be implemented 83aio_write, so the remaining functionality would have to be implemented
80using threads anyway. 84using threads anyway.
81 85
82Although the module will work with in the presence of other (Perl-) 86Although the module will work in the presence of other (Perl-) threads,
83threads, it is currently not reentrant in any way, so use appropriate 87it is currently not reentrant in any way, so use appropriate locking
84locking yourself, always call C<poll_cb> from within the same thread, or 88yourself, always call C<poll_cb> from within the same thread, or never
85never call C<poll_cb> (or other C<aio_> functions) recursively. 89call C<poll_cb> (or other C<aio_> functions) recursively.
86 90
87=head2 EXAMPLE 91=head2 EXAMPLE
88 92
89This is a simple example that uses the Event module and loads 93This is a simple example that uses the Event module and loads
90F</etc/passwd> asynchronously: 94F</etc/passwd> asynchronously:
98 poll => 'r', 102 poll => 'r',
99 cb => \&IO::AIO::poll_cb); 103 cb => \&IO::AIO::poll_cb);
100 104
101 # queue the request to open /etc/passwd 105 # queue the request to open /etc/passwd
102 aio_open "/etc/passwd", O_RDONLY, 0, sub { 106 aio_open "/etc/passwd", O_RDONLY, 0, sub {
103 my $fh = $_[0] 107 my $fh = shift
104 or die "error while opening: $!"; 108 or die "error while opening: $!";
105 109
106 # stat'ing filehandles is generally non-blocking 110 # stat'ing filehandles is generally non-blocking
107 my $size = -s $fh; 111 my $size = -s $fh;
108 112
182 186
183=cut 187=cut
184 188
185package IO::AIO; 189package IO::AIO;
186 190
191use Carp ();
192
187no warnings; 193no warnings;
188use strict 'vars'; 194use strict 'vars';
189 195
190use base 'Exporter'; 196use base 'Exporter';
191 197
192BEGIN { 198BEGIN {
193 our $VERSION = '2.1'; 199 our $VERSION = '2.6';
194 200
195 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat 201 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat
196 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink 202 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink
197 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move 203 aio_readlink aio_sync aio_fsync aio_fdatasync aio_readahead aio_rename aio_link
198 aio_copy aio_group aio_nop aio_mknod); 204 aio_move aio_copy aio_group aio_nop aio_mknod aio_load aio_rmtree aio_mkdir
205 aio_chown aio_chmod aio_utime aio_truncate);
199 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice)); 206 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice aio_block));
200 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush 207 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
201 min_parallel max_parallel max_idle 208 min_parallel max_parallel max_idle
202 nreqs nready npending nthreads 209 nreqs nready npending nthreads
203 max_poll_time max_poll_reqs); 210 max_poll_time max_poll_reqs);
204 211
269 aio_read $_[0], ..., sub { 276 aio_read $_[0], ..., sub {
270 ... 277 ...
271 }; 278 };
272 }; 279 };
273 280
281
274=item aioreq_nice $pri_adjust 282=item aioreq_nice $pri_adjust
275 283
276Similar to C<aioreq_pri>, but subtracts the given value from the current 284Similar to C<aioreq_pri>, but subtracts the given value from the current
277priority, so the effect is cumulative. 285priority, so the effect is cumulative.
286
278 287
279=item aio_open $pathname, $flags, $mode, $callback->($fh) 288=item aio_open $pathname, $flags, $mode, $callback->($fh)
280 289
281Asynchronously open or create a file and call the callback with a newly 290Asynchronously open or create a file and call the callback with a newly
282created filehandle for the file. 291created filehandle for the file.
288list. They are the same as used by C<sysopen>. 297list. They are the same as used by C<sysopen>.
289 298
290Likewise, C<$mode> specifies the mode of the newly created file, if it 299Likewise, C<$mode> specifies the mode of the newly created file, if it
291didn't exist and C<O_CREAT> has been given, just like perl's C<sysopen>, 300didn't exist and C<O_CREAT> has been given, just like perl's C<sysopen>,
292except that it is mandatory (i.e. use C<0> if you don't create new files, 301except that it is mandatory (i.e. use C<0> if you don't create new files,
293and C<0666> or C<0777> if you do). 302and C<0666> or C<0777> if you do). Note that the C<$mode> will be modified
303by the umask in effect then the request is being executed, so better never
304change the umask.
294 305
295Example: 306Example:
296 307
297 aio_open "/etc/passwd", O_RDONLY, 0, sub { 308 aio_open "/etc/passwd", O_RDONLY, 0, sub {
298 if ($_[0]) { 309 if ($_[0]) {
301 } else { 312 } else {
302 die "open failed: $!\n"; 313 die "open failed: $!\n";
303 } 314 }
304 }; 315 };
305 316
317
306=item aio_close $fh, $callback->($status) 318=item aio_close $fh, $callback->($status)
307 319
308Asynchronously close a file and call the callback with the result 320Asynchronously close a file and call the callback with the result
309code. I<WARNING:> although accepted, you should not pass in a perl 321code.
310filehandle here, as perl will likely close the file descriptor another
311time when the filehandle is destroyed. Normally, you can safely call perls
312C<close> or just let filehandles go out of scope.
313 322
314This is supposed to be a bug in the API, so that might change. It's 323Unfortunately, you can't do this to perl. Perl I<insists> very strongly on
315therefore best to avoid this function. 324closing the file descriptor associated with the filehandle itself. Here is
325what aio_close will try:
326
327 1. dup()licate the fd
328 2. asynchronously close() the duplicated fd
329 3. dup()licate the fd once more
330 4. let perl close() the filehandle
331 5. asynchronously close the duplicated fd
332
333The idea is that the first close() flushes stuff to disk that closing an
334fd will flush, so when perl closes the fd, nothing much will need to be
335flushed. The second async. close() will then flush stuff to disk that
336closing the last fd to the file will flush.
337
338Just FYI, SuSv3 has this to say on close:
339
340 All outstanding record locks owned by the process on the file
341 associated with the file descriptor shall be removed.
342
343 If fildes refers to a socket, close() shall cause the socket to be
344 destroyed. ... close() shall block for up to the current linger
345 interval until all data is transmitted.
346 [this actually sounds like a specification bug, but who knows]
347
348And at least Linux additionally actually flushes stuff on every close,
349even when the file itself is still open.
350
351Sounds enourmously inefficient and complicated? Yes... please show me how
352to nuke perl's fd out of existence...
353
354=cut
355
356sub aio_close($;$) {
357 aio_block {
358 my ($fh, $cb) = @_;
359
360 my $pri = aioreq_pri;
361 my $grp = aio_group $cb;
362
363 my $fd = fileno $fh;
364
365 defined $fd or Carp::croak "aio_close called with fd-less filehandle";
366
367 # if the dups fail we will simply get EBADF
368 my $fd2 = _dup $fd;
369 aioreq_pri $pri;
370 add $grp _aio_close $fd2, sub {
371 my $fd2 = _dup $fd;
372 close $fh;
373 aioreq_pri $pri;
374 add $grp _aio_close $fd2, sub {
375 $grp->result ($_[0]);
376 };
377 };
378
379 $grp
380 }
381}
382
316 383
317=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 384=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
318 385
319=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 386=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
320 387
321Reads or writes C<length> bytes from the specified C<fh> and C<offset> 388Reads or writes C<$length> bytes from the specified C<$fh> and C<$offset>
322into the scalar given by C<data> and offset C<dataoffset> and calls the 389into the scalar given by C<$data> and offset C<$dataoffset> and calls the
323callback without the actual number of bytes read (or -1 on error, just 390callback without the actual number of bytes read (or -1 on error, just
324like the syscall). 391like the syscall).
325 392
393If C<$offset> is undefined, then the current file descriptor offset will
394be used (and updated), otherwise the file descriptor offset will not be
395changed by these calls.
396
397If C<$length> is undefined in C<aio_write>, use the remaining length of C<$data>.
398
399If C<$dataoffset> is less than zero, it will be counted from the end of
400C<$data>.
401
326The C<$data> scalar I<MUST NOT> be modified in any way while the request 402The C<$data> scalar I<MUST NOT> be modified in any way while the request
327is outstanding. Modifying it can result in segfaults or WW3 (if the 403is outstanding. Modifying it can result in segfaults or World War III (if
328necessary/optional hardware is installed). 404the necessary/optional hardware is installed).
329 405
330Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at 406Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at
331offset C<0> within the scalar: 407offset C<0> within the scalar:
332 408
333 aio_read $fh, 7, 15, $buffer, 0, sub { 409 aio_read $fh, 7, 15, $buffer, 0, sub {
334 $_[0] > 0 or die "read error: $!"; 410 $_[0] > 0 or die "read error: $!";
335 print "read $_[0] bytes: <$buffer>\n"; 411 print "read $_[0] bytes: <$buffer>\n";
336 }; 412 };
413
337 414
338=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval) 415=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval)
339 416
340Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts 417Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts
341reading at byte offset C<$in_offset>, and starts writing at the current 418reading at byte offset C<$in_offset>, and starts writing at the current
355C<$in_fh> than are written, and there is no way to find out how many 432C<$in_fh> than are written, and there is no way to find out how many
356bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only 433bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only
357provides the number of bytes written to C<$out_fh>. Only if the result 434provides the number of bytes written to C<$out_fh>. Only if the result
358value equals C<$length> one can assume that C<$length> bytes have been 435value equals C<$length> one can assume that C<$length> bytes have been
359read. 436read.
437
360 438
361=item aio_readahead $fh,$offset,$length, $callback->($retval) 439=item aio_readahead $fh,$offset,$length, $callback->($retval)
362 440
363C<aio_readahead> populates the page cache with data from a file so that 441C<aio_readahead> populates the page cache with data from a file so that
364subsequent reads from that file will not block on disk I/O. The C<$offset> 442subsequent reads from that file will not block on disk I/O. The C<$offset>
370file. The current file offset of the file is left unchanged. 448file. The current file offset of the file is left unchanged.
371 449
372If that syscall doesn't exist (likely if your OS isn't Linux) it will be 450If that syscall doesn't exist (likely if your OS isn't Linux) it will be
373emulated by simply reading the data, which would have a similar effect. 451emulated by simply reading the data, which would have a similar effect.
374 452
453
375=item aio_stat $fh_or_path, $callback->($status) 454=item aio_stat $fh_or_path, $callback->($status)
376 455
377=item aio_lstat $fh, $callback->($status) 456=item aio_lstat $fh, $callback->($status)
378 457
379Works like perl's C<stat> or C<lstat> in void context. The callback will 458Works like perl's C<stat> or C<lstat> in void context. The callback will
392 aio_stat "/etc/passwd", sub { 471 aio_stat "/etc/passwd", sub {
393 $_[0] and die "stat failed: $!"; 472 $_[0] and die "stat failed: $!";
394 print "size is ", -s _, "\n"; 473 print "size is ", -s _, "\n";
395 }; 474 };
396 475
476
477=item aio_utime $fh_or_path, $atime, $mtime, $callback->($status)
478
479Works like perl's C<utime> function (including the special case of $atime
480and $mtime being undef). Fractional times are supported if the underlying
481syscalls support them.
482
483When called with a pathname, uses utimes(2) if available, otherwise
484utime(2). If called on a file descriptor, uses futimes(2) if available,
485otherwise returns ENOSYS, so this is not portable.
486
487Examples:
488
489 # set atime and mtime to current time (basically touch(1)):
490 aio_utime "path", undef, undef;
491 # set atime to current time and mtime to beginning of the epoch:
492 aio_utime "path", time, undef; # undef==0
493
494
495=item aio_chown $fh_or_path, $uid, $gid, $callback->($status)
496
497Works like perl's C<chown> function, except that C<undef> for either $uid
498or $gid is being interpreted as "do not change" (but -1 can also be used).
499
500Examples:
501
502 # same as "chown root path" in the shell:
503 aio_chown "path", 0, -1;
504 # same as above:
505 aio_chown "path", 0, undef;
506
507
508=item aio_truncate $fh_or_path, $offset, $callback->($status)
509
510Works like truncate(2) or ftruncate(2).
511
512
513=item aio_chmod $fh_or_path, $mode, $callback->($status)
514
515Works like perl's C<chmod> function.
516
517
397=item aio_unlink $pathname, $callback->($status) 518=item aio_unlink $pathname, $callback->($status)
398 519
399Asynchronously unlink (delete) a file and call the callback with the 520Asynchronously unlink (delete) a file and call the callback with the
400result code. 521result code.
401 522
523
402=item aio_mknod $path, $mode, $dev, $callback->($status) 524=item aio_mknod $path, $mode, $dev, $callback->($status)
403 525
404[EXPERIMENTAL] 526[EXPERIMENTAL]
405 527
406Asynchronously create a device node (or fifo). See mknod(2). 528Asynchronously create a device node (or fifo). See mknod(2).
407 529
408The only (POSIX-) portable way of calling this function is: 530The only (POSIX-) portable way of calling this function is:
409 531
410 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ... 532 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ...
533
411 534
412=item aio_link $srcpath, $dstpath, $callback->($status) 535=item aio_link $srcpath, $dstpath, $callback->($status)
413 536
414Asynchronously create a new link to the existing object at C<$srcpath> at 537Asynchronously create a new link to the existing object at C<$srcpath> at
415the path C<$dstpath> and call the callback with the result code. 538the path C<$dstpath> and call the callback with the result code.
416 539
540
417=item aio_symlink $srcpath, $dstpath, $callback->($status) 541=item aio_symlink $srcpath, $dstpath, $callback->($status)
418 542
419Asynchronously create a new symbolic link to the existing object at C<$srcpath> at 543Asynchronously create a new symbolic link to the existing object at C<$srcpath> at
420the path C<$dstpath> and call the callback with the result code. 544the path C<$dstpath> and call the callback with the result code.
421 545
546
547=item aio_readlink $path, $callback->($link)
548
549Asynchronously read the symlink specified by C<$path> and pass it to
550the callback. If an error occurs, nothing or undef gets passed to the
551callback.
552
553
422=item aio_rename $srcpath, $dstpath, $callback->($status) 554=item aio_rename $srcpath, $dstpath, $callback->($status)
423 555
424Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as 556Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as
425rename(2) and call the callback with the result code. 557rename(2) and call the callback with the result code.
426 558
559
560=item aio_mkdir $pathname, $mode, $callback->($status)
561
562Asynchronously mkdir (create) a directory and call the callback with
563the result code. C<$mode> will be modified by the umask at the time the
564request is executed, so do not change your umask.
565
566
427=item aio_rmdir $pathname, $callback->($status) 567=item aio_rmdir $pathname, $callback->($status)
428 568
429Asynchronously rmdir (delete) a directory and call the callback with the 569Asynchronously rmdir (delete) a directory and call the callback with the
430result code. 570result code.
571
431 572
432=item aio_readdir $pathname, $callback->($entries) 573=item aio_readdir $pathname, $callback->($entries)
433 574
434Unlike the POSIX call of the same name, C<aio_readdir> reads an entire 575Unlike the POSIX call of the same name, C<aio_readdir> reads an entire
435directory (i.e. opendir + readdir + closedir). The entries will not be 576directory (i.e. opendir + readdir + closedir). The entries will not be
436sorted, and will B<NOT> include the C<.> and C<..> entries. 577sorted, and will B<NOT> include the C<.> and C<..> entries.
437 578
438The callback a single argument which is either C<undef> or an array-ref 579The callback a single argument which is either C<undef> or an array-ref
439with the filenames. 580with the filenames.
581
582
583=item aio_load $path, $data, $callback->($status)
584
585This is a composite request that tries to fully load the given file into
586memory. Status is the same as with aio_read.
587
588=cut
589
590sub aio_load($$;$) {
591 aio_block {
592 my ($path, undef, $cb) = @_;
593 my $data = \$_[1];
594
595 my $pri = aioreq_pri;
596 my $grp = aio_group $cb;
597
598 aioreq_pri $pri;
599 add $grp aio_open $path, O_RDONLY, 0, sub {
600 my $fh = shift
601 or return $grp->result (-1);
602
603 aioreq_pri $pri;
604 add $grp aio_read $fh, 0, (-s $fh), $$data, 0, sub {
605 $grp->result ($_[0]);
606 };
607 };
608
609 $grp
610 }
611}
440 612
441=item aio_copy $srcpath, $dstpath, $callback->($status) 613=item aio_copy $srcpath, $dstpath, $callback->($status)
442 614
443Try to copy the I<file> (directories not supported as either source or 615Try to copy the I<file> (directories not supported as either source or
444destination) from C<$srcpath> to C<$dstpath> and call the callback with 616destination) from C<$srcpath> to C<$dstpath> and call the callback with
454errors are being ignored. 626errors are being ignored.
455 627
456=cut 628=cut
457 629
458sub aio_copy($$;$) { 630sub aio_copy($$;$) {
631 aio_block {
459 my ($src, $dst, $cb) = @_; 632 my ($src, $dst, $cb) = @_;
460 633
461 my $pri = aioreq_pri; 634 my $pri = aioreq_pri;
462 my $grp = aio_group $cb; 635 my $grp = aio_group $cb;
463 636
464 aioreq_pri $pri; 637 aioreq_pri $pri;
465 add $grp aio_open $src, O_RDONLY, 0, sub { 638 add $grp aio_open $src, O_RDONLY, 0, sub {
466 if (my $src_fh = $_[0]) { 639 if (my $src_fh = $_[0]) {
467 my @stat = stat $src_fh; 640 my @stat = stat $src_fh;
468 641
469 aioreq_pri $pri; 642 aioreq_pri $pri;
470 add $grp aio_open $dst, O_CREAT | O_WRONLY | O_TRUNC, 0200, sub { 643 add $grp aio_open $dst, O_CREAT | O_WRONLY | O_TRUNC, 0200, sub {
471 if (my $dst_fh = $_[0]) { 644 if (my $dst_fh = $_[0]) {
472 aioreq_pri $pri; 645 aioreq_pri $pri;
473 add $grp aio_sendfile $dst_fh, $src_fh, 0, $stat[7], sub { 646 add $grp aio_sendfile $dst_fh, $src_fh, 0, $stat[7], sub {
474 if ($_[0] == $stat[7]) { 647 if ($_[0] == $stat[7]) {
475 $grp->result (0); 648 $grp->result (0);
476 close $src_fh; 649 close $src_fh;
477 650
478 # those should not normally block. should. should. 651 # those should not normally block. should. should.
479 utime $stat[8], $stat[9], $dst; 652 utime $stat[8], $stat[9], $dst;
480 chmod $stat[2] & 07777, $dst_fh; 653 chmod $stat[2] & 07777, $dst_fh;
481 chown $stat[4], $stat[5], $dst_fh; 654 chown $stat[4], $stat[5], $dst_fh;
482 close $dst_fh; 655 close $dst_fh;
483 } else { 656 } else {
484 $grp->result (-1); 657 $grp->result (-1);
485 close $src_fh; 658 close $src_fh;
486 close $dst_fh; 659 close $dst_fh;
487 660
488 aioreq $pri; 661 aioreq $pri;
489 add $grp aio_unlink $dst; 662 add $grp aio_unlink $dst;
663 }
490 } 664 };
665 } else {
666 $grp->result (-1);
491 }; 667 }
492 } else {
493 $grp->result (-1);
494 } 668 },
669
670 } else {
671 $grp->result (-1);
495 }, 672 }
496
497 } else {
498 $grp->result (-1);
499 } 673 };
674
675 $grp
500 }; 676 }
501
502 $grp
503} 677}
504 678
505=item aio_move $srcpath, $dstpath, $callback->($status) 679=item aio_move $srcpath, $dstpath, $callback->($status)
506 680
507Try to move the I<file> (directories not supported as either source or 681Try to move the I<file> (directories not supported as either source or
513that is successful, unlinking the C<$srcpath>. 687that is successful, unlinking the C<$srcpath>.
514 688
515=cut 689=cut
516 690
517sub aio_move($$;$) { 691sub aio_move($$;$) {
692 aio_block {
518 my ($src, $dst, $cb) = @_; 693 my ($src, $dst, $cb) = @_;
519 694
520 my $pri = aioreq_pri; 695 my $pri = aioreq_pri;
521 my $grp = aio_group $cb; 696 my $grp = aio_group $cb;
522 697
523 aioreq_pri $pri; 698 aioreq_pri $pri;
524 add $grp aio_rename $src, $dst, sub { 699 add $grp aio_rename $src, $dst, sub {
525 if ($_[0] && $! == EXDEV) { 700 if ($_[0] && $! == EXDEV) {
526 aioreq_pri $pri; 701 aioreq_pri $pri;
527 add $grp aio_copy $src, $dst, sub { 702 add $grp aio_copy $src, $dst, sub {
703 $grp->result ($_[0]);
704
705 if (!$_[0]) {
706 aioreq_pri $pri;
707 add $grp aio_unlink $src;
708 }
709 };
710 } else {
528 $grp->result ($_[0]); 711 $grp->result ($_[0]);
529
530 if (!$_[0]) {
531 aioreq_pri $pri;
532 add $grp aio_unlink $src;
533 }
534 }; 712 }
535 } else {
536 $grp->result ($_[0]);
537 } 713 };
714
715 $grp
538 }; 716 }
539
540 $grp
541} 717}
542 718
543=item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs) 719=item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs)
544 720
545Scans a directory (similar to C<aio_readdir>) but additionally tries to 721Scans a directory (similar to C<aio_readdir>) but additionally tries to
592as those tend to return 0 or 1 as link counts, which disables the 768as those tend to return 0 or 1 as link counts, which disables the
593directory counting heuristic. 769directory counting heuristic.
594 770
595=cut 771=cut
596 772
597sub aio_scandir($$$) { 773sub aio_scandir($$;$) {
774 aio_block {
598 my ($path, $maxreq, $cb) = @_; 775 my ($path, $maxreq, $cb) = @_;
599 776
600 my $pri = aioreq_pri; 777 my $pri = aioreq_pri;
601 778
602 my $grp = aio_group $cb; 779 my $grp = aio_group $cb;
603 780
604 $maxreq = 4 if $maxreq <= 0; 781 $maxreq = 4 if $maxreq <= 0;
605 782
606 # stat once 783 # stat once
607 aioreq_pri $pri;
608 add $grp aio_stat $path, sub {
609 return $grp->result () if $_[0];
610 my $now = time;
611 my $hash1 = join ":", (stat _)[0,1,3,7,9];
612
613 # read the directory entries
614 aioreq_pri $pri; 784 aioreq_pri $pri;
615 add $grp aio_readdir $path, sub { 785 add $grp aio_stat $path, sub {
616 my $entries = shift
617 or return $grp->result (); 786 return $grp->result () if $_[0];
787 my $now = time;
788 my $hash1 = join ":", (stat _)[0,1,3,7,9];
618 789
619 # stat the dir another time 790 # read the directory entries
620 aioreq_pri $pri; 791 aioreq_pri $pri;
792 add $grp aio_readdir $path, sub {
793 my $entries = shift
794 or return $grp->result ();
795
796 # stat the dir another time
797 aioreq_pri $pri;
621 add $grp aio_stat $path, sub { 798 add $grp aio_stat $path, sub {
622 my $hash2 = join ":", (stat _)[0,1,3,7,9]; 799 my $hash2 = join ":", (stat _)[0,1,3,7,9];
623 800
624 my $ndirs; 801 my $ndirs;
625 802
626 # take the slow route if anything looks fishy 803 # take the slow route if anything looks fishy
627 if ($hash1 ne $hash2 or (stat _)[9] == $now) { 804 if ($hash1 ne $hash2 or (stat _)[9] == $now) {
628 $ndirs = -1; 805 $ndirs = -1;
629 } else { 806 } else {
630 # if nlink == 2, we are finished 807 # if nlink == 2, we are finished
631 # on non-posix-fs's, we rely on nlink < 2 808 # on non-posix-fs's, we rely on nlink < 2
632 $ndirs = (stat _)[3] - 2 809 $ndirs = (stat _)[3] - 2
633 or return $grp->result ([], $entries); 810 or return $grp->result ([], $entries);
634 } 811 }
635 812
636 # sort into likely dirs and likely nondirs 813 # sort into likely dirs and likely nondirs
637 # dirs == files without ".", short entries first 814 # dirs == files without ".", short entries first
638 $entries = [map $_->[0], 815 $entries = [map $_->[0],
639 sort { $b->[1] cmp $a->[1] } 816 sort { $b->[1] cmp $a->[1] }
640 map [$_, sprintf "%s%04d", (/.\./ ? "1" : "0"), length], 817 map [$_, sprintf "%s%04d", (/.\./ ? "1" : "0"), length],
641 @$entries]; 818 @$entries];
642 819
643 my (@dirs, @nondirs); 820 my (@dirs, @nondirs);
644 821
645 my $statgrp = add $grp aio_group sub { 822 my $statgrp = add $grp aio_group sub {
646 $grp->result (\@dirs, \@nondirs); 823 $grp->result (\@dirs, \@nondirs);
647 }; 824 };
648 825
649 limit $statgrp $maxreq; 826 limit $statgrp $maxreq;
650 feed $statgrp sub { 827 feed $statgrp sub {
651 return unless @$entries; 828 return unless @$entries;
652 my $entry = pop @$entries; 829 my $entry = pop @$entries;
653 830
654 aioreq_pri $pri; 831 aioreq_pri $pri;
655 add $statgrp aio_stat "$path/$entry/.", sub { 832 add $statgrp aio_stat "$path/$entry/.", sub {
656 if ($_[0] < 0) { 833 if ($_[0] < 0) {
657 push @nondirs, $entry; 834 push @nondirs, $entry;
658 } else { 835 } else {
659 # need to check for real directory 836 # need to check for real directory
660 aioreq_pri $pri; 837 aioreq_pri $pri;
661 add $statgrp aio_lstat "$path/$entry", sub { 838 add $statgrp aio_lstat "$path/$entry", sub {
662 if (-d _) { 839 if (-d _) {
663 push @dirs, $entry; 840 push @dirs, $entry;
664 841
665 unless (--$ndirs) { 842 unless (--$ndirs) {
666 push @nondirs, @$entries; 843 push @nondirs, @$entries;
667 feed $statgrp; 844 feed $statgrp;
845 }
846 } else {
847 push @nondirs, $entry;
668 } 848 }
669 } else {
670 push @nondirs, $entry;
671 } 849 }
672 } 850 }
673 } 851 };
674 }; 852 };
675 }; 853 };
676 }; 854 };
677 }; 855 };
856
857 $grp
678 }; 858 }
679
680 $grp
681} 859}
860
861=item aio_rmtree $path, $callback->($status)
862
863Delete a directory tree starting (and including) C<$path>, return the
864status of the final C<rmdir> only. This is a composite request that
865uses C<aio_scandir> to recurse into and rmdir directories, and unlink
866everything else.
867
868=cut
869
870sub aio_rmtree;
871sub aio_rmtree($;$) {
872 aio_block {
873 my ($path, $cb) = @_;
874
875 my $pri = aioreq_pri;
876 my $grp = aio_group $cb;
877
878 aioreq_pri $pri;
879 add $grp aio_scandir $path, 0, sub {
880 my ($dirs, $nondirs) = @_;
881
882 my $dirgrp = aio_group sub {
883 add $grp aio_rmdir $path, sub {
884 $grp->result ($_[0]);
885 };
886 };
887
888 (aioreq_pri $pri), add $dirgrp aio_rmtree "$path/$_" for @$dirs;
889 (aioreq_pri $pri), add $dirgrp aio_unlink "$path/$_" for @$nondirs;
890
891 add $grp $dirgrp;
892 };
893
894 $grp
895 }
896}
897
898=item aio_sync $callback->($status)
899
900Asynchronously call sync and call the callback when finished.
682 901
683=item aio_fsync $fh, $callback->($status) 902=item aio_fsync $fh, $callback->($status)
684 903
685Asynchronously call fsync on the given filehandle and call the callback 904Asynchronously call fsync on the given filehandle and call the callback
686with the fsync result code. 905with the fsync result code.
964 poll => 'r', nice => 1, 1183 poll => 'r', nice => 1,
965 cb => &IO::AIO::poll_cb); 1184 cb => &IO::AIO::poll_cb);
966 1185
967=item IO::AIO::poll_wait 1186=item IO::AIO::poll_wait
968 1187
1188If there are any outstanding requests and none of them in the result
969Wait till the result filehandle becomes ready for reading (simply does a 1189phase, wait till the result filehandle becomes ready for reading (simply
970C<select> on the filehandle. This is useful if you want to synchronously 1190does a C<select> on the filehandle. This is useful if you want to
971wait for some requests to finish). 1191synchronously wait for some requests to finish).
972 1192
973See C<nreqs> for an example. 1193See C<nreqs> for an example.
974 1194
975=item IO::AIO::poll 1195=item IO::AIO::poll
976 1196
977Waits until some requests have been handled. 1197Waits until some requests have been handled.
978 1198
1199Returns the number of requests processed, but is otherwise strictly
979Strictly equivalent to: 1200equivalent to:
980 1201
981 IO::AIO::poll_wait, IO::AIO::poll_cb 1202 IO::AIO::poll_wait, IO::AIO::poll_cb
982 if IO::AIO::nreqs;
983 1203
984=item IO::AIO::flush 1204=item IO::AIO::flush
985 1205
986Wait till all outstanding AIO requests have been handled. 1206Wait till all outstanding AIO requests have been handled.
987 1207
988Strictly equivalent to: 1208Strictly equivalent to:
989 1209
990 IO::AIO::poll_wait, IO::AIO::poll_cb 1210 IO::AIO::poll_wait, IO::AIO::poll_cb
991 while IO::AIO::nreqs; 1211 while IO::AIO::nreqs;
992 1212
1213=back
1214
993=head3 CONTROLLING THE NUMBER OF THREADS 1215=head3 CONTROLLING THE NUMBER OF THREADS
1216
1217=over
994 1218
995=item IO::AIO::min_parallel $nthreads 1219=item IO::AIO::min_parallel $nthreads
996 1220
997Set the minimum number of AIO threads to C<$nthreads>. The current 1221Set the minimum number of AIO threads to C<$nthreads>. The current
998default is C<8>, which means eight asynchronous operations can execute 1222default is C<8>, which means eight asynchronous operations can execute
1046This is a very bad function to use in interactive programs because it 1270This is a very bad function to use in interactive programs because it
1047blocks, and a bad way to reduce concurrency because it is inexact: Better 1271blocks, and a bad way to reduce concurrency because it is inexact: Better
1048use an C<aio_group> together with a feed callback. 1272use an C<aio_group> together with a feed callback.
1049 1273
1050Sets the maximum number of outstanding requests to C<$nreqs>. If you 1274Sets the maximum number of outstanding requests to C<$nreqs>. If you
1051to queue up more than this number of requests, the next call to the 1275do queue up more than this number of requests, the next call to the
1052C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>) 1276C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
1053function will block until the limit is no longer exceeded. 1277function will block until the limit is no longer exceeded.
1054 1278
1055The default value is very large, so there is no practical limit on the 1279The default value is very large, so there is no practical limit on the
1056number of outstanding requests. 1280number of outstanding requests.
1057 1281
1058You can still queue as many requests as you want. Therefore, 1282You can still queue as many requests as you want. Therefore,
1059C<max_oustsanding> is mainly useful in simple scripts (with low values) or 1283C<max_oustsanding> is mainly useful in simple scripts (with low values) or
1060as a stop gap to shield against fatal memory overflow (with large values). 1284as a stop gap to shield against fatal memory overflow (with large values).
1061 1285
1286=back
1287
1062=head3 STATISTICAL INFORMATION 1288=head3 STATISTICAL INFORMATION
1289
1290=over
1063 1291
1064=item IO::AIO::nreqs 1292=item IO::AIO::nreqs
1065 1293
1066Returns the number of requests currently in the ready, execute or pending 1294Returns the number of requests currently in the ready, execute or pending
1067states (i.e. for which their callback has not been invoked yet). 1295states (i.e. for which their callback has not been invoked yet).
1083 1311
1084=back 1312=back
1085 1313
1086=cut 1314=cut
1087 1315
1088# support function to convert a fd into a perl filehandle
1089sub _fd2fh {
1090 return undef if $_[0] < 0;
1091
1092 # try to generate nice filehandles
1093 my $sym = "IO::AIO::fd#$_[0]";
1094 local *$sym;
1095
1096 open *$sym, "+<&=$_[0]" # usually works under any unix
1097 or open *$sym, "<&=$_[0]" # cygwin needs this
1098 or open *$sym, ">&=$_[0]" # or this
1099 or return undef;
1100
1101 *$sym
1102}
1103
1104min_parallel 8; 1316min_parallel 8;
1105 1317
1106END { 1318END { flush }
1107 min_parallel 1;
1108 flush;
1109};
1110 1319
11111; 13201;
1112 1321
1113=head2 FORK BEHAVIOUR 1322=head2 FORK BEHAVIOUR
1114 1323
1134bytes of memory. In addition, stat requests need a stat buffer (possibly 1343bytes of memory. In addition, stat requests need a stat buffer (possibly
1135a few hundred bytes), readdir requires a result buffer and so on. Perl 1344a few hundred bytes), readdir requires a result buffer and so on. Perl
1136scalars and other data passed into aio requests will also be locked and 1345scalars and other data passed into aio requests will also be locked and
1137will consume memory till the request has entered the done state. 1346will consume memory till the request has entered the done state.
1138 1347
1139This is now awfully much, so queuing lots of requests is not usually a 1348This is not awfully much, so queuing lots of requests is not usually a
1140problem. 1349problem.
1141 1350
1142Per-thread usage: 1351Per-thread usage:
1143 1352
1144In the execution phase, some aio requests require more memory for 1353In the execution phase, some aio requests require more memory for

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines