ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.142 by root, Wed Oct 22 18:15:36 2008 UTC vs.
Revision 1.155 by root, Sun Jun 14 20:36:59 2009 UTC

193use strict 'vars'; 193use strict 'vars';
194 194
195use base 'Exporter'; 195use base 'Exporter';
196 196
197BEGIN { 197BEGIN {
198 our $VERSION = '3.16'; 198 our $VERSION = '3.23';
199 199
200 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close 200 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close
201 aio_stat aio_lstat aio_unlink aio_rmdir aio_readdir 201 aio_stat aio_lstat aio_unlink aio_rmdir aio_readdir aio_readdirx
202 aio_scandir aio_symlink aio_readlink aio_sync aio_fsync 202 aio_scandir aio_symlink aio_readlink aio_sync aio_fsync
203 aio_fdatasync aio_sync_file_range aio_pathsync aio_readahead 203 aio_fdatasync aio_sync_file_range aio_pathsync aio_readahead
204 aio_rename aio_link aio_move aio_copy aio_group 204 aio_rename aio_link aio_move aio_copy aio_group
205 aio_nop aio_mknod aio_load aio_rmtree aio_mkdir aio_chown 205 aio_nop aio_mknod aio_load aio_rmtree aio_mkdir aio_chown
206 aio_chmod aio_utime aio_truncate); 206 aio_chmod aio_utime aio_truncate);
208 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice)); 208 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice));
209 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush 209 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
210 min_parallel max_parallel max_idle 210 min_parallel max_parallel max_idle
211 nreqs nready npending nthreads 211 nreqs nready npending nthreads
212 max_poll_time max_poll_reqs); 212 max_poll_time max_poll_reqs);
213
214 push @AIO_REQ, qw(aio_busy); # not exported
213 215
214 @IO::AIO::GRP::ISA = 'IO::AIO::REQ'; 216 @IO::AIO::GRP::ISA = 'IO::AIO::REQ';
215 217
216 require XSLoader; 218 require XSLoader;
217 XSLoader::load ("IO::AIO", $VERSION); 219 XSLoader::load ("IO::AIO", $VERSION);
336 338
337=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 339=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
338 340
339=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 341=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
340 342
341Reads or writes C<$length> bytes from the specified C<$fh> and C<$offset> 343Reads or writes C<$length> bytes from or to the specified C<$fh> and
342into the scalar given by C<$data> and offset C<$dataoffset> and calls the 344C<$offset> into the scalar given by C<$data> and offset C<$dataoffset>
343callback without the actual number of bytes read (or -1 on error, just 345and calls the callback without the actual number of bytes read (or -1 on
344like the syscall). 346error, just like the syscall).
347
348C<aio_read> will, like C<sysread>, shrink or grow the C<$data> scalar to
349offset plus the actual number of bytes read.
345 350
346If C<$offset> is undefined, then the current file descriptor offset will 351If C<$offset> is undefined, then the current file descriptor offset will
347be used (and updated), otherwise the file descriptor offset will not be 352be used (and updated), otherwise the file descriptor offset will not be
348changed by these calls. 353changed by these calls.
349 354
350If C<$length> is undefined in C<aio_write>, use the remaining length of C<$data>. 355If C<$length> is undefined in C<aio_write>, use the remaining length of
356C<$data>.
351 357
352If C<$dataoffset> is less than zero, it will be counted from the end of 358If C<$dataoffset> is less than zero, it will be counted from the end of
353C<$data>. 359C<$data>.
354 360
355The C<$data> scalar I<MUST NOT> be modified in any way while the request 361The C<$data> scalar I<MUST NOT> be modified in any way while the request
527 533
528Unlike the POSIX call of the same name, C<aio_readdir> reads an entire 534Unlike the POSIX call of the same name, C<aio_readdir> reads an entire
529directory (i.e. opendir + readdir + closedir). The entries will not be 535directory (i.e. opendir + readdir + closedir). The entries will not be
530sorted, and will B<NOT> include the C<.> and C<..> entries. 536sorted, and will B<NOT> include the C<.> and C<..> entries.
531 537
532The callback a single argument which is either C<undef> or an array-ref 538The callback is passed a single argument which is either C<undef> or an
533with the filenames. 539array-ref with the filenames.
540
541
542=item aio_readdirx $pathname, $flags, $callback->($entries, $flags)
543
544Quite similar to C<aio_readdir>, but the C<$flags> argument allows to tune
545behaviour and output format. In case of an error, C<$entries> will be
546C<undef>.
547
548The flags are a combination of the following constants, ORed together (the
549flags will also be passed to the callback, possibly modified):
550
551=over 4
552
553=item IO::AIO::READDIR_DENTS
554
555When this flag is off, then the callback gets an arrayref with of names
556only (as with C<aio_readdir>), otherwise it gets an arrayref with
557C<[$name, $type, $inode]> arrayrefs, each describing a single directory
558entry in more detail.
559
560C<$name> is the name of the entry.
561
562C<$type> is one of the C<IO::AIO::DT_xxx> constants:
563
564C<IO::AIO::DT_UNKNOWN>, C<IO::AIO::DT_FIFO>, C<IO::AIO::DT_CHR>, C<IO::AIO::DT_DIR>,
565C<IO::AIO::DT_BLK>, C<IO::AIO::DT_REG>, C<IO::AIO::DT_LNK>, C<IO::AIO::DT_SOCK>,
566C<IO::AIO::DT_WHT>.
567
568C<IO::AIO::DT_UNKNOWN> means just that: readdir does not know. If you need to
569know, you have to run stat yourself. Also, for speed reasons, the C<$type>
570scalars are read-only: you can not modify them.
571
572C<$inode> is the inode number (which might not be exact on systems with 64
573bit inode numbers and 32 bit perls). This field has unspecified content on
574systems that do not deliver the inode information.
575
576=item IO::AIO::READDIR_DIRS_FIRST
577
578When this flag is set, then the names will be returned in an order where
579likely directories come first. This is useful when you need to quickly
580find directories, or you want to find all directories while avoiding to
581stat() each entry.
582
583If the system returns type information in readdir, then this is used
584to find directories directly. Otherwise, likely directories are files
585beginning with ".", or otherwise files with no dots, of which files with
586short names are tried first.
587
588=item IO::AIO::READDIR_STAT_ORDER
589
590When this flag is set, then the names will be returned in an order
591suitable for stat()'ing each one. That is, when you plan to stat()
592all files in the given directory, then the returned order will likely
593be fastest.
594
595If both this flag and C<IO::AIO::READDIR_DIRS_FIRST> are specified, then
596the likely dirs come first, resulting in a less optimal stat order.
597
598=item IO::AIO::READDIR_FOUND_UNKNOWN
599
600This flag should not be set when calling C<aio_readdirx>. Instead, it
601is being set by C<aio_readdirx>, when any of the C<$type>'s found were
602C<IO::AIO::DT_UNKNOWN>. The absense of this flag therefore indicates that all
603C<$type>'s are known, which can be used to speed up some algorithms.
604
605=back
534 606
535 607
536=item aio_load $path, $data, $callback->($status) 608=item aio_load $path, $data, $callback->($status)
537 609
538This is a composite request that tries to fully load the given file into 610This is a composite request that tries to fully load the given file into
585 my $grp = aio_group $cb; 657 my $grp = aio_group $cb;
586 658
587 aioreq_pri $pri; 659 aioreq_pri $pri;
588 add $grp aio_open $src, O_RDONLY, 0, sub { 660 add $grp aio_open $src, O_RDONLY, 0, sub {
589 if (my $src_fh = $_[0]) { 661 if (my $src_fh = $_[0]) {
590 my @stat = stat $src_fh; 662 my @stat = stat $src_fh; # hmm, might bock over nfs?
591 663
592 aioreq_pri $pri; 664 aioreq_pri $pri;
593 add $grp aio_open $dst, O_CREAT | O_WRONLY | O_TRUNC, 0200, sub { 665 add $grp aio_open $dst, O_CREAT | O_WRONLY | O_TRUNC, 0200, sub {
594 if (my $dst_fh = $_[0]) { 666 if (my $dst_fh = $_[0]) {
595 aioreq_pri $pri; 667 aioreq_pri $pri;
596 add $grp aio_sendfile $dst_fh, $src_fh, 0, $stat[7], sub { 668 add $grp aio_sendfile $dst_fh, $src_fh, 0, $stat[7], sub {
597 if ($_[0] == $stat[7]) { 669 if ($_[0] == $stat[7]) {
598 $grp->result (0); 670 $grp->result (0);
599 close $src_fh; 671 close $src_fh;
600 672
601 # those should not normally block. should. should. 673 my $ch = sub {
602 utime $stat[8], $stat[9], $dst; 674 aioreq_pri $pri;
603 chmod $stat[2] & 07777, $dst_fh; 675 add $grp aio_chmod $dst_fh, $stat[2] & 07777, sub {
604 chown $stat[4], $stat[5], $dst_fh; 676 aioreq_pri $pri;
677 add $grp aio_chown $dst_fh, $stat[4], $stat[5], sub {
678 aioreq_pri $pri;
679 add $grp aio_close $dst_fh;
680 }
681 };
682 };
605 683
606 aioreq_pri $pri; 684 aioreq_pri $pri;
607 add $grp aio_close $dst_fh; 685 add $grp aio_utime $dst_fh, $stat[8], $stat[9], sub {
686 if ($_[0] < 0 && $! == ENOSYS) {
687 aioreq_pri $pri;
688 add $grp aio_utime $dst, $stat[8], $stat[9], $ch;
689 } else {
690 $ch->();
691 }
692 };
608 } else { 693 } else {
609 $grp->result (-1); 694 $grp->result (-1);
610 close $src_fh; 695 close $src_fh;
611 close $dst_fh; 696 close $dst_fh;
612 697
690 775
691Implementation notes. 776Implementation notes.
692 777
693The C<aio_readdir> cannot be avoided, but C<stat()>'ing every entry can. 778The C<aio_readdir> cannot be avoided, but C<stat()>'ing every entry can.
694 779
780If readdir returns file type information, then this is used directly to
781find directories.
782
695After reading the directory, the modification time, size etc. of the 783Otherwise, after reading the directory, the modification time, size etc.
696directory before and after the readdir is checked, and if they match (and 784of the directory before and after the readdir is checked, and if they
697isn't the current time), the link count will be used to decide how many 785match (and isn't the current time), the link count will be used to decide
698entries are directories (if >= 2). Otherwise, no knowledge of the number 786how many entries are directories (if >= 2). Otherwise, no knowledge of the
699of subdirectories will be assumed. 787number of subdirectories will be assumed.
700 788
701Then entries will be sorted into likely directories (everything without 789Then entries will be sorted into likely directories a non-initial dot
702a non-initial dot currently) and likely non-directories (everything 790currently) and likely non-directories (see C<aio_readdirx>). Then every
703else). Then every entry plus an appended C</.> will be C<stat>'ed, 791entry plus an appended C</.> will be C<stat>'ed, likely directories first,
704likely directories first. If that succeeds, it assumes that the entry 792in order of their inode numbers. If that succeeds, it assumes that the
705is a directory or a symlink to directory (which will be checked 793entry is a directory or a symlink to directory (which will be checked
706seperately). This is often faster than stat'ing the entry itself because 794seperately). This is often faster than stat'ing the entry itself because
707filesystems might detect the type of the entry without reading the inode 795filesystems might detect the type of the entry without reading the inode
708data (e.g. ext2fs filetype feature). 796data (e.g. ext2fs filetype feature), even on systems that cannot return
797the filetype information on readdir.
709 798
710If the known number of directories (link count - 2) has been reached, the 799If the known number of directories (link count - 2) has been reached, the
711rest of the entries is assumed to be non-directories. 800rest of the entries is assumed to be non-directories.
712 801
713This only works with certainty on POSIX (= UNIX) filesystems, which 802This only works with certainty on POSIX (= UNIX) filesystems, which
735 my $now = time; 824 my $now = time;
736 my $hash1 = join ":", (stat _)[0,1,3,7,9]; 825 my $hash1 = join ":", (stat _)[0,1,3,7,9];
737 826
738 # read the directory entries 827 # read the directory entries
739 aioreq_pri $pri; 828 aioreq_pri $pri;
740 add $grp aio_readdir $path, sub { 829 add $grp aio_readdirx $path, READDIR_DIRS_FIRST, sub {
741 my $entries = shift 830 my $entries = shift
742 or return $grp->result (); 831 or return $grp->result ();
743 832
744 # stat the dir another time 833 # stat the dir another time
745 aioreq_pri $pri; 834 aioreq_pri $pri;
751 # take the slow route if anything looks fishy 840 # take the slow route if anything looks fishy
752 if ($hash1 ne $hash2 or (stat _)[9] == $now) { 841 if ($hash1 ne $hash2 or (stat _)[9] == $now) {
753 $ndirs = -1; 842 $ndirs = -1;
754 } else { 843 } else {
755 # if nlink == 2, we are finished 844 # if nlink == 2, we are finished
756 # on non-posix-fs's, we rely on nlink < 2 845 # for non-posix-fs's, we rely on nlink < 2
757 $ndirs = (stat _)[3] - 2 846 $ndirs = (stat _)[3] - 2
758 or return $grp->result ([], $entries); 847 or return $grp->result ([], $entries);
759 } 848 }
760 849
761 # sort into likely dirs and likely nondirs
762 # dirs == files without ".", short entries first
763 $entries = [map $_->[0],
764 sort { $b->[1] cmp $a->[1] }
765 map [$_, sprintf "%s%04d", (/.\./ ? "1" : "0"), length],
766 @$entries];
767
768 my (@dirs, @nondirs); 850 my (@dirs, @nondirs);
769 851
770 my $statgrp = add $grp aio_group sub { 852 my $statgrp = add $grp aio_group sub {
771 $grp->result (\@dirs, \@nondirs); 853 $grp->result (\@dirs, \@nondirs);
772 }; 854 };
773 855
774 limit $statgrp $maxreq; 856 limit $statgrp $maxreq;
775 feed $statgrp sub { 857 feed $statgrp sub {
776 return unless @$entries; 858 return unless @$entries;
777 my $entry = pop @$entries; 859 my $entry = shift @$entries;
778 860
779 aioreq_pri $pri; 861 aioreq_pri $pri;
780 add $statgrp aio_stat "$path/$entry/.", sub { 862 add $statgrp aio_stat "$path/$entry/.", sub {
781 if ($_[0] < 0) { 863 if ($_[0] < 0) {
782 push @nondirs, $entry; 864 push @nondirs, $entry;
963=item cancel $req 1045=item cancel $req
964 1046
965Cancels the request, if possible. Has the effect of skipping execution 1047Cancels the request, if possible. Has the effect of skipping execution
966when entering the B<execute> state and skipping calling the callback when 1048when entering the B<execute> state and skipping calling the callback when
967entering the the B<result> state, but will leave the request otherwise 1049entering the the B<result> state, but will leave the request otherwise
968untouched. That means that requests that currently execute will not be 1050untouched (with the exception of readdir). That means that requests that
969stopped and resources held by the request will not be freed prematurely. 1051currently execute will not be stopped and resources held by the request
1052will not be freed prematurely.
970 1053
971=item cb $req $callback->(...) 1054=item cb $req $callback->(...)
972 1055
973Replace (or simply set) the callback registered to the request. 1056Replace (or simply set) the callback registered to the request.
974 1057

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines