ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.185 by root, Sat Dec 11 19:06:07 2010 UTC vs.
Revision 1.203 by root, Thu Jul 7 22:36:18 2011 UTC

168use common::sense; 168use common::sense;
169 169
170use base 'Exporter'; 170use base 'Exporter';
171 171
172BEGIN { 172BEGIN {
173 our $VERSION = '3.7'; 173 our $VERSION = '3.93';
174 174
175 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close 175 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close
176 aio_stat aio_lstat aio_unlink aio_rmdir aio_readdir aio_readdirx 176 aio_stat aio_lstat aio_unlink aio_rmdir aio_readdir aio_readdirx
177 aio_scandir aio_symlink aio_readlink aio_sync aio_fsync 177 aio_scandir aio_symlink aio_readlink aio_realpath aio_sync aio_fsync
178 aio_fdatasync aio_sync_file_range aio_pathsync aio_readahead 178 aio_fdatasync aio_sync_file_range aio_fallocate
179 aio_pathsync aio_readahead
179 aio_rename aio_link aio_move aio_copy aio_group 180 aio_rename aio_link aio_move aio_copy aio_group
180 aio_nop aio_mknod aio_load aio_rmtree aio_mkdir aio_chown 181 aio_nop aio_mknod aio_load aio_rmtree aio_mkdir aio_chown
181 aio_chmod aio_utime aio_truncate 182 aio_chmod aio_utime aio_truncate
182 aio_msync aio_mtouch aio_mlock aio_mlockall 183 aio_msync aio_mtouch aio_mlock aio_mlockall
183 aio_statvfs); 184 aio_statvfs);
184 185
185 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice)); 186 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice));
186 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush 187 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
187 min_parallel max_parallel max_idle 188 min_parallel max_parallel max_idle idle_timeout
188 nreqs nready npending nthreads 189 nreqs nready npending nthreads
189 max_poll_time max_poll_reqs 190 max_poll_time max_poll_reqs
190 sendfile fadvise madvise 191 sendfile fadvise madvise
191 mmap munmap munlock munlockall); 192 mmap munmap munlock munlockall);
192 193
222 aio_unlink $pathname, $callback->($status) 223 aio_unlink $pathname, $callback->($status)
223 aio_mknod $path, $mode, $dev, $callback->($status) 224 aio_mknod $path, $mode, $dev, $callback->($status)
224 aio_link $srcpath, $dstpath, $callback->($status) 225 aio_link $srcpath, $dstpath, $callback->($status)
225 aio_symlink $srcpath, $dstpath, $callback->($status) 226 aio_symlink $srcpath, $dstpath, $callback->($status)
226 aio_readlink $path, $callback->($link) 227 aio_readlink $path, $callback->($link)
228 aio_realpath $path, $callback->($link)
227 aio_rename $srcpath, $dstpath, $callback->($status) 229 aio_rename $srcpath, $dstpath, $callback->($status)
228 aio_mkdir $pathname, $mode, $callback->($status) 230 aio_mkdir $pathname, $mode, $callback->($status)
229 aio_rmdir $pathname, $callback->($status) 231 aio_rmdir $pathname, $callback->($status)
230 aio_readdir $pathname, $callback->($entries) 232 aio_readdir $pathname, $callback->($entries)
231 aio_readdirx $pathname, $flags, $callback->($entries, $flags) 233 aio_readdirx $pathname, $flags, $callback->($entries, $flags)
258 IO::AIO::max_poll_reqs $nreqs 260 IO::AIO::max_poll_reqs $nreqs
259 IO::AIO::max_poll_time $seconds 261 IO::AIO::max_poll_time $seconds
260 IO::AIO::min_parallel $nthreads 262 IO::AIO::min_parallel $nthreads
261 IO::AIO::max_parallel $nthreads 263 IO::AIO::max_parallel $nthreads
262 IO::AIO::max_idle $nthreads 264 IO::AIO::max_idle $nthreads
265 IO::AIO::idle_timeout $seconds
263 IO::AIO::max_outstanding $maxreqs 266 IO::AIO::max_outstanding $maxreqs
264 IO::AIO::nreqs 267 IO::AIO::nreqs
265 IO::AIO::nready 268 IO::AIO::nready
266 IO::AIO::npending 269 IO::AIO::npending
267 270
367 } else { 370 } else {
368 die "open failed: $!\n"; 371 die "open failed: $!\n";
369 } 372 }
370 }; 373 };
371 374
375In addition to all the common open modes/flags (C<O_RDONLY>, C<O_WRONLY>,
376C<O_RDWR>, C<O_CREAT>, C<O_TRUNC>, C<O_EXCL> and C<O_APPEND>), the
377following POSIX and non-POSIX constants are available (missing ones on
378your system are, as usual, C<0>):
379
380C<O_ASYNC>, C<O_DIRECT>, C<O_NOATIME>, C<O_CLOEXEC>, C<O_NOCTTY>, C<O_NOFOLLOW>,
381C<O_NONBLOCK>, C<O_EXEC>, C<O_SEARCH>, C<O_DIRECTORY>, C<O_DSYNC>,
382C<O_RSYNC>, C<O_SYNC> and C<O_TTY_INIT>.
383
372 384
373=item aio_close $fh, $callback->($status) 385=item aio_close $fh, $callback->($status)
374 386
375Asynchronously close a file and call the callback with the result 387Asynchronously close a file and call the callback with the result
376code. 388code.
426 438
427Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts 439Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts
428reading at byte offset C<$in_offset>, and starts writing at the current 440reading at byte offset C<$in_offset>, and starts writing at the current
429file offset of C<$out_fh>. Because of that, it is not safe to issue more 441file offset of C<$out_fh>. Because of that, it is not safe to issue more
430than one C<aio_sendfile> per C<$out_fh>, as they will interfere with each 442than one C<aio_sendfile> per C<$out_fh>, as they will interfere with each
431other. 443other. The same C<$in_fh> works fine though, as this function does not
444move or use the file offset of C<$in_fh>.
432 445
433Please note that C<aio_sendfile> can read more bytes from C<$in_fh> than 446Please note that C<aio_sendfile> can read more bytes from C<$in_fh> than
434are written, and there is no way to find out how many bytes have been read 447are written, and there is no way to find out how many more bytes have been
435from C<aio_sendfile> alone, as C<aio_sendfile> only provides the number of 448read from C<aio_sendfile> alone, as C<aio_sendfile> only provides the
436bytes written to C<$out_fh>. Only if the result value equals C<$length> 449number of bytes written to C<$out_fh>. Only if the result value equals
437one can assume that C<$length> bytes have been read. 450C<$length> one can assume that C<$length> bytes have been read.
438 451
439Unlike with other C<aio_> functions, it makes a lot of sense to use 452Unlike with other C<aio_> functions, it makes a lot of sense to use
440C<aio_sendfile> on non-blocking sockets, as long as one end (typically 453C<aio_sendfile> on non-blocking sockets, as long as one end (typically
441the C<$in_fh>) is a file - the file I/O will then be asynchronous, while 454the C<$in_fh>) is a file - the file I/O will then be asynchronous, while
442the socket I/O will be non-blocking. Note, however, that you can run into 455the socket I/O will be non-blocking. Note, however, that you can run
443a trap where C<aio_sendfile> reads some data with readahead, then fails 456into a trap where C<aio_sendfile> reads some data with readahead, then
444to write all data, and when the socket is ready the next time, the data 457fails to write all data, and when the socket is ready the next time, the
445in the cache is already lost, forcing C<aio_sendfile> to again hit the 458data in the cache is already lost, forcing C<aio_sendfile> to again hit
446disk. Explicit C<aio_read> + C<aio_write> let's you control resource usage 459the disk. Explicit C<aio_read> + C<aio_write> let's you better control
447much better. 460resource usage.
448 461
449This call tries to make use of a native C<sendfile> syscall to provide 462This call tries to make use of a native C<sendfile>-like syscall to
450zero-copy operation. For this to work, C<$out_fh> should refer to a 463provide zero-copy operation. For this to work, C<$out_fh> should refer to
451socket, and C<$in_fh> should refer to an mmap'able file. 464a socket, and C<$in_fh> should refer to an mmap'able file.
452 465
453If a native sendfile cannot be found or it fails with C<ENOSYS>, 466If a native sendfile cannot be found or it fails with C<ENOSYS>,
454C<ENOTSUP>, C<EOPNOTSUPP>, C<EAFNOSUPPORT>, C<EPROTOTYPE> or C<ENOTSOCK>, 467C<EINVAL>, C<ENOTSUP>, C<EOPNOTSUPP>, C<EAFNOSUPPORT>, C<EPROTOTYPE> or
455it will be emulated, so you can call C<aio_sendfile> on any type of 468C<ENOTSOCK>, it will be emulated, so you can call C<aio_sendfile> on any
456filehandle regardless of the limitations of the operating system. 469type of filehandle regardless of the limitations of the operating system.
470
471As native sendfile syscalls (as practically any non-POSIX interface hacked
472together in a hurry to improve benchmark numbers) tend to be rather buggy
473on many systems, this implementation tries to work around some known bugs
474in Linux and FreeBSD kernels (probably others, too), but that might fail,
475so you really really should check the return value of C<aio_sendfile> -
476fewre bytes than expected might have been transferred.
457 477
458 478
459=item aio_readahead $fh,$offset,$length, $callback->($retval) 479=item aio_readahead $fh,$offset,$length, $callback->($retval)
460 480
461C<aio_readahead> populates the page cache with data from a file so that 481C<aio_readahead> populates the page cache with data from a file so that
483for an explanation. 503for an explanation.
484 504
485Currently, the stats are always 64-bit-stats, i.e. instead of returning an 505Currently, the stats are always 64-bit-stats, i.e. instead of returning an
486error when stat'ing a large file, the results will be silently truncated 506error when stat'ing a large file, the results will be silently truncated
487unless perl itself is compiled with large file support. 507unless perl itself is compiled with large file support.
508
509To help interpret the mode and dev/rdev stat values, IO::AIO offers the
510following constants and functions (if not implemented, the constants will
511be C<0> and the functions will either C<croak> or fall back on traditional
512behaviour).
513
514C<S_IFMT>, C<S_IFIFO>, C<S_IFCHR>, C<S_IFBLK>, C<S_IFLNK>, C<S_IFREG>,
515C<S_IFDIR>, C<S_IFWHT>, C<S_IFSOCK>, C<IO::AIO::major $dev_t>,
516C<IO::AIO::minor $dev_t>, C<IO::AIO::makedev $major, $minor>.
488 517
489Example: Print the length of F</etc/passwd>: 518Example: Print the length of F</etc/passwd>:
490 519
491 aio_stat "/etc/passwd", sub { 520 aio_stat "/etc/passwd", sub {
492 $_[0] and die "stat failed: $!"; 521 $_[0] and die "stat failed: $!";
594 623
595The only (POSIX-) portable way of calling this function is: 624The only (POSIX-) portable way of calling this function is:
596 625
597 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ... 626 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ...
598 627
628See C<aio_stat> for info about some potentially helpful extra constants
629and functions.
599 630
600=item aio_link $srcpath, $dstpath, $callback->($status) 631=item aio_link $srcpath, $dstpath, $callback->($status)
601 632
602Asynchronously create a new link to the existing object at C<$srcpath> at 633Asynchronously create a new link to the existing object at C<$srcpath> at
603the path C<$dstpath> and call the callback with the result code. 634the path C<$dstpath> and call the callback with the result code.
614Asynchronously read the symlink specified by C<$path> and pass it to 645Asynchronously read the symlink specified by C<$path> and pass it to
615the callback. If an error occurs, nothing or undef gets passed to the 646the callback. If an error occurs, nothing or undef gets passed to the
616callback. 647callback.
617 648
618 649
650=item aio_realpath $path, $callback->($path)
651
652Asynchronously make the path absolute and resolve any symlinks in
653C<$path>. The resulting path only consists of directories (Same as
654L<Cwd::realpath>).
655
656This request can be used to get the absolute path of the current working
657directory by passing it a path of F<.> (a single dot).
658
659
619=item aio_rename $srcpath, $dstpath, $callback->($status) 660=item aio_rename $srcpath, $dstpath, $callback->($status)
620 661
621Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as 662Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as
622rename(2) and call the callback with the result code. 663rename(2) and call the callback with the result code.
623 664
656 697
657=over 4 698=over 4
658 699
659=item IO::AIO::READDIR_DENTS 700=item IO::AIO::READDIR_DENTS
660 701
661When this flag is off, then the callback gets an arrayref with of names 702When this flag is off, then the callback gets an arrayref consisting of
662only (as with C<aio_readdir>), otherwise it gets an arrayref with 703names only (as with C<aio_readdir>), otherwise it gets an arrayref with
663C<[$name, $type, $inode]> arrayrefs, each describing a single directory 704C<[$name, $type, $inode]> arrayrefs, each describing a single directory
664entry in more detail. 705entry in more detail.
665 706
666C<$name> is the name of the entry. 707C<$name> is the name of the entry.
667 708
680systems that do not deliver the inode information. 721systems that do not deliver the inode information.
681 722
682=item IO::AIO::READDIR_DIRS_FIRST 723=item IO::AIO::READDIR_DIRS_FIRST
683 724
684When this flag is set, then the names will be returned in an order where 725When this flag is set, then the names will be returned in an order where
685likely directories come first. This is useful when you need to quickly 726likely directories come first, in optimal stat order. This is useful when
686find directories, or you want to find all directories while avoiding to 727you need to quickly find directories, or you want to find all directories
687stat() each entry. 728while avoiding to stat() each entry.
688 729
689If the system returns type information in readdir, then this is used 730If the system returns type information in readdir, then this is used
690to find directories directly. Otherwise, likely directories are files 731to find directories directly. Otherwise, likely directories are names
691beginning with ".", or otherwise files with no dots, of which files with 732beginning with ".", or otherwise names with no dots, of which names with
692short names are tried first. 733short names are tried first.
693 734
694=item IO::AIO::READDIR_STAT_ORDER 735=item IO::AIO::READDIR_STAT_ORDER
695 736
696When this flag is set, then the names will be returned in an order 737When this flag is set, then the names will be returned in an order
841 if ($_[0] && $! == EXDEV) { 882 if ($_[0] && $! == EXDEV) {
842 aioreq_pri $pri; 883 aioreq_pri $pri;
843 add $grp aio_copy $src, $dst, sub { 884 add $grp aio_copy $src, $dst, sub {
844 $grp->result ($_[0]); 885 $grp->result ($_[0]);
845 886
846 if (!$_[0]) { 887 unless ($_[0]) {
847 aioreq_pri $pri; 888 aioreq_pri $pri;
848 add $grp aio_unlink $src; 889 add $grp aio_unlink $src;
849 } 890 }
850 }; 891 };
851 } else { 892 } else {
1394 1435
1395See C<poll_cb> for an example. 1436See C<poll_cb> for an example.
1396 1437
1397=item IO::AIO::poll_cb 1438=item IO::AIO::poll_cb
1398 1439
1399Process some outstanding events on the result pipe. You have to call this 1440Process some outstanding events on the result pipe. You have to call
1400regularly. Returns C<0> if all events could be processed, or C<-1> if it 1441this regularly. Returns C<0> if all events could be processed (or there
1401returned earlier for whatever reason. Returns immediately when no events 1442were no events to process), or C<-1> if it returned earlier for whatever
1402are outstanding. The amount of events processed depends on the settings of 1443reason. Returns immediately when no events are outstanding. The amount of
1403C<IO::AIO::max_poll_req> and C<IO::AIO::max_poll_time>. 1444events processed depends on the settings of C<IO::AIO::max_poll_req> and
1445C<IO::AIO::max_poll_time>.
1404 1446
1405If not all requests were processed for whatever reason, the filehandle 1447If not all requests were processed for whatever reason, the filehandle
1406will still be ready when C<poll_cb> returns, so normally you don't have to 1448will still be ready when C<poll_cb> returns, so normally you don't have to
1407do anything special to have it called later. 1449do anything special to have it called later.
1450
1451Apart from calling C<IO::AIO::poll_cb> when the event filehandle becomes
1452ready, it can be beneficial to call this function from loops which submit
1453a lot of requests, to make sure the results get processed when they become
1454available and not just when the loop is finished and the event loop takes
1455over again. This function returns very fast when there are no outstanding
1456requests.
1408 1457
1409Example: Install an Event watcher that automatically calls 1458Example: Install an Event watcher that automatically calls
1410IO::AIO::poll_cb with high priority (more examples can be found in the 1459IO::AIO::poll_cb with high priority (more examples can be found in the
1411SYNOPSIS section, at the top of this document): 1460SYNOPSIS section, at the top of this document):
1412 1461
1514 1563
1515Under normal circumstances you don't need to call this function. 1564Under normal circumstances you don't need to call this function.
1516 1565
1517=item IO::AIO::max_idle $nthreads 1566=item IO::AIO::max_idle $nthreads
1518 1567
1519Limit the number of threads (default: 4) that are allowed to idle (i.e., 1568Limit the number of threads (default: 4) that are allowed to idle
1520threads that did not get a request to process within 10 seconds). That 1569(i.e., threads that did not get a request to process within the idle
1521means if a thread becomes idle while C<$nthreads> other threads are also 1570timeout (default: 10 seconds). That means if a thread becomes idle while
1522idle, it will free its resources and exit. 1571C<$nthreads> other threads are also idle, it will free its resources and
1572exit.
1523 1573
1524This is useful when you allow a large number of threads (e.g. 100 or 1000) 1574This is useful when you allow a large number of threads (e.g. 100 or 1000)
1525to allow for extremely high load situations, but want to free resources 1575to allow for extremely high load situations, but want to free resources
1526under normal circumstances (1000 threads can easily consume 30MB of RAM). 1576under normal circumstances (1000 threads can easily consume 30MB of RAM).
1527 1577
1528The default is probably ok in most situations, especially if thread 1578The default is probably ok in most situations, especially if thread
1529creation is fast. If thread creation is very slow on your system you might 1579creation is fast. If thread creation is very slow on your system you might
1530want to use larger values. 1580want to use larger values.
1531 1581
1582=item IO::AIO::idle_timeout $seconds
1583
1584Sets the minimum idle timeout (default 10) after which worker threads are
1585allowed to exit. SEe C<IO::AIO::max_idle>.
1586
1532=item IO::AIO::max_outstanding $maxreqs 1587=item IO::AIO::max_outstanding $maxreqs
1588
1589Sets the maximum number of outstanding requests to C<$nreqs>. If
1590you do queue up more than this number of requests, the next call to
1591C<IO::AIO::poll_cb> (and other functions calling C<poll_cb>, such as
1592C<IO::AIO::flush> or C<IO::AIO::poll>) will block until the limit is no
1593longer exceeded.
1594
1595In other words, this setting does not enforce a queue limit, but can be
1596used to make poll functions block if the limit is exceeded.
1533 1597
1534This is a very bad function to use in interactive programs because it 1598This is a very bad function to use in interactive programs because it
1535blocks, and a bad way to reduce concurrency because it is inexact: Better 1599blocks, and a bad way to reduce concurrency because it is inexact: Better
1536use an C<aio_group> together with a feed callback. 1600use an C<aio_group> together with a feed callback.
1537 1601
1538Sets the maximum number of outstanding requests to C<$nreqs>. If you 1602It's main use is in scripts without an event loop - when you want to stat
1539do queue up more than this number of requests, the next call to the 1603a lot of files, you can write somehting like this:
1540C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
1541function will block until the limit is no longer exceeded.
1542 1604
1543The default value is very large, so there is no practical limit on the 1605 IO::AIO::max_outstanding 32;
1544number of outstanding requests.
1545 1606
1546You can still queue as many requests as you want. Therefore, 1607 for my $path (...) {
1547C<max_outstanding> is mainly useful in simple scripts (with low values) or 1608 aio_stat $path , ...;
1548as a stop gap to shield against fatal memory overflow (with large values). 1609 IO::AIO::poll_cb;
1610 }
1611
1612 IO::AIO::flush;
1613
1614The call to C<poll_cb> inside the loop will normally return instantly, but
1615as soon as more thna C<32> reqeusts are in-flight, it will block until
1616some requests have been handled. This keeps the loop from pushing a large
1617number of C<aio_stat> requests onto the queue.
1618
1619The default value for C<max_outstanding> is very large, so there is no
1620practical limit on the number of outstanding requests.
1549 1621
1550=back 1622=back
1551 1623
1552=head3 STATISTICAL INFORMATION 1624=head3 STATISTICAL INFORMATION
1553 1625
1729 Danga::Socket->AddOtherFds (IO::AIO::poll_fileno => 1801 Danga::Socket->AddOtherFds (IO::AIO::poll_fileno =>
1730 \&IO::AIO::poll_cb); 1802 \&IO::AIO::poll_cb);
1731 1803
1732=head2 FORK BEHAVIOUR 1804=head2 FORK BEHAVIOUR
1733 1805
1734This module should do "the right thing" when the process using it forks: 1806Usage of pthreads in a program changes the semantics of fork
1807considerably. Specifically, only async-safe functions can be called after
1808fork. Perl doesn't know about this, so in general, you cannot call fork
1809with defined behaviour in perl. IO::AIO uses pthreads, so this applies,
1810but many other extensions and (for inexplicable reasons) perl itself often
1811is linked against pthreads, so this limitation applies.
1735 1812
1736Before the fork, IO::AIO enters a quiescent state where no requests 1813Some operating systems have extensions that allow safe use of fork, and
1737can be added in other threads and no results will be processed. After 1814this module should do "the right thing" on those, and tries on others. At
1738the fork the parent simply leaves the quiescent state and continues 1815the time of this writing (2011) only GNU/Linux supports these extensions
1739request/result processing, while the child frees the request/result queue 1816to POSIX.
1740(so that the requests started before the fork will only be handled in the
1741parent). Threads will be started on demand until the limit set in the
1742parent process has been reached again.
1743
1744In short: the parent will, after a short pause, continue as if fork had
1745not been called, while the child will act as if IO::AIO has not been used
1746yet.
1747 1817
1748=head2 MEMORY USAGE 1818=head2 MEMORY USAGE
1749 1819
1750Per-request usage: 1820Per-request usage:
1751 1821

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines