ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.184 by root, Mon Nov 1 22:03:43 2010 UTC vs.
Revision 1.199 by root, Wed Jun 29 12:46:36 2011 UTC

168use common::sense; 168use common::sense;
169 169
170use base 'Exporter'; 170use base 'Exporter';
171 171
172BEGIN { 172BEGIN {
173 our $VERSION = '3.7'; 173 our $VERSION = '3.92';
174 174
175 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close 175 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close
176 aio_stat aio_lstat aio_unlink aio_rmdir aio_readdir aio_readdirx 176 aio_stat aio_lstat aio_unlink aio_rmdir aio_readdir aio_readdirx
177 aio_scandir aio_symlink aio_readlink aio_sync aio_fsync 177 aio_scandir aio_symlink aio_readlink aio_sync aio_fsync
178 aio_fdatasync aio_sync_file_range aio_pathsync aio_readahead 178 aio_fdatasync aio_sync_file_range aio_pathsync aio_readahead
182 aio_msync aio_mtouch aio_mlock aio_mlockall 182 aio_msync aio_mtouch aio_mlock aio_mlockall
183 aio_statvfs); 183 aio_statvfs);
184 184
185 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice)); 185 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice));
186 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush 186 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
187 min_parallel max_parallel max_idle 187 min_parallel max_parallel max_idle idle_timeout
188 nreqs nready npending nthreads 188 nreqs nready npending nthreads
189 max_poll_time max_poll_reqs 189 max_poll_time max_poll_reqs
190 sendfile fadvise madvise 190 sendfile fadvise madvise
191 mmap munmap munlock munlockall); 191 mmap munmap munlock munlockall);
192 192
258 IO::AIO::max_poll_reqs $nreqs 258 IO::AIO::max_poll_reqs $nreqs
259 IO::AIO::max_poll_time $seconds 259 IO::AIO::max_poll_time $seconds
260 IO::AIO::min_parallel $nthreads 260 IO::AIO::min_parallel $nthreads
261 IO::AIO::max_parallel $nthreads 261 IO::AIO::max_parallel $nthreads
262 IO::AIO::max_idle $nthreads 262 IO::AIO::max_idle $nthreads
263 IO::AIO::idle_timeout $seconds
263 IO::AIO::max_outstanding $maxreqs 264 IO::AIO::max_outstanding $maxreqs
264 IO::AIO::nreqs 265 IO::AIO::nreqs
265 IO::AIO::nready 266 IO::AIO::nready
266 IO::AIO::npending 267 IO::AIO::npending
267 268
367 } else { 368 } else {
368 die "open failed: $!\n"; 369 die "open failed: $!\n";
369 } 370 }
370 }; 371 };
371 372
373In addition to all the common open modes/flags (C<O_RDONLY>, C<O_WRONLY>,
374C<O_RDWR>, C<O_CREAT>, C<O_TRUNC>, C<O_EXCL> and C<O_APPEND>), the
375following POSIX and non-POSIX constants are available (missing ones on
376your system are, as usual, C<0>):
377
378C<O_ASYNC>, C<O_DIRECT>, C<O_NOATIME>, C<O_CLOEXEC>, C<O_NOCTTY>, C<O_NOFOLLOW>,
379C<O_NONBLOCK>, C<O_EXEC>, C<O_SEARCH>, C<O_DIRECTORY>, C<O_DSYNC>,
380C<O_RSYNC>, C<O_SYNC> and C<O_TTY_INIT>.
381
372 382
373=item aio_close $fh, $callback->($status) 383=item aio_close $fh, $callback->($status)
374 384
375Asynchronously close a file and call the callback with the result 385Asynchronously close a file and call the callback with the result
376code. 386code.
426 436
427Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts 437Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts
428reading at byte offset C<$in_offset>, and starts writing at the current 438reading at byte offset C<$in_offset>, and starts writing at the current
429file offset of C<$out_fh>. Because of that, it is not safe to issue more 439file offset of C<$out_fh>. Because of that, it is not safe to issue more
430than one C<aio_sendfile> per C<$out_fh>, as they will interfere with each 440than one C<aio_sendfile> per C<$out_fh>, as they will interfere with each
431other. 441other. The same C<$in_fh> works fine though, as this function does not
442move or use the file offset of C<$in_fh>.
432 443
444Please note that C<aio_sendfile> can read more bytes from C<$in_fh> than
445are written, and there is no way to find out how many more bytes have been
446read from C<aio_sendfile> alone, as C<aio_sendfile> only provides the
447number of bytes written to C<$out_fh>. Only if the result value equals
448C<$length> one can assume that C<$length> bytes have been read.
449
450Unlike with other C<aio_> functions, it makes a lot of sense to use
451C<aio_sendfile> on non-blocking sockets, as long as one end (typically
452the C<$in_fh>) is a file - the file I/O will then be asynchronous, while
453the socket I/O will be non-blocking. Note, however, that you can run
454into a trap where C<aio_sendfile> reads some data with readahead, then
455fails to write all data, and when the socket is ready the next time, the
456data in the cache is already lost, forcing C<aio_sendfile> to again hit
457the disk. Explicit C<aio_read> + C<aio_write> let's you better control
458resource usage.
459
433This call tries to make use of a native C<sendfile> syscall to provide 460This call tries to make use of a native C<sendfile>-like syscall to
434zero-copy operation. For this to work, C<$out_fh> should refer to a 461provide zero-copy operation. For this to work, C<$out_fh> should refer to
435socket, and C<$in_fh> should refer to an mmap'able file. 462a socket, and C<$in_fh> should refer to an mmap'able file.
436 463
437If a native sendfile cannot be found or it fails with C<ENOSYS>, 464If a native sendfile cannot be found or it fails with C<ENOSYS>,
438C<ENOTSUP>, C<EOPNOTSUPP>, C<EAFNOSUPPORT>, C<EPROTOTYPE> or C<ENOTSOCK>, 465C<EINVAL>, C<ENOTSUP>, C<EOPNOTSUPP>, C<EAFNOSUPPORT>, C<EPROTOTYPE> or
439it will be emulated, so you can call C<aio_sendfile> on any type of 466C<ENOTSOCK>, it will be emulated, so you can call C<aio_sendfile> on any
440filehandle regardless of the limitations of the operating system. 467type of filehandle regardless of the limitations of the operating system.
441 468
442Please note, however, that C<aio_sendfile> can read more bytes from 469As native sendfile syscalls (as practically any non-POSIX interface hacked
443C<$in_fh> than are written, and there is no way to find out how many 470together in a hurry to improve benchmark numbers) tend to be rather buggy
444bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only 471on many systems, this implementation tries to work around some known bugs
445provides the number of bytes written to C<$out_fh>. Only if the result 472in Linux and FreeBSD kernels (probably others, too), but that might fail,
446value equals C<$length> one can assume that C<$length> bytes have been 473so you really really should check the return value of C<aio_sendfile> -
447read. 474fewre bytes than expected might have been transferred.
448 475
449 476
450=item aio_readahead $fh,$offset,$length, $callback->($retval) 477=item aio_readahead $fh,$offset,$length, $callback->($retval)
451 478
452C<aio_readahead> populates the page cache with data from a file so that 479C<aio_readahead> populates the page cache with data from a file so that
474for an explanation. 501for an explanation.
475 502
476Currently, the stats are always 64-bit-stats, i.e. instead of returning an 503Currently, the stats are always 64-bit-stats, i.e. instead of returning an
477error when stat'ing a large file, the results will be silently truncated 504error when stat'ing a large file, the results will be silently truncated
478unless perl itself is compiled with large file support. 505unless perl itself is compiled with large file support.
506
507To help interpret the mode and dev/rdev stat values, IO::AIO offers the
508following constants and functions (if not implemented, the constants will
509be C<0> and the functions will either C<croak> or fall back on traditional
510behaviour).
511
512C<S_IFMT>, C<S_IFIFO>, C<S_IFCHR>, C<S_IFBLK>, C<S_IFLNK>, C<S_IFREG>,
513C<S_IFDIR>, C<S_IFWHT>, C<S_IFSOCK>, C<IO::AIO::major $dev_t>,
514C<IO::AIO::minor $dev_t>, C<IO::AIO::makedev $major, $minor>.
479 515
480Example: Print the length of F</etc/passwd>: 516Example: Print the length of F</etc/passwd>:
481 517
482 aio_stat "/etc/passwd", sub { 518 aio_stat "/etc/passwd", sub {
483 $_[0] and die "stat failed: $!"; 519 $_[0] and die "stat failed: $!";
585 621
586The only (POSIX-) portable way of calling this function is: 622The only (POSIX-) portable way of calling this function is:
587 623
588 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ... 624 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ...
589 625
626See C<aio_stat> for info about some potentially helpful extra constants
627and functions.
590 628
591=item aio_link $srcpath, $dstpath, $callback->($status) 629=item aio_link $srcpath, $dstpath, $callback->($status)
592 630
593Asynchronously create a new link to the existing object at C<$srcpath> at 631Asynchronously create a new link to the existing object at C<$srcpath> at
594the path C<$dstpath> and call the callback with the result code. 632the path C<$dstpath> and call the callback with the result code.
647 685
648=over 4 686=over 4
649 687
650=item IO::AIO::READDIR_DENTS 688=item IO::AIO::READDIR_DENTS
651 689
652When this flag is off, then the callback gets an arrayref with of names 690When this flag is off, then the callback gets an arrayref consisting of
653only (as with C<aio_readdir>), otherwise it gets an arrayref with 691names only (as with C<aio_readdir>), otherwise it gets an arrayref with
654C<[$name, $type, $inode]> arrayrefs, each describing a single directory 692C<[$name, $type, $inode]> arrayrefs, each describing a single directory
655entry in more detail. 693entry in more detail.
656 694
657C<$name> is the name of the entry. 695C<$name> is the name of the entry.
658 696
671systems that do not deliver the inode information. 709systems that do not deliver the inode information.
672 710
673=item IO::AIO::READDIR_DIRS_FIRST 711=item IO::AIO::READDIR_DIRS_FIRST
674 712
675When this flag is set, then the names will be returned in an order where 713When this flag is set, then the names will be returned in an order where
676likely directories come first. This is useful when you need to quickly 714likely directories come first, in optimal stat order. This is useful when
677find directories, or you want to find all directories while avoiding to 715you need to quickly find directories, or you want to find all directories
678stat() each entry. 716while avoiding to stat() each entry.
679 717
680If the system returns type information in readdir, then this is used 718If the system returns type information in readdir, then this is used
681to find directories directly. Otherwise, likely directories are files 719to find directories directly. Otherwise, likely directories are names
682beginning with ".", or otherwise files with no dots, of which files with 720beginning with ".", or otherwise names with no dots, of which names with
683short names are tried first. 721short names are tried first.
684 722
685=item IO::AIO::READDIR_STAT_ORDER 723=item IO::AIO::READDIR_STAT_ORDER
686 724
687When this flag is set, then the names will be returned in an order 725When this flag is set, then the names will be returned in an order
832 if ($_[0] && $! == EXDEV) { 870 if ($_[0] && $! == EXDEV) {
833 aioreq_pri $pri; 871 aioreq_pri $pri;
834 add $grp aio_copy $src, $dst, sub { 872 add $grp aio_copy $src, $dst, sub {
835 $grp->result ($_[0]); 873 $grp->result ($_[0]);
836 874
837 if (!$_[0]) { 875 unless ($_[0]) {
838 aioreq_pri $pri; 876 aioreq_pri $pri;
839 add $grp aio_unlink $src; 877 add $grp aio_unlink $src;
840 } 878 }
841 }; 879 };
842 } else { 880 } else {
1385 1423
1386See C<poll_cb> for an example. 1424See C<poll_cb> for an example.
1387 1425
1388=item IO::AIO::poll_cb 1426=item IO::AIO::poll_cb
1389 1427
1390Process some outstanding events on the result pipe. You have to call this 1428Process some outstanding events on the result pipe. You have to call
1391regularly. Returns C<0> if all events could be processed, or C<-1> if it 1429this regularly. Returns C<0> if all events could be processed (or there
1392returned earlier for whatever reason. Returns immediately when no events 1430were no events to process), or C<-1> if it returned earlier for whatever
1393are outstanding. The amount of events processed depends on the settings of 1431reason. Returns immediately when no events are outstanding. The amount of
1394C<IO::AIO::max_poll_req> and C<IO::AIO::max_poll_time>. 1432events processed depends on the settings of C<IO::AIO::max_poll_req> and
1433C<IO::AIO::max_poll_time>.
1395 1434
1396If not all requests were processed for whatever reason, the filehandle 1435If not all requests were processed for whatever reason, the filehandle
1397will still be ready when C<poll_cb> returns, so normally you don't have to 1436will still be ready when C<poll_cb> returns, so normally you don't have to
1398do anything special to have it called later. 1437do anything special to have it called later.
1438
1439Apart from calling C<IO::AIO::poll_cb> when the event filehandle becomes
1440ready, it can be beneficial to call this function from loops which submit
1441a lot of requests, to make sure the results get processed when they become
1442available and not just when the loop is finished and the event loop takes
1443over again. This function returns very fast when there are no outstanding
1444requests.
1399 1445
1400Example: Install an Event watcher that automatically calls 1446Example: Install an Event watcher that automatically calls
1401IO::AIO::poll_cb with high priority (more examples can be found in the 1447IO::AIO::poll_cb with high priority (more examples can be found in the
1402SYNOPSIS section, at the top of this document): 1448SYNOPSIS section, at the top of this document):
1403 1449
1505 1551
1506Under normal circumstances you don't need to call this function. 1552Under normal circumstances you don't need to call this function.
1507 1553
1508=item IO::AIO::max_idle $nthreads 1554=item IO::AIO::max_idle $nthreads
1509 1555
1510Limit the number of threads (default: 4) that are allowed to idle (i.e., 1556Limit the number of threads (default: 4) that are allowed to idle
1511threads that did not get a request to process within 10 seconds). That 1557(i.e., threads that did not get a request to process within the idle
1512means if a thread becomes idle while C<$nthreads> other threads are also 1558timeout (default: 10 seconds). That means if a thread becomes idle while
1513idle, it will free its resources and exit. 1559C<$nthreads> other threads are also idle, it will free its resources and
1560exit.
1514 1561
1515This is useful when you allow a large number of threads (e.g. 100 or 1000) 1562This is useful when you allow a large number of threads (e.g. 100 or 1000)
1516to allow for extremely high load situations, but want to free resources 1563to allow for extremely high load situations, but want to free resources
1517under normal circumstances (1000 threads can easily consume 30MB of RAM). 1564under normal circumstances (1000 threads can easily consume 30MB of RAM).
1518 1565
1519The default is probably ok in most situations, especially if thread 1566The default is probably ok in most situations, especially if thread
1520creation is fast. If thread creation is very slow on your system you might 1567creation is fast. If thread creation is very slow on your system you might
1521want to use larger values. 1568want to use larger values.
1522 1569
1570=item IO::AIO::idle_timeout $seconds
1571
1572Sets the minimum idle timeout (default 10) after which worker threads are
1573allowed to exit. SEe C<IO::AIO::max_idle>.
1574
1523=item IO::AIO::max_outstanding $maxreqs 1575=item IO::AIO::max_outstanding $maxreqs
1576
1577Sets the maximum number of outstanding requests to C<$nreqs>. If
1578you do queue up more than this number of requests, the next call to
1579C<IO::AIO::poll_cb> (and other functions calling C<poll_cb>, such as
1580C<IO::AIO::flush> or C<IO::AIO::poll>) will block until the limit is no
1581longer exceeded.
1582
1583In other words, this setting does not enforce a queue limit, but can be
1584used to make poll functions block if the limit is exceeded.
1524 1585
1525This is a very bad function to use in interactive programs because it 1586This is a very bad function to use in interactive programs because it
1526blocks, and a bad way to reduce concurrency because it is inexact: Better 1587blocks, and a bad way to reduce concurrency because it is inexact: Better
1527use an C<aio_group> together with a feed callback. 1588use an C<aio_group> together with a feed callback.
1528 1589
1529Sets the maximum number of outstanding requests to C<$nreqs>. If you 1590It's main use is in scripts without an event loop - when you want to stat
1530do queue up more than this number of requests, the next call to the 1591a lot of files, you can write somehting like this:
1531C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
1532function will block until the limit is no longer exceeded.
1533 1592
1534The default value is very large, so there is no practical limit on the 1593 IO::AIO::max_outstanding 32;
1535number of outstanding requests.
1536 1594
1537You can still queue as many requests as you want. Therefore, 1595 for my $path (...) {
1538C<max_outstanding> is mainly useful in simple scripts (with low values) or 1596 aio_stat $path , ...;
1539as a stop gap to shield against fatal memory overflow (with large values). 1597 IO::AIO::poll_cb;
1598 }
1599
1600 IO::AIO::flush;
1601
1602The call to C<poll_cb> inside the loop will normally return instantly, but
1603as soon as more thna C<32> reqeusts are in-flight, it will block until
1604some requests have been handled. This keeps the loop from pushing a large
1605number of C<aio_stat> requests onto the queue.
1606
1607The default value for C<max_outstanding> is very large, so there is no
1608practical limit on the number of outstanding requests.
1540 1609
1541=back 1610=back
1542 1611
1543=head3 STATISTICAL INFORMATION 1612=head3 STATISTICAL INFORMATION
1544 1613
1720 Danga::Socket->AddOtherFds (IO::AIO::poll_fileno => 1789 Danga::Socket->AddOtherFds (IO::AIO::poll_fileno =>
1721 \&IO::AIO::poll_cb); 1790 \&IO::AIO::poll_cb);
1722 1791
1723=head2 FORK BEHAVIOUR 1792=head2 FORK BEHAVIOUR
1724 1793
1725This module should do "the right thing" when the process using it forks: 1794Usage of pthreads in a program changes the semantics of fork
1795considerably. Specifically, only async-safe functions can be called after
1796fork. Perl doesn't know about this, so in general, you cannot call fork
1797with defined behaviour in perl. IO::AIO uses pthreads, so this applies,
1798but many other extensions and (for inexplicable reasons) perl itself often
1799is linked against pthreads, so this limitation applies.
1726 1800
1727Before the fork, IO::AIO enters a quiescent state where no requests 1801Some operating systems have extensions that allow safe use of fork, and
1728can be added in other threads and no results will be processed. After 1802this module should do "the right thing" on those, and tries on others. At
1729the fork the parent simply leaves the quiescent state and continues 1803the time of this writing (2011) only GNU/Linux supports these extensions
1730request/result processing, while the child frees the request/result queue 1804to POSIX.
1731(so that the requests started before the fork will only be handled in the
1732parent). Threads will be started on demand until the limit set in the
1733parent process has been reached again.
1734
1735In short: the parent will, after a short pause, continue as if fork had
1736not been called, while the child will act as if IO::AIO has not been used
1737yet.
1738 1805
1739=head2 MEMORY USAGE 1806=head2 MEMORY USAGE
1740 1807
1741Per-request usage: 1808Per-request usage:
1742 1809

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines