… | |
… | |
168 | use common::sense; |
168 | use common::sense; |
169 | |
169 | |
170 | use base 'Exporter'; |
170 | use base 'Exporter'; |
171 | |
171 | |
172 | BEGIN { |
172 | BEGIN { |
173 | our $VERSION = '3.9'; |
173 | our $VERSION = '3.91'; |
174 | |
174 | |
175 | our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close |
175 | our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close |
176 | aio_stat aio_lstat aio_unlink aio_rmdir aio_readdir aio_readdirx |
176 | aio_stat aio_lstat aio_unlink aio_rmdir aio_readdir aio_readdirx |
177 | aio_scandir aio_symlink aio_readlink aio_sync aio_fsync |
177 | aio_scandir aio_symlink aio_readlink aio_sync aio_fsync |
178 | aio_fdatasync aio_sync_file_range aio_pathsync aio_readahead |
178 | aio_fdatasync aio_sync_file_range aio_pathsync aio_readahead |
… | |
… | |
436 | |
436 | |
437 | Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts |
437 | Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts |
438 | reading at byte offset C<$in_offset>, and starts writing at the current |
438 | reading at byte offset C<$in_offset>, and starts writing at the current |
439 | file offset of C<$out_fh>. Because of that, it is not safe to issue more |
439 | file offset of C<$out_fh>. Because of that, it is not safe to issue more |
440 | than one C<aio_sendfile> per C<$out_fh>, as they will interfere with each |
440 | than one C<aio_sendfile> per C<$out_fh>, as they will interfere with each |
441 | other. |
441 | other. The same C<$in_fh> works fine though, as this function does not |
|
|
442 | move or use the file offset of C<$in_fh>. |
442 | |
443 | |
443 | Please note that C<aio_sendfile> can read more bytes from C<$in_fh> than |
444 | Please note that C<aio_sendfile> can read more bytes from C<$in_fh> than |
444 | are written, and there is no way to find out how many bytes have been read |
445 | are written, and there is no way to find out how many more bytes have been |
445 | from C<aio_sendfile> alone, as C<aio_sendfile> only provides the number of |
446 | read from C<aio_sendfile> alone, as C<aio_sendfile> only provides the |
446 | bytes written to C<$out_fh>. Only if the result value equals C<$length> |
447 | number of bytes written to C<$out_fh>. Only if the result value equals |
447 | one can assume that C<$length> bytes have been read. |
448 | C<$length> one can assume that C<$length> bytes have been read. |
448 | |
449 | |
449 | Unlike with other C<aio_> functions, it makes a lot of sense to use |
450 | Unlike with other C<aio_> functions, it makes a lot of sense to use |
450 | C<aio_sendfile> on non-blocking sockets, as long as one end (typically |
451 | C<aio_sendfile> on non-blocking sockets, as long as one end (typically |
451 | the C<$in_fh>) is a file - the file I/O will then be asynchronous, while |
452 | the C<$in_fh>) is a file - the file I/O will then be asynchronous, while |
452 | the socket I/O will be non-blocking. Note, however, that you can run into |
453 | the socket I/O will be non-blocking. Note, however, that you can run |
453 | a trap where C<aio_sendfile> reads some data with readahead, then fails |
454 | into a trap where C<aio_sendfile> reads some data with readahead, then |
454 | to write all data, and when the socket is ready the next time, the data |
455 | fails to write all data, and when the socket is ready the next time, the |
455 | in the cache is already lost, forcing C<aio_sendfile> to again hit the |
456 | data in the cache is already lost, forcing C<aio_sendfile> to again hit |
456 | disk. Explicit C<aio_read> + C<aio_write> let's you control resource usage |
457 | the disk. Explicit C<aio_read> + C<aio_write> let's you better control |
457 | much better. |
458 | resource usage. |
458 | |
459 | |
459 | This call tries to make use of a native C<sendfile> syscall to provide |
460 | This call tries to make use of a native C<sendfile>-like syscall to |
460 | zero-copy operation. For this to work, C<$out_fh> should refer to a |
461 | provide zero-copy operation. For this to work, C<$out_fh> should refer to |
461 | socket, and C<$in_fh> should refer to an mmap'able file. |
462 | a socket, and C<$in_fh> should refer to an mmap'able file. |
462 | |
463 | |
463 | If a native sendfile cannot be found or it fails with C<ENOSYS>, |
464 | If a native sendfile cannot be found or it fails with C<ENOSYS>, |
464 | C<ENOTSUP>, C<EOPNOTSUPP>, C<EAFNOSUPPORT>, C<EPROTOTYPE> or C<ENOTSOCK>, |
465 | C<EINVAL>, C<ENOTSUP>, C<EOPNOTSUPP>, C<EAFNOSUPPORT>, C<EPROTOTYPE> or |
465 | it will be emulated, so you can call C<aio_sendfile> on any type of |
466 | C<ENOTSOCK>, it will be emulated, so you can call C<aio_sendfile> on any |
466 | filehandle regardless of the limitations of the operating system. |
467 | type of filehandle regardless of the limitations of the operating system. |
|
|
468 | |
|
|
469 | As native sendfile syscalls (as practically any non-POSIX interface hacked |
|
|
470 | together in a hurry to improve benchmark numbers) tend to be rather buggy |
|
|
471 | on many systems, this implementation tries to work around some known bugs |
|
|
472 | in Linux and FreeBSD kernels (probably others, too), but that might fail, |
|
|
473 | so you really really should check the return value of C<aio_sendfile> - |
|
|
474 | fewre bytes than expected might have been transferred. |
467 | |
475 | |
468 | |
476 | |
469 | =item aio_readahead $fh,$offset,$length, $callback->($retval) |
477 | =item aio_readahead $fh,$offset,$length, $callback->($retval) |
470 | |
478 | |
471 | C<aio_readahead> populates the page cache with data from a file so that |
479 | C<aio_readahead> populates the page cache with data from a file so that |
… | |
… | |
862 | if ($_[0] && $! == EXDEV) { |
870 | if ($_[0] && $! == EXDEV) { |
863 | aioreq_pri $pri; |
871 | aioreq_pri $pri; |
864 | add $grp aio_copy $src, $dst, sub { |
872 | add $grp aio_copy $src, $dst, sub { |
865 | $grp->result ($_[0]); |
873 | $grp->result ($_[0]); |
866 | |
874 | |
867 | if (!$_[0]) { |
875 | unless ($_[0]) { |
868 | aioreq_pri $pri; |
876 | aioreq_pri $pri; |
869 | add $grp aio_unlink $src; |
877 | add $grp aio_unlink $src; |
870 | } |
878 | } |
871 | }; |
879 | }; |
872 | } else { |
880 | } else { |
… | |
… | |
1781 | Danga::Socket->AddOtherFds (IO::AIO::poll_fileno => |
1789 | Danga::Socket->AddOtherFds (IO::AIO::poll_fileno => |
1782 | \&IO::AIO::poll_cb); |
1790 | \&IO::AIO::poll_cb); |
1783 | |
1791 | |
1784 | =head2 FORK BEHAVIOUR |
1792 | =head2 FORK BEHAVIOUR |
1785 | |
1793 | |
1786 | This module should do "the right thing" when the process using it forks: |
1794 | Usage of pthreads in a program changes the semantics of fork |
|
|
1795 | considerably. Specifically, only async-safe functions can be called after |
|
|
1796 | fork. Perl doesn't know about this, so in general, you cannot call fork |
|
|
1797 | with defined behaviour in perl. IO::AIO uses pthreads, so this applies, |
|
|
1798 | but many other extensions and (for inexplicable reasons) perl itself often |
|
|
1799 | is linked against pthreads, so this limitation applies. |
1787 | |
1800 | |
1788 | Before the fork, IO::AIO enters a quiescent state where no requests |
1801 | Some operating systems have extensions that allow safe use of fork, and |
1789 | can be added in other threads and no results will be processed. After |
1802 | this module should do "the right thing" on those, and tries on others. At |
1790 | the fork the parent simply leaves the quiescent state and continues |
1803 | the time of this writing (2011) only GNU/Linux supports these extensions |
1791 | request/result processing, while the child frees the request/result queue |
1804 | to POSIX. |
1792 | (so that the requests started before the fork will only be handled in the |
|
|
1793 | parent). Threads will be started on demand until the limit set in the |
|
|
1794 | parent process has been reached again. |
|
|
1795 | |
|
|
1796 | In short: the parent will, after a short pause, continue as if fork had |
|
|
1797 | not been called, while the child will act as if IO::AIO has not been used |
|
|
1798 | yet. |
|
|
1799 | |
1805 | |
1800 | =head2 MEMORY USAGE |
1806 | =head2 MEMORY USAGE |
1801 | |
1807 | |
1802 | Per-request usage: |
1808 | Per-request usage: |
1803 | |
1809 | |