ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.58 by root, Sun Oct 22 10:33:19 2006 UTC vs.
Revision 1.67 by root, Tue Oct 24 02:25:16 2006 UTC

51 51
52This module implements asynchronous I/O using whatever means your 52This module implements asynchronous I/O using whatever means your
53operating system supports. 53operating system supports.
54 54
55Currently, a number of threads are started that execute your read/writes 55Currently, a number of threads are started that execute your read/writes
56and signal their completion. You don't need thread support in your libc or 56and signal their completion. You don't need thread support in perl, and
57perl, and the threads created by this module will not be visible to the 57the threads created by this module will not be visible to perl. In the
58pthreads library. In the future, this module might make use of the native 58future, this module might make use of the native aio functions available
59aio functions available on many operating systems. However, they are often 59on many operating systems. However, they are often not well-supported
60not well-supported (Linux doesn't allow them on normal files currently, 60(Linux doesn't allow them on normal files currently, for example),
61for example), and they would only support aio_read and aio_write, so the 61and they would only support aio_read and aio_write, so the remaining
62remaining functionality would have to be implemented using threads anyway. 62functionality would have to be implemented using threads anyway.
63 63
64Although the module will work with in the presence of other threads, it is 64Although the module will work with in the presence of other threads, it is
65currently not reentrant, so use appropriate locking yourself, always call 65currently not reentrant, so use appropriate locking yourself, always call
66C<poll_cb> from within the same thread, or never call C<poll_cb> (or other 66C<poll_cb> from within the same thread, or never call C<poll_cb> (or other
67C<aio_> functions) recursively. 67C<aio_> functions) recursively.
76use base 'Exporter'; 76use base 'Exporter';
77 77
78BEGIN { 78BEGIN {
79 our $VERSION = '2.0'; 79 our $VERSION = '2.0';
80 80
81 our @EXPORT = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat 81 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat
82 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink 82 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink
83 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move 83 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move
84 aio_group); 84 aio_group aio_nop);
85 our @EXPORT_OK = qw(poll_fileno poll_cb min_parallel max_parallel max_outstanding nreqs); 85 our @EXPORT = (@AIO_REQ, qw(aioreq_pri));
86 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
87 min_parallel max_parallel max_outstanding nreqs);
86 88
87 @IO::AIO::GRP::ISA = 'IO::AIO::REQ'; 89 @IO::AIO::GRP::ISA = 'IO::AIO::REQ';
88 90
89 require XSLoader; 91 require XSLoader;
90 XSLoader::load ("IO::AIO", $VERSION); 92 XSLoader::load ("IO::AIO", $VERSION);
352Scans a directory (similar to C<aio_readdir>) but additionally tries to 354Scans a directory (similar to C<aio_readdir>) but additionally tries to
353separate the entries of directory C<$path> into two sets of names, ones 355separate the entries of directory C<$path> into two sets of names, ones
354you can recurse into (directories or links to them), and ones you cannot 356you can recurse into (directories or links to them), and ones you cannot
355recurse into (everything else). 357recurse into (everything else).
356 358
357C<aio_scandir> is a composite request that consists of many sub 359C<aio_scandir> is a composite request that creates of many sub requests_
358requests. C<$maxreq> specifies the maximum number of outstanding aio 360C<$maxreq> specifies the maximum number of outstanding aio requests that
359requests that this function generates. If it is C<< <= 0 >>, then a 361this function generates. If it is C<< <= 0 >>, then a suitable default
360suitable default will be chosen (currently 8). 362will be chosen (currently 6).
361 363
362On error, the callback is called without arguments, otherwise it receives 364On error, the callback is called without arguments, otherwise it receives
363two array-refs with path-relative entry names. 365two array-refs with path-relative entry names.
364 366
365Example: 367Example:
404sub aio_scandir($$$) { 406sub aio_scandir($$$) {
405 my ($path, $maxreq, $cb) = @_; 407 my ($path, $maxreq, $cb) = @_;
406 408
407 my $grp = aio_group $cb; 409 my $grp = aio_group $cb;
408 410
409 $maxreq = 8 if $maxreq <= 0; 411 $maxreq = 6 if $maxreq <= 0;
410 412
411 # stat once 413 # stat once
412 add $grp aio_stat $path, sub { 414 add $grp aio_stat $path, sub {
413 return $grp->result () if $_[0]; 415 return $grp->result () if $_[0];
414 my $now = time; 416 my $now = time;
445 my (@dirs, @nondirs); 447 my (@dirs, @nondirs);
446 448
447 my ($statcb, $schedcb); 449 my ($statcb, $schedcb);
448 my $nreq = 0; 450 my $nreq = 0;
449 451
452 my $statgrp = add $grp aio_group;
453
450 $schedcb = sub { 454 $schedcb = sub {
451 if (@$entries) { 455 if (@$entries) {
452 if ($nreq < $maxreq) { 456 if ($nreq < $maxreq) {
453 my $ent = pop @$entries; 457 my $ent = pop @$entries;
454 $nreq++; 458 $nreq++;
455 add $grp aio_stat "$path/$ent/.", sub { $statcb->($_[0], $ent) }; 459 add $statgrp aio_stat "$path/$ent/.", sub { $statcb->($_[0], $ent) };
456 } 460 }
457 } elsif (!$nreq) { 461 } elsif (!$nreq) {
458 # finished 462 # finished
463 $statgrp->cancel;
459 undef $statcb; 464 undef $statcb;
460 undef $schedcb; 465 undef $schedcb;
461 $grp->result (\@dirs, \@nondirs) if $cb; 466 $grp->result (\@dirs, \@nondirs);
462 undef $cb;
463 } 467 }
464 }; 468 };
465 $statcb = sub { 469 $statcb = sub {
466 my ($status, $entry) = @_; 470 my ($status, $entry) = @_;
467 471
531 add $grp 535 add $grp
532 (aio_stat ...), 536 (aio_stat ...),
533 (aio_stat ...), 537 (aio_stat ...),
534 ...; 538 ...;
535 539
540=item aio_nop $callback->()
541
542This is a special request - it does nothing in itself and is only used for
543side effects, such as when you want to add a dummy request to a group so
544that finishing the requests in the group depends on executing the given
545code.
546
547While this request does nothing, it still goes through the execution
548phase and still requires a worker thread. Thus, the callback will not
549be executed immediately but only after other requests in the queue have
550entered their execution phase. This can be used to measure request
551latency.
552
536=item IO::AIO::aio_sleep $fractional_seconds, $callback->() *NOT EXPORTED* 553=item IO::AIO::aio_sleep $fractional_seconds, $callback->() *NOT EXPORTED*
537 554
538Mainly used for debugging and benchmarking, this aio request puts one of 555Mainly used for debugging and benchmarking, this aio request puts one of
539the request workers to sleep for the given time. 556the request workers to sleep for the given time.
540 557
558callback) and B<done> (request has reached the end of its lifetime and 575callback) and B<done> (request has reached the end of its lifetime and
559holds no resources anymore). 576holds no resources anymore).
560 577
561=over 4 578=over 4
562 579
563=item $req->cancel 580=item cancel $req
564 581
565Cancels the request, if possible. Has the effect of skipping execution 582Cancels the request, if possible. Has the effect of skipping execution
566when entering the B<execute> state and skipping calling the callback when 583when entering the B<execute> state and skipping calling the callback when
567entering the the B<result> state, but will leave the request otherwise 584entering the the B<result> state, but will leave the request otherwise
568untouched. That means that requests that currently execute will not be 585untouched. That means that requests that currently execute will not be
569stopped and resources held by the request will not be freed prematurely. 586stopped and resources held by the request will not be freed prematurely.
570 587
588=item cb $req $callback->(...)
589
590Replace (or simply set) the callback registered to the request.
591
571=back 592=back
572 593
573=head2 IO::AIO::GRP CLASS 594=head2 IO::AIO::GRP CLASS
574 595
575This class is a subclass of L<IO::AIO::REQ>, so all its methods apply to 596This class is a subclass of L<IO::AIO::REQ>, so all its methods apply to
601 }; 622 };
602 623
603This makes it very easy to create composite requests (see the source of 624This makes it very easy to create composite requests (see the source of
604C<aio_move> for an application) that work and feel like simple requests. 625C<aio_move> for an application) that work and feel like simple requests.
605 626
627=over 4
628
606The IO::AIO::GRP objects will be cleaned up during calls to 629=item * The IO::AIO::GRP objects will be cleaned up during calls to
607C<IO::AIO::poll_cb>, just like any other request. 630C<IO::AIO::poll_cb>, just like any other request.
608 631
609They can be canceled like any other request. Canceling will cancel not 632=item * They can be canceled like any other request. Canceling will cancel not
610just the request itself, but also all requests it contains. 633only the request itself, but also all requests it contains.
611 634
612They can also can also be added to other IO::AIO::GRP objects. 635=item * They can also can also be added to other IO::AIO::GRP objects.
636
637=item * You must not add requests to a group from within the group callback (or
638any later time).
639
640=item * This does not harmonise well with C<max_outstanding>, so best do
641not combine C<aio_group> with it. Groups and feeders are recommended for
642this kind of concurrency-limiting.
643
644=back
613 645
614Their lifetime, simplified, looks like this: when they are empty, they 646Their lifetime, simplified, looks like this: when they are empty, they
615will finish very quickly. If they contain only requests that are in the 647will finish very quickly. If they contain only requests that are in the
616C<done> state, they will also finish. Otherwise they will continue to 648C<done> state, they will also finish. Otherwise they will continue to
617exist. 649exist.
621group. And only when all those requests have finished will the the group 653group. And only when all those requests have finished will the the group
622itself finish. 654itself finish.
623 655
624=over 4 656=over 4
625 657
658=item add $grp ...
659
626=item $grp->add (...) 660=item $grp->add (...)
627
628=item add $grp ...
629 661
630Add one or more requests to the group. Any type of L<IO::AIO::REQ> can 662Add one or more requests to the group. Any type of L<IO::AIO::REQ> can
631be added, including other groups, as long as you do not create circular 663be added, including other groups, as long as you do not create circular
632dependencies. 664dependencies.
633 665
635 667
636=item $grp->result (...) 668=item $grp->result (...)
637 669
638Set the result value(s) that will be passed to the group callback when all 670Set the result value(s) that will be passed to the group callback when all
639subrequests have finished. By default, no argument will be passed. 671subrequests have finished. By default, no argument will be passed.
672
673=item feed $grp $callback->($grp)
674
675[VERY EXPERIMENTAL]
676
677Sets a feeder/generator on this group: every group can have an attached
678generator that generates requests if idle. The idea behind this is that,
679although you could just queue as many requests as you want in a group,
680this might starve other requests for a potentially long time. For
681example, C<aio_scandir> might generate hundreds of thousands C<aio_stat>
682requests, delaying any later requests for a long time.
683
684To avoid this, and allow incremental generation of requests, you can
685instead a group and set a feeder on it that generates those requests. The
686feed callback will be called whenever there are few enough (see C<feed_limit>,
687below) requests active in the group itself and is expected to queue more
688requests.
689
690The feed can queue as many requests as it likes (i.e. C<add> does not
691impose any limits).
692
693If the feed does not queue more requests when called, it will be
694automatically removed from the group.
695
696If the feed limit is C<0>, it will be set to C<2> automatically.
697
698Example:
699
700 # stat all files in @files, but only ever use four aio requests concurrently:
701
702 my $grp = aio_group sub { print "finished\n" };
703 feed_limit $grp 4;
704 feed $grp sub {
705 my $file = pop @files
706 or return;
707
708 add $grp aio_stat $file, sub { ... };
709 };
710
711=item feed_limit $grp $num
712
713Sets the feeder limit for the group: The feeder will be called whenever
714the group contains less than this many requests.
715
716Setting the limit to C<0> will pause the feeding process.
640 717
641=back 718=back
642 719
643=head2 SUPPORT FUNCTIONS 720=head2 SUPPORT FUNCTIONS
644 721
702 IO::AIO::poll_wait, IO::AIO::poll_cb 779 IO::AIO::poll_wait, IO::AIO::poll_cb
703 if IO::AIO::nreqs; 780 if IO::AIO::nreqs;
704 781
705=item IO::AIO::min_parallel $nthreads 782=item IO::AIO::min_parallel $nthreads
706 783
707Set the minimum number of AIO threads to C<$nthreads>. The current default 784Set the minimum number of AIO threads to C<$nthreads>. The current
708is C<4>, which means four asynchronous operations can be done at one time 785default is C<8>, which means eight asynchronous operations can execute
709(the number of outstanding operations, however, is unlimited). 786concurrently at any one time (the number of outstanding requests,
787however, is unlimited).
710 788
711IO::AIO starts threads only on demand, when an AIO request is queued and 789IO::AIO starts threads only on demand, when an AIO request is queued and
712no free thread exists. 790no free thread exists.
713 791
714It is recommended to keep the number of threads low, as some Linux 792It is recommended to keep the number of threads relatively low, as some
715kernel versions will scale negatively with the number of threads (higher 793Linux kernel versions will scale negatively with the number of threads
716parallelity => MUCH higher latency). With current Linux 2.6 versions, 4-32 794(higher parallelity => MUCH higher latency). With current Linux 2.6
717threads should be fine. 795versions, 4-32 threads should be fine.
718 796
719Under most circumstances you don't need to call this function, as the 797Under most circumstances you don't need to call this function, as the
720module selects a default that is suitable for low to moderate load. 798module selects a default that is suitable for low to moderate load.
721 799
722=item IO::AIO::max_parallel $nthreads 800=item IO::AIO::max_parallel $nthreads
733 811
734Under normal circumstances you don't need to call this function. 812Under normal circumstances you don't need to call this function.
735 813
736=item $oldnreqs = IO::AIO::max_outstanding $nreqs 814=item $oldnreqs = IO::AIO::max_outstanding $nreqs
737 815
816[DEPRECATED]
817
738Sets the maximum number of outstanding requests to C<$nreqs>. If you 818Sets the maximum number of outstanding requests to C<$nreqs>. If you
739try to queue up more than this number of requests, the caller will block until 819try to queue up more than this number of requests, the caller will block until
740some requests have been handled. 820some requests have been handled.
741 821
742The default is very large, so normally there is no practical limit. If you 822The default is very large, so normally there is no practical limit. If you
743queue up many requests in a loop it often improves speed if you set 823queue up many requests in a loop it often improves speed if you set
744this to a relatively low number, such as C<100>. 824this to a relatively low number, such as C<100>.
825
826This function does not work well together with C<aio_group>'s, and their
827feeder interface is better suited to limiting concurrency, so do not use
828this function.
745 829
746Under normal circumstances you don't need to call this function. 830Under normal circumstances you don't need to call this function.
747 831
748=back 832=back
749 833
763 or return undef; 847 or return undef;
764 848
765 *$sym 849 *$sym
766} 850}
767 851
768min_parallel 4; 852min_parallel 8;
769 853
770END { 854END {
771 max_parallel 0; 855 max_parallel 0;
772} 856}
773 857
787 871
788In short: the parent will, after a short pause, continue as if fork had 872In short: the parent will, after a short pause, continue as if fork had
789not been called, while the child will act as if IO::AIO has not been used 873not been called, while the child will act as if IO::AIO has not been used
790yet. 874yet.
791 875
876=head2 MEMORY USAGE
877
878Each aio request uses - depending on your architecture - around 128 bytes
879of memory. In addition, stat requests need a stat buffer (possibly a few
880hundred bytes). Perl scalars and other data passed into aio requests will
881also be locked.
882
883This is now awfully much, so queuing lots of requests is not usually a
884problem.
885
886Each thread needs a stack area which is usually around 16k, sometimes much
887larger, depending on the OS.
888
792=head1 SEE ALSO 889=head1 SEE ALSO
793 890
794L<Coro>, L<Linux::AIO> (obsolete). 891L<Coro>, L<Linux::AIO> (obsolete).
795 892
796=head1 AUTHOR 893=head1 AUTHOR

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines