ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.59 by root, Sun Oct 22 10:33:26 2006 UTC vs.
Revision 1.73 by root, Tue Oct 24 16:35:04 2006 UTC

18 }; 18 };
19 19
20 # version 2+ has request and group objects 20 # version 2+ has request and group objects
21 use IO::AIO 2; 21 use IO::AIO 2;
22 22
23 aioreq_pri 4; # give next request a very high priority
23 my $req = aio_unlink "/tmp/file", sub { }; 24 my $req = aio_unlink "/tmp/file", sub { };
24 $req->cancel; # cancel request if still in queue 25 $req->cancel; # cancel request if still in queue
25 26
26 my $grp = aio_group sub { print "all stats done\n" }; 27 my $grp = aio_group sub { print "all stats done\n" };
27 add $grp aio_stat "..." for ...; 28 add $grp aio_stat "..." for ...;
50=head1 DESCRIPTION 51=head1 DESCRIPTION
51 52
52This module implements asynchronous I/O using whatever means your 53This module implements asynchronous I/O using whatever means your
53operating system supports. 54operating system supports.
54 55
55Currently, a number of threads are started that execute your read/writes 56In this version, a number of threads are started that execute your
56and signal their completion. You don't need thread support in your libc or 57requests and signal their completion. You don't need thread support
57perl, and the threads created by this module will not be visible to the 58in perl, and the threads created by this module will not be visible
58pthreads library. In the future, this module might make use of the native 59to perl. In the future, this module might make use of the native aio
59aio functions available on many operating systems. However, they are often 60functions available on many operating systems. However, they are often
60not well-supported (Linux doesn't allow them on normal files currently, 61not well-supported or restricted (Linux doesn't allow them on normal
61for example), and they would only support aio_read and aio_write, so the 62files currently, for example), and they would only support aio_read and
62remaining functionality would have to be implemented using threads anyway. 63aio_write, so the remaining functionality would have to be implemented
64using threads anyway.
63 65
64Although the module will work with in the presence of other threads, it is 66Although the module will work with in the presence of other (Perl-)
65currently not reentrant, so use appropriate locking yourself, always call 67threads, it is currently not reentrant in any way, so use appropriate
66C<poll_cb> from within the same thread, or never call C<poll_cb> (or other 68locking yourself, always call C<poll_cb> from within the same thread, or
67C<aio_> functions) recursively. 69never call C<poll_cb> (or other C<aio_> functions) recursively.
70
71=head1 REQUEST ANATOMY AND LIFETIME
72
73Every C<aio_*> function creates a request. which is a C data structure not
74directly visible to Perl.
75
76If called in non-void context, every request function returns a Perl
77object representing the request. In void context, nothing is returned,
78which saves a bit of memory.
79
80The perl object is a fairly standard ref-to-hash object. The hash contents
81are not used by IO::AIO so you are free to store anything you like in it.
82
83During their existance, aio requests travel through the following states,
84in order:
85
86=over 4
87
88=item ready
89
90Immediately after a request is created it is put into the ready state,
91waiting for a thread to execute it.
92
93=item execute
94
95A thread has accepted the request for processing and is currently
96executing it (e.g. blocking in read).
97
98=item pending
99
100The request has been executed and is waiting for result processing.
101
102While request submission and execution is fully asynchronous, result
103processing is not and relies on the perl interpreter calling C<poll_cb>
104(or another function with the same effect).
105
106=item result
107
108The request results are processed synchronously by C<poll_cb>.
109
110The C<poll_cb> function will process all outstanding aio requests by
111calling their callbacks, freeing memory associated with them and managing
112any groups they are contained in.
113
114=item done
115
116Request has reached the end of its lifetime and holds no resources anymore
117(except possibly for the Perl object, but its connection to the actual
118aio request is severed and calling its methods will either do nothing or
119result in a runtime error).
68 120
69=cut 121=cut
70 122
71package IO::AIO; 123package IO::AIO;
72 124
76use base 'Exporter'; 128use base 'Exporter';
77 129
78BEGIN { 130BEGIN {
79 our $VERSION = '2.0'; 131 our $VERSION = '2.0';
80 132
81 our @EXPORT = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat 133 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat
82 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink 134 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink
83 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move 135 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move
84 aio_group); 136 aio_group aio_nop);
85 our @EXPORT_OK = qw(poll_fileno poll_cb min_parallel max_parallel max_outstanding nreqs); 137 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice));
138 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
139 min_parallel max_parallel max_outstanding nreqs);
86 140
87 @IO::AIO::GRP::ISA = 'IO::AIO::REQ'; 141 @IO::AIO::GRP::ISA = 'IO::AIO::REQ';
88 142
89 require XSLoader; 143 require XSLoader;
90 XSLoader::load ("IO::AIO", $VERSION); 144 XSLoader::load ("IO::AIO", $VERSION);
121environment, d) use Glib::filename_from_unicode on unicode filenames or e) 175environment, d) use Glib::filename_from_unicode on unicode filenames or e)
122use something else. 176use something else.
123 177
124=over 4 178=over 4
125 179
180=item aioreq_pri $pri
181
182Sets the priority for the next aio request. The default priority
183is C<0>, the minimum and maximum priorities are C<-4> and C<4>,
184respectively. Requests with higher priority will be serviced first.
185
186The priority will be reset to C<0> after each call to one of the C<aio_>
187functions.
188
189Example: open a file with low priority, then read something from it with
190higher priority so the read request is serviced before other low priority
191open requests (potentially spamming the cache):
192
193 aioreq_pri -3;
194 aio_open ..., sub {
195 return unless $_[0];
196
197 aioreq_pri -2;
198 aio_read $_[0], ..., sub {
199 ...
200 };
201 };
202
203=item aioreq_nice $pri_adjust
204
205Similar to C<aioreq_pri>, but subtracts the given value from the current
206priority, so effects are cumulative.
207
126=item aio_open $pathname, $flags, $mode, $callback->($fh) 208=item aio_open $pathname, $flags, $mode, $callback->($fh)
127 209
128Asynchronously open or create a file and call the callback with a newly 210Asynchronously open or create a file and call the callback with a newly
129created filehandle for the file. 211created filehandle for the file.
130 212
181 $_[0] > 0 or die "read error: $!"; 263 $_[0] > 0 or die "read error: $!";
182 print "read $_[0] bytes: <$buffer>\n"; 264 print "read $_[0] bytes: <$buffer>\n";
183 }; 265 };
184 266
185=item aio_move $srcpath, $dstpath, $callback->($status) 267=item aio_move $srcpath, $dstpath, $callback->($status)
186
187[EXPERIMENTAL due to internal aio_group use]
188 268
189Try to move the I<file> (directories not supported as either source or 269Try to move the I<file> (directories not supported as either source or
190destination) from C<$srcpath> to C<$dstpath> and call the callback with 270destination) from C<$srcpath> to C<$dstpath> and call the callback with
191the C<0> (error) or C<-1> ok. 271the C<0> (error) or C<-1> ok.
192 272
345The callback a single argument which is either C<undef> or an array-ref 425The callback a single argument which is either C<undef> or an array-ref
346with the filenames. 426with the filenames.
347 427
348=item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs) 428=item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs)
349 429
350[EXPERIMENTAL due to internal aio_group use]
351
352Scans a directory (similar to C<aio_readdir>) but additionally tries to 430Scans a directory (similar to C<aio_readdir>) but additionally tries to
353separate the entries of directory C<$path> into two sets of names, ones 431separate the entries of directory C<$path> into two sets of names, ones
354you can recurse into (directories or links to them), and ones you cannot 432you can recurse into (directories or links to them), and ones you cannot
355recurse into (everything else). 433recurse into (everything else).
356 434
357C<aio_scandir> is a composite request that consists of many sub 435C<aio_scandir> is a composite request that creates of many sub requests_
358requests. C<$maxreq> specifies the maximum number of outstanding aio 436C<$maxreq> specifies the maximum number of outstanding aio requests that
359requests that this function generates. If it is C<< <= 0 >>, then a 437this function generates. If it is C<< <= 0 >>, then a suitable default
360suitable default will be chosen (currently 8). 438will be chosen (currently 6).
361 439
362On error, the callback is called without arguments, otherwise it receives 440On error, the callback is called without arguments, otherwise it receives
363two array-refs with path-relative entry names. 441two array-refs with path-relative entry names.
364 442
365Example: 443Example:
404sub aio_scandir($$$) { 482sub aio_scandir($$$) {
405 my ($path, $maxreq, $cb) = @_; 483 my ($path, $maxreq, $cb) = @_;
406 484
407 my $grp = aio_group $cb; 485 my $grp = aio_group $cb;
408 486
409 $maxreq = 8 if $maxreq <= 0; 487 $maxreq = 6 if $maxreq <= 0;
410 488
411 # stat once 489 # stat once
412 add $grp aio_stat $path, sub { 490 add $grp aio_stat $path, sub {
413 return $grp->result () if $_[0]; 491 return $grp->result () if $_[0];
414 my $now = time; 492 my $now = time;
445 my (@dirs, @nondirs); 523 my (@dirs, @nondirs);
446 524
447 my ($statcb, $schedcb); 525 my ($statcb, $schedcb);
448 my $nreq = 0; 526 my $nreq = 0;
449 527
528 my $statgrp = add $grp aio_group;
529
450 $schedcb = sub { 530 $schedcb = sub {
451 if (@$entries) { 531 if (@$entries) {
452 if ($nreq < $maxreq) { 532 if ($nreq < $maxreq) {
453 my $ent = pop @$entries; 533 my $ent = pop @$entries;
454 $nreq++; 534 $nreq++;
455 add $grp aio_stat "$path/$ent/.", sub { $statcb->($_[0], $ent) }; 535 add $statgrp aio_stat "$path/$ent/.", sub { $statcb->($_[0], $ent) };
456 } 536 }
457 } elsif (!$nreq) { 537 } elsif (!$nreq) {
458 # finished 538 # finished
539 $statgrp->cancel;
459 undef $statcb; 540 undef $statcb;
460 undef $schedcb; 541 undef $schedcb;
461 $grp->result (\@dirs, \@nondirs) if $cb; 542 $grp->result (\@dirs, \@nondirs);
462 undef $cb;
463 } 543 }
464 }; 544 };
465 $statcb = sub { 545 $statcb = sub {
466 my ($status, $entry) = @_; 546 my ($status, $entry) = @_;
467 547
515 595
516[EXPERIMENTAL] 596[EXPERIMENTAL]
517 597
518This is a very special aio request: Instead of doing something, it is a 598This is a very special aio request: Instead of doing something, it is a
519container for other aio requests, which is useful if you want to bundle 599container for other aio requests, which is useful if you want to bundle
520many requests into a single, composite, request. 600many requests into a single, composite, request with a definite callback
601and the ability to cancel the whole request with its subrequests.
521 602
522Returns an object of class L<IO::AIO::GRP>. See its documentation below 603Returns an object of class L<IO::AIO::GRP>. See its documentation below
523for more info. 604for more info.
524 605
525Example: 606Example:
531 add $grp 612 add $grp
532 (aio_stat ...), 613 (aio_stat ...),
533 (aio_stat ...), 614 (aio_stat ...),
534 ...; 615 ...;
535 616
617=item aio_nop $callback->()
618
619This is a special request - it does nothing in itself and is only used for
620side effects, such as when you want to add a dummy request to a group so
621that finishing the requests in the group depends on executing the given
622code.
623
624While this request does nothing, it still goes through the execution
625phase and still requires a worker thread. Thus, the callback will not
626be executed immediately but only after other requests in the queue have
627entered their execution phase. This can be used to measure request
628latency.
629
536=item IO::AIO::aio_sleep $fractional_seconds, $callback->() *NOT EXPORTED* 630=item IO::AIO::aio_busy $fractional_seconds, $callback->() *NOT EXPORTED*
537 631
538Mainly used for debugging and benchmarking, this aio request puts one of 632Mainly used for debugging and benchmarking, this aio request puts one of
539the request workers to sleep for the given time. 633the request workers to sleep for the given time.
540 634
541While it is theoretically handy to have simple I/O scheduling requests 635While it is theoretically handy to have simple I/O scheduling requests
542like sleep and file handle readable/writable, the overhead this creates 636like sleep and file handle readable/writable, the overhead this creates is
543is immense, so do not use this function except to put your application 637immense (it blocks a thread for a long time) so do not use this function
544under artificial I/O pressure. 638except to put your application under artificial I/O pressure.
545 639
546=back 640=back
547 641
548=head2 IO::AIO::REQ CLASS 642=head2 IO::AIO::REQ CLASS
549 643
550All non-aggregate C<aio_*> functions return an object of this class when 644All non-aggregate C<aio_*> functions return an object of this class when
551called in non-void context. 645called in non-void context.
552 646
553A request always moves through the following five states in its lifetime,
554in order: B<ready> (request has been created, but has not been executed
555yet), B<execute> (request is currently being executed), B<pending>
556(request has been executed but callback has not been called yet),
557B<result> (results are being processed synchronously, includes calling the
558callback) and B<done> (request has reached the end of its lifetime and
559holds no resources anymore).
560
561=over 4 647=over 4
562 648
563=item $req->cancel 649=item cancel $req
564 650
565Cancels the request, if possible. Has the effect of skipping execution 651Cancels the request, if possible. Has the effect of skipping execution
566when entering the B<execute> state and skipping calling the callback when 652when entering the B<execute> state and skipping calling the callback when
567entering the the B<result> state, but will leave the request otherwise 653entering the the B<result> state, but will leave the request otherwise
568untouched. That means that requests that currently execute will not be 654untouched. That means that requests that currently execute will not be
569stopped and resources held by the request will not be freed prematurely. 655stopped and resources held by the request will not be freed prematurely.
570 656
657=item cb $req $callback->(...)
658
659Replace (or simply set) the callback registered to the request.
660
571=back 661=back
572 662
573=head2 IO::AIO::GRP CLASS 663=head2 IO::AIO::GRP CLASS
574 664
575This class is a subclass of L<IO::AIO::REQ>, so all its methods apply to 665This class is a subclass of L<IO::AIO::REQ>, so all its methods apply to
601 }; 691 };
602 692
603This makes it very easy to create composite requests (see the source of 693This makes it very easy to create composite requests (see the source of
604C<aio_move> for an application) that work and feel like simple requests. 694C<aio_move> for an application) that work and feel like simple requests.
605 695
696=over 4
697
606The IO::AIO::GRP objects will be cleaned up during calls to 698=item * The IO::AIO::GRP objects will be cleaned up during calls to
607C<IO::AIO::poll_cb>, just like any other request. 699C<IO::AIO::poll_cb>, just like any other request.
608 700
609They can be canceled like any other request. Canceling will cancel not 701=item * They can be canceled like any other request. Canceling will cancel not
610only the request itself, but also all requests it contains. 702only the request itself, but also all requests it contains.
611 703
612They can also can also be added to other IO::AIO::GRP objects. 704=item * They can also can also be added to other IO::AIO::GRP objects.
705
706=item * You must not add requests to a group from within the group callback (or
707any later time).
708
709=item * This does not harmonise well with C<max_outstanding>, so best do
710not combine C<aio_group> with it. Groups and feeders are recommended for
711this kind of concurrency-limiting.
712
713=back
613 714
614Their lifetime, simplified, looks like this: when they are empty, they 715Their lifetime, simplified, looks like this: when they are empty, they
615will finish very quickly. If they contain only requests that are in the 716will finish very quickly. If they contain only requests that are in the
616C<done> state, they will also finish. Otherwise they will continue to 717C<done> state, they will also finish. Otherwise they will continue to
617exist. 718exist.
621group. And only when all those requests have finished will the the group 722group. And only when all those requests have finished will the the group
622itself finish. 723itself finish.
623 724
624=over 4 725=over 4
625 726
727=item add $grp ...
728
626=item $grp->add (...) 729=item $grp->add (...)
627
628=item add $grp ...
629 730
630Add one or more requests to the group. Any type of L<IO::AIO::REQ> can 731Add one or more requests to the group. Any type of L<IO::AIO::REQ> can
631be added, including other groups, as long as you do not create circular 732be added, including other groups, as long as you do not create circular
632dependencies. 733dependencies.
633 734
635 736
636=item $grp->result (...) 737=item $grp->result (...)
637 738
638Set the result value(s) that will be passed to the group callback when all 739Set the result value(s) that will be passed to the group callback when all
639subrequests have finished. By default, no argument will be passed. 740subrequests have finished. By default, no argument will be passed.
741
742=item feed $grp $callback->($grp)
743
744[VERY EXPERIMENTAL]
745
746Sets a feeder/generator on this group: every group can have an attached
747generator that generates requests if idle. The idea behind this is that,
748although you could just queue as many requests as you want in a group,
749this might starve other requests for a potentially long time. For
750example, C<aio_scandir> might generate hundreds of thousands C<aio_stat>
751requests, delaying any later requests for a long time.
752
753To avoid this, and allow incremental generation of requests, you can
754instead a group and set a feeder on it that generates those requests. The
755feed callback will be called whenever there are few enough (see C<limit>,
756below) requests active in the group itself and is expected to queue more
757requests.
758
759The feed callback can queue as many requests as it likes (i.e. C<add> does
760not impose any limits).
761
762If the feed does not queue more requests when called, it will be
763automatically removed from the group.
764
765If the feed limit is C<0>, it will be set to C<2> automatically.
766
767Example:
768
769 # stat all files in @files, but only ever use four aio requests concurrently:
770
771 my $grp = aio_group sub { print "finished\n" };
772 limit $grp 4;
773 feed $grp sub {
774 my $file = pop @files
775 or return;
776
777 add $grp aio_stat $file, sub { ... };
778 };
779
780=item limit $grp $num
781
782Sets the feeder limit for the group: The feeder will be called whenever
783the group contains less than this many requests.
784
785Setting the limit to C<0> will pause the feeding process.
640 786
641=back 787=back
642 788
643=head2 SUPPORT FUNCTIONS 789=head2 SUPPORT FUNCTIONS
644 790
702 IO::AIO::poll_wait, IO::AIO::poll_cb 848 IO::AIO::poll_wait, IO::AIO::poll_cb
703 if IO::AIO::nreqs; 849 if IO::AIO::nreqs;
704 850
705=item IO::AIO::min_parallel $nthreads 851=item IO::AIO::min_parallel $nthreads
706 852
707Set the minimum number of AIO threads to C<$nthreads>. The current default 853Set the minimum number of AIO threads to C<$nthreads>. The current
708is C<4>, which means four asynchronous operations can be done at one time 854default is C<8>, which means eight asynchronous operations can execute
709(the number of outstanding operations, however, is unlimited). 855concurrently at any one time (the number of outstanding requests,
856however, is unlimited).
710 857
711IO::AIO starts threads only on demand, when an AIO request is queued and 858IO::AIO starts threads only on demand, when an AIO request is queued and
712no free thread exists. 859no free thread exists.
713 860
714It is recommended to keep the number of threads low, as some Linux 861It is recommended to keep the number of threads relatively low, as some
715kernel versions will scale negatively with the number of threads (higher 862Linux kernel versions will scale negatively with the number of threads
716parallelity => MUCH higher latency). With current Linux 2.6 versions, 4-32 863(higher parallelity => MUCH higher latency). With current Linux 2.6
717threads should be fine. 864versions, 4-32 threads should be fine.
718 865
719Under most circumstances you don't need to call this function, as the 866Under most circumstances you don't need to call this function, as the
720module selects a default that is suitable for low to moderate load. 867module selects a default that is suitable for low to moderate load.
721 868
722=item IO::AIO::max_parallel $nthreads 869=item IO::AIO::max_parallel $nthreads
733 880
734Under normal circumstances you don't need to call this function. 881Under normal circumstances you don't need to call this function.
735 882
736=item $oldnreqs = IO::AIO::max_outstanding $nreqs 883=item $oldnreqs = IO::AIO::max_outstanding $nreqs
737 884
885[DEPRECATED]
886
738Sets the maximum number of outstanding requests to C<$nreqs>. If you 887Sets the maximum number of outstanding requests to C<$nreqs>. If you
739try to queue up more than this number of requests, the caller will block until 888try to queue up more than this number of requests, the caller will block until
740some requests have been handled. 889some requests have been handled.
741 890
742The default is very large, so normally there is no practical limit. If you 891The default is very large, so normally there is no practical limit. If you
743queue up many requests in a loop it often improves speed if you set 892queue up many requests in a loop it often improves speed if you set
744this to a relatively low number, such as C<100>. 893this to a relatively low number, such as C<100>.
894
895This function does not work well together with C<aio_group>'s, and their
896feeder interface is better suited to limiting concurrency, so do not use
897this function.
745 898
746Under normal circumstances you don't need to call this function. 899Under normal circumstances you don't need to call this function.
747 900
748=back 901=back
749 902
763 or return undef; 916 or return undef;
764 917
765 *$sym 918 *$sym
766} 919}
767 920
768min_parallel 4; 921min_parallel 8;
769 922
770END { 923END {
771 max_parallel 0; 924 max_parallel 0;
772} 925}
773 926
778This module should do "the right thing" when the process using it forks: 931This module should do "the right thing" when the process using it forks:
779 932
780Before the fork, IO::AIO enters a quiescent state where no requests 933Before the fork, IO::AIO enters a quiescent state where no requests
781can be added in other threads and no results will be processed. After 934can be added in other threads and no results will be processed. After
782the fork the parent simply leaves the quiescent state and continues 935the fork the parent simply leaves the quiescent state and continues
783request/result processing, while the child clears the request/result 936request/result processing, while the child frees the request/result queue
784queue (so the requests started before the fork will only be handled in 937(so that the requests started before the fork will only be handled in the
785the parent). Threads will be started on demand until the limit ste in the 938parent). Threads will be started on demand until the limit set in the
786parent process has been reached again. 939parent process has been reached again.
787 940
788In short: the parent will, after a short pause, continue as if fork had 941In short: the parent will, after a short pause, continue as if fork had
789not been called, while the child will act as if IO::AIO has not been used 942not been called, while the child will act as if IO::AIO has not been used
790yet. 943yet.
791 944
945=head2 MEMORY USAGE
946
947Per-request usage:
948
949Each aio request uses - depending on your architecture - around 100-200
950bytes of memory. In addition, stat requests need a stat buffer (possibly
951a few hundred bytes), readdir requires a result buffer and so on. Perl
952scalars and other data passed into aio requests will also be locked and
953will consume memory till the request has entered the done state.
954
955This is now awfully much, so queuing lots of requests is not usually a
956problem.
957
958Per-thread usage:
959
960In the execution phase, some aio requests require more memory for
961temporary buffers, and each thread requires a stack and other data
962structures (usually around 16k-128k, depending on the OS).
963
964=head1 KNOWN BUGS
965
966Known bugs will be fixed in the next release.
967
792=head1 SEE ALSO 968=head1 SEE ALSO
793 969
794L<Coro>, L<Linux::AIO> (obsolete). 970L<Coro::AIO>.
795 971
796=head1 AUTHOR 972=head1 AUTHOR
797 973
798 Marc Lehmann <schmorp@schmorp.de> 974 Marc Lehmann <schmorp@schmorp.de>
799 http://home.schmorp.de/ 975 http://home.schmorp.de/

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines