ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.63 by root, Mon Oct 23 00:34:36 2006 UTC vs.
Revision 1.75 by root, Tue Oct 24 20:10:26 2006 UTC

18 }; 18 };
19 19
20 # version 2+ has request and group objects 20 # version 2+ has request and group objects
21 use IO::AIO 2; 21 use IO::AIO 2;
22 22
23 aioreq_pri 4; # give next request a very high priority
23 my $req = aio_unlink "/tmp/file", sub { }; 24 my $req = aio_unlink "/tmp/file", sub { };
24 $req->cancel; # cancel request if still in queue 25 $req->cancel; # cancel request if still in queue
25 26
26 my $grp = aio_group sub { print "all stats done\n" }; 27 my $grp = aio_group sub { print "all stats done\n" };
27 add $grp aio_stat "..." for ...; 28 add $grp aio_stat "..." for ...;
50=head1 DESCRIPTION 51=head1 DESCRIPTION
51 52
52This module implements asynchronous I/O using whatever means your 53This module implements asynchronous I/O using whatever means your
53operating system supports. 54operating system supports.
54 55
55Currently, a number of threads are started that execute your read/writes 56In this version, a number of threads are started that execute your
56and signal their completion. You don't need thread support in your libc or 57requests and signal their completion. You don't need thread support
57perl, and the threads created by this module will not be visible to the 58in perl, and the threads created by this module will not be visible
58pthreads library. In the future, this module might make use of the native 59to perl. In the future, this module might make use of the native aio
59aio functions available on many operating systems. However, they are often 60functions available on many operating systems. However, they are often
60not well-supported (Linux doesn't allow them on normal files currently, 61not well-supported or restricted (Linux doesn't allow them on normal
61for example), and they would only support aio_read and aio_write, so the 62files currently, for example), and they would only support aio_read and
62remaining functionality would have to be implemented using threads anyway. 63aio_write, so the remaining functionality would have to be implemented
64using threads anyway.
63 65
64Although the module will work with in the presence of other threads, it is 66Although the module will work with in the presence of other (Perl-)
65currently not reentrant, so use appropriate locking yourself, always call 67threads, it is currently not reentrant in any way, so use appropriate
66C<poll_cb> from within the same thread, or never call C<poll_cb> (or other 68locking yourself, always call C<poll_cb> from within the same thread, or
67C<aio_> functions) recursively. 69never call C<poll_cb> (or other C<aio_> functions) recursively.
70
71=head1 REQUEST ANATOMY AND LIFETIME
72
73Every C<aio_*> function creates a request. which is a C data structure not
74directly visible to Perl.
75
76If called in non-void context, every request function returns a Perl
77object representing the request. In void context, nothing is returned,
78which saves a bit of memory.
79
80The perl object is a fairly standard ref-to-hash object. The hash contents
81are not used by IO::AIO so you are free to store anything you like in it.
82
83During their existance, aio requests travel through the following states,
84in order:
85
86=over 4
87
88=item ready
89
90Immediately after a request is created it is put into the ready state,
91waiting for a thread to execute it.
92
93=item execute
94
95A thread has accepted the request for processing and is currently
96executing it (e.g. blocking in read).
97
98=item pending
99
100The request has been executed and is waiting for result processing.
101
102While request submission and execution is fully asynchronous, result
103processing is not and relies on the perl interpreter calling C<poll_cb>
104(or another function with the same effect).
105
106=item result
107
108The request results are processed synchronously by C<poll_cb>.
109
110The C<poll_cb> function will process all outstanding aio requests by
111calling their callbacks, freeing memory associated with them and managing
112any groups they are contained in.
113
114=item done
115
116Request has reached the end of its lifetime and holds no resources anymore
117(except possibly for the Perl object, but its connection to the actual
118aio request is severed and calling its methods will either do nothing or
119result in a runtime error).
68 120
69=cut 121=cut
70 122
71package IO::AIO; 123package IO::AIO;
72 124
76use base 'Exporter'; 128use base 'Exporter';
77 129
78BEGIN { 130BEGIN {
79 our $VERSION = '2.0'; 131 our $VERSION = '2.0';
80 132
81 our @EXPORT = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat 133 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat
82 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink 134 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink
83 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move 135 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move
84 aio_group aio_nop); 136 aio_group aio_nop);
85 our @EXPORT_OK = qw(poll_fileno poll_cb min_parallel max_parallel max_outstanding nreqs); 137 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice));
138 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
139 min_parallel max_parallel max_outstanding nreqs);
86 140
87 @IO::AIO::GRP::ISA = 'IO::AIO::REQ'; 141 @IO::AIO::GRP::ISA = 'IO::AIO::REQ';
88 142
89 require XSLoader; 143 require XSLoader;
90 XSLoader::load ("IO::AIO", $VERSION); 144 XSLoader::load ("IO::AIO", $VERSION);
121environment, d) use Glib::filename_from_unicode on unicode filenames or e) 175environment, d) use Glib::filename_from_unicode on unicode filenames or e)
122use something else. 176use something else.
123 177
124=over 4 178=over 4
125 179
180=item aioreq_pri $pri
181
182Sets the priority for the next aio request. The default priority
183is C<0>, the minimum and maximum priorities are C<-4> and C<4>,
184respectively. Requests with higher priority will be serviced first.
185
186The priority will be reset to C<0> after each call to one of the C<aio_>
187functions.
188
189Example: open a file with low priority, then read something from it with
190higher priority so the read request is serviced before other low priority
191open requests (potentially spamming the cache):
192
193 aioreq_pri -3;
194 aio_open ..., sub {
195 return unless $_[0];
196
197 aioreq_pri -2;
198 aio_read $_[0], ..., sub {
199 ...
200 };
201 };
202
203=item aioreq_nice $pri_adjust
204
205Similar to C<aioreq_pri>, but subtracts the given value from the current
206priority, so effects are cumulative.
207
126=item aio_open $pathname, $flags, $mode, $callback->($fh) 208=item aio_open $pathname, $flags, $mode, $callback->($fh)
127 209
128Asynchronously open or create a file and call the callback with a newly 210Asynchronously open or create a file and call the callback with a newly
129created filehandle for the file. 211created filehandle for the file.
130 212
181 $_[0] > 0 or die "read error: $!"; 263 $_[0] > 0 or die "read error: $!";
182 print "read $_[0] bytes: <$buffer>\n"; 264 print "read $_[0] bytes: <$buffer>\n";
183 }; 265 };
184 266
185=item aio_move $srcpath, $dstpath, $callback->($status) 267=item aio_move $srcpath, $dstpath, $callback->($status)
186
187[EXPERIMENTAL due to internal aio_group use]
188 268
189Try to move the I<file> (directories not supported as either source or 269Try to move the I<file> (directories not supported as either source or
190destination) from C<$srcpath> to C<$dstpath> and call the callback with 270destination) from C<$srcpath> to C<$dstpath> and call the callback with
191the C<0> (error) or C<-1> ok. 271the C<0> (error) or C<-1> ok.
192 272
345The callback a single argument which is either C<undef> or an array-ref 425The callback a single argument which is either C<undef> or an array-ref
346with the filenames. 426with the filenames.
347 427
348=item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs) 428=item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs)
349 429
350[EXPERIMENTAL due to internal aio_group use]
351
352Scans a directory (similar to C<aio_readdir>) but additionally tries to 430Scans a directory (similar to C<aio_readdir>) but additionally tries to
353separate the entries of directory C<$path> into two sets of names, ones 431separate the entries of directory C<$path> into two sets of names, ones
354you can recurse into (directories or links to them), and ones you cannot 432you can recurse into (directories or links to them), and ones you cannot
355recurse into (everything else). 433recurse into (everything else).
356 434
442 map [$_, sprintf "%s%04d", (/.\./ ? "1" : "0"), length], 520 map [$_, sprintf "%s%04d", (/.\./ ? "1" : "0"), length],
443 @$entries]; 521 @$entries];
444 522
445 my (@dirs, @nondirs); 523 my (@dirs, @nondirs);
446 524
447 my ($statcb, $schedcb);
448 my $nreq = 0;
449
450 my $statgrp = add $grp aio_group; 525 my $statgrp = add $grp aio_group sub {
526 $grp->result (\@dirs, \@nondirs);
527 };
451 528
452 $schedcb = sub { 529 limit $statgrp $maxreq;
453 if (@$entries) { 530 feed $statgrp sub {
454 if ($nreq < $maxreq) { 531 return unless @$entries;
455 my $ent = pop @$entries; 532 my $entry = pop @$entries;
533
534 add $statgrp aio_stat "$path/$entry/.", sub {
535 if ($_[0] < 0) {
536 push @nondirs, $entry;
537 } else {
538 # need to check for real directory
539 add $statgrp aio_lstat "$path/$entry", sub {
540 if (-d _) {
541 push @dirs, $entry;
542
543 unless (--$ndirs) {
544 push @nondirs, @$entries;
545 feed $statgrp;
546 }
547 } else {
548 push @nondirs, $entry;
549 }
456 $nreq++; 550 }
457 add $statgrp aio_stat "$path/$ent/.", sub { $statcb->($_[0], $ent) };
458 } 551 }
459 } elsif (!$nreq) {
460 # finished
461 $statgrp->cancel;
462 undef $statcb;
463 undef $schedcb;
464 $grp->result (\@dirs, \@nondirs);
465 } 552 };
466 }; 553 };
467 $statcb = sub {
468 my ($status, $entry) = @_;
469
470 if ($status < 0) {
471 $nreq--;
472 push @nondirs, $entry;
473 &$schedcb;
474 } else {
475 # need to check for real directory
476 add $grp aio_lstat "$path/$entry", sub {
477 $nreq--;
478
479 if (-d _) {
480 push @dirs, $entry;
481
482 if (!--$ndirs) {
483 push @nondirs, @$entries;
484 $entries = [];
485 }
486 } else {
487 push @nondirs, $entry;
488 }
489
490 &$schedcb;
491 }
492 }
493 };
494
495 &$schedcb while @$entries && $nreq < $maxreq;
496 }; 554 };
497 }; 555 };
498 }; 556 };
499 557
500 $grp 558 $grp
513If this call isn't available because your OS lacks it or it couldn't be 571If this call isn't available because your OS lacks it or it couldn't be
514detected, it will be emulated by calling C<fsync> instead. 572detected, it will be emulated by calling C<fsync> instead.
515 573
516=item aio_group $callback->(...) 574=item aio_group $callback->(...)
517 575
518[EXPERIMENTAL]
519
520This is a very special aio request: Instead of doing something, it is a 576This is a very special aio request: Instead of doing something, it is a
521container for other aio requests, which is useful if you want to bundle 577container for other aio requests, which is useful if you want to bundle
522many requests into a single, composite, request. 578many requests into a single, composite, request with a definite callback
579and the ability to cancel the whole request with its subrequests.
523 580
524Returns an object of class L<IO::AIO::GRP>. See its documentation below 581Returns an object of class L<IO::AIO::GRP>. See its documentation below
525for more info. 582for more info.
526 583
527Example: 584Example:
540This is a special request - it does nothing in itself and is only used for 597This is a special request - it does nothing in itself and is only used for
541side effects, such as when you want to add a dummy request to a group so 598side effects, such as when you want to add a dummy request to a group so
542that finishing the requests in the group depends on executing the given 599that finishing the requests in the group depends on executing the given
543code. 600code.
544 601
602While this request does nothing, it still goes through the execution
603phase and still requires a worker thread. Thus, the callback will not
604be executed immediately but only after other requests in the queue have
605entered their execution phase. This can be used to measure request
606latency.
607
545=item IO::AIO::aio_sleep $fractional_seconds, $callback->() *NOT EXPORTED* 608=item IO::AIO::aio_busy $fractional_seconds, $callback->() *NOT EXPORTED*
546 609
547Mainly used for debugging and benchmarking, this aio request puts one of 610Mainly used for debugging and benchmarking, this aio request puts one of
548the request workers to sleep for the given time. 611the request workers to sleep for the given time.
549 612
550While it is theoretically handy to have simple I/O scheduling requests 613While it is theoretically handy to have simple I/O scheduling requests
551like sleep and file handle readable/writable, the overhead this creates 614like sleep and file handle readable/writable, the overhead this creates is
552is immense, so do not use this function except to put your application 615immense (it blocks a thread for a long time) so do not use this function
553under artificial I/O pressure. 616except to put your application under artificial I/O pressure.
554 617
555=back 618=back
556 619
557=head2 IO::AIO::REQ CLASS 620=head2 IO::AIO::REQ CLASS
558 621
559All non-aggregate C<aio_*> functions return an object of this class when 622All non-aggregate C<aio_*> functions return an object of this class when
560called in non-void context. 623called in non-void context.
561 624
562A request always moves through the following five states in its lifetime,
563in order: B<ready> (request has been created, but has not been executed
564yet), B<execute> (request is currently being executed), B<pending>
565(request has been executed but callback has not been called yet),
566B<result> (results are being processed synchronously, includes calling the
567callback) and B<done> (request has reached the end of its lifetime and
568holds no resources anymore).
569
570=over 4 625=over 4
571 626
572=item $req->cancel 627=item cancel $req
573 628
574Cancels the request, if possible. Has the effect of skipping execution 629Cancels the request, if possible. Has the effect of skipping execution
575when entering the B<execute> state and skipping calling the callback when 630when entering the B<execute> state and skipping calling the callback when
576entering the the B<result> state, but will leave the request otherwise 631entering the the B<result> state, but will leave the request otherwise
577untouched. That means that requests that currently execute will not be 632untouched. That means that requests that currently execute will not be
578stopped and resources held by the request will not be freed prematurely. 633stopped and resources held by the request will not be freed prematurely.
579 634
635=item cb $req $callback->(...)
636
637Replace (or simply set) the callback registered to the request.
638
580=back 639=back
581 640
582=head2 IO::AIO::GRP CLASS 641=head2 IO::AIO::GRP CLASS
583 642
584This class is a subclass of L<IO::AIO::REQ>, so all its methods apply to 643This class is a subclass of L<IO::AIO::REQ>, so all its methods apply to
641group. And only when all those requests have finished will the the group 700group. And only when all those requests have finished will the the group
642itself finish. 701itself finish.
643 702
644=over 4 703=over 4
645 704
705=item add $grp ...
706
646=item $grp->add (...) 707=item $grp->add (...)
647
648=item add $grp ...
649 708
650Add one or more requests to the group. Any type of L<IO::AIO::REQ> can 709Add one or more requests to the group. Any type of L<IO::AIO::REQ> can
651be added, including other groups, as long as you do not create circular 710be added, including other groups, as long as you do not create circular
652dependencies. 711dependencies.
653 712
654Returns all its arguments. 713Returns all its arguments.
655 714
715=item $grp->cancel_subs
716
717Cancel all subrequests and clears any feeder, but not the group request
718itself. Useful when you queued a lot of events but got a result early.
719
656=item $grp->result (...) 720=item $grp->result (...)
657 721
658Set the result value(s) that will be passed to the group callback when all 722Set the result value(s) that will be passed to the group callback when all
659subrequests have finished. By default, no argument will be passed. 723subrequests have finished. By default, no argument will be passed.
660 724
661=item $grp->set_feeder ($callback->($grp)) 725=item feed $grp $callback->($grp)
662
663[VERY EXPERIMENTAL]
664 726
665Sets a feeder/generator on this group: every group can have an attached 727Sets a feeder/generator on this group: every group can have an attached
666generator that generates requests if idle. The idea behind this is that, 728generator that generates requests if idle. The idea behind this is that,
667although you could just queue as many requests as you want in a group, 729although you could just queue as many requests as you want in a group,
668this might starve other requests for a potentially long time. For 730this might starve other requests for a potentially long time. For
669example, C<aio_scandir> might generate hundreds of thousands C<aio_stat> 731example, C<aio_scandir> might generate hundreds of thousands C<aio_stat>
670requests, delaying any later requests for a long time. 732requests, delaying any later requests for a long time.
671 733
672To avoid this, and allow incremental generation of requests, you can 734To avoid this, and allow incremental generation of requests, you can
673instead a group and set a feeder on it that generates those requests. The 735instead a group and set a feeder on it that generates those requests. The
674feeder will be called whenever there are few enough (see C<feeder_limit>, 736feed callback will be called whenever there are few enough (see C<limit>,
675below) requests active in the group itself and is expected to queue more 737below) requests active in the group itself and is expected to queue more
676requests. 738requests.
677 739
678The feeder can queue as many requests as it likes (i.e. C<add> does not 740The feed callback can queue as many requests as it likes (i.e. C<add> does
679impose any limits). 741not impose any limits).
680 742
681If the feeder does not queue more requests when called, it will be 743If the feed does not queue more requests when called, it will be
682automatically removed from the group. 744automatically removed from the group.
683 745
684If the feeder limit is C<0>, it will be set to C<2> automatically. 746If the feed limit is C<0>, it will be set to C<2> automatically.
685 747
686Example: 748Example:
687 749
688 # stat all files in @files, but only ever use four aio requests concurrently: 750 # stat all files in @files, but only ever use four aio requests concurrently:
689 751
690 my $grp = aio_group sub { print "finished\n" }; 752 my $grp = aio_group sub { print "finished\n" };
691 $grp->feeder_limit (4); 753 limit $grp 4;
692 $grp->set_feeder (sub { 754 feed $grp sub {
693 my $file = pop @files 755 my $file = pop @files
694 or return; 756 or return;
695 757
696 add $grp aio_stat $file, sub { ... }; 758 add $grp aio_stat $file, sub { ... };
697 }); 759 };
698 760
699=item $grp->feeder_limit ($num) 761=item limit $grp $num
700 762
701Sets the feeder limit for the group: The feeder will be called whenever 763Sets the feeder limit for the group: The feeder will be called whenever
702the group contains less than this many requests. 764the group contains less than this many requests.
703 765
704Setting the limit to C<0> will pause the feeding process. 766Setting the limit to C<0> will pause the feeding process.
850This module should do "the right thing" when the process using it forks: 912This module should do "the right thing" when the process using it forks:
851 913
852Before the fork, IO::AIO enters a quiescent state where no requests 914Before the fork, IO::AIO enters a quiescent state where no requests
853can be added in other threads and no results will be processed. After 915can be added in other threads and no results will be processed. After
854the fork the parent simply leaves the quiescent state and continues 916the fork the parent simply leaves the quiescent state and continues
855request/result processing, while the child clears the request/result 917request/result processing, while the child frees the request/result queue
856queue (so the requests started before the fork will only be handled in 918(so that the requests started before the fork will only be handled in the
857the parent). Threads will be started on demand until the limit ste in the 919parent). Threads will be started on demand until the limit set in the
858parent process has been reached again. 920parent process has been reached again.
859 921
860In short: the parent will, after a short pause, continue as if fork had 922In short: the parent will, after a short pause, continue as if fork had
861not been called, while the child will act as if IO::AIO has not been used 923not been called, while the child will act as if IO::AIO has not been used
862yet. 924yet.
863 925
864=head2 MEMORY USAGE 926=head2 MEMORY USAGE
865 927
928Per-request usage:
929
866Each aio request uses - depending on your architecture - around 128 bytes 930Each aio request uses - depending on your architecture - around 100-200
867of memory. In addition, stat requests need a stat buffer (possibly a few 931bytes of memory. In addition, stat requests need a stat buffer (possibly
868hundred bytes). Perl scalars and other data passed into aio requests will 932a few hundred bytes), readdir requires a result buffer and so on. Perl
869also be locked. 933scalars and other data passed into aio requests will also be locked and
934will consume memory till the request has entered the done state.
870 935
871This is now awfully much, so queuing lots of requests is not usually a 936This is now awfully much, so queuing lots of requests is not usually a
872problem. 937problem.
873 938
874Each thread needs a stack area which is usually around 16k, sometimes much 939Per-thread usage:
875larger, depending on the OS. 940
941In the execution phase, some aio requests require more memory for
942temporary buffers, and each thread requires a stack and other data
943structures (usually around 16k-128k, depending on the OS).
944
945=head1 KNOWN BUGS
946
947Known bugs will be fixed in the next release.
876 948
877=head1 SEE ALSO 949=head1 SEE ALSO
878 950
879L<Coro>, L<Linux::AIO> (obsolete). 951L<Coro::AIO>.
880 952
881=head1 AUTHOR 953=head1 AUTHOR
882 954
883 Marc Lehmann <schmorp@schmorp.de> 955 Marc Lehmann <schmorp@schmorp.de>
884 http://home.schmorp.de/ 956 http://home.schmorp.de/

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines