ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.67 by root, Tue Oct 24 02:25:16 2006 UTC vs.
Revision 1.72 by root, Tue Oct 24 14:25:53 2006 UTC

18 }; 18 };
19 19
20 # version 2+ has request and group objects 20 # version 2+ has request and group objects
21 use IO::AIO 2; 21 use IO::AIO 2;
22 22
23 aioreq_pri 4; # give next request a very high priority
23 my $req = aio_unlink "/tmp/file", sub { }; 24 my $req = aio_unlink "/tmp/file", sub { };
24 $req->cancel; # cancel request if still in queue 25 $req->cancel; # cancel request if still in queue
25 26
26 my $grp = aio_group sub { print "all stats done\n" }; 27 my $grp = aio_group sub { print "all stats done\n" };
27 add $grp aio_stat "..." for ...; 28 add $grp aio_stat "..." for ...;
50=head1 DESCRIPTION 51=head1 DESCRIPTION
51 52
52This module implements asynchronous I/O using whatever means your 53This module implements asynchronous I/O using whatever means your
53operating system supports. 54operating system supports.
54 55
55Currently, a number of threads are started that execute your read/writes 56In this version, a number of threads are started that execute your
56and signal their completion. You don't need thread support in perl, and 57requests and signal their completion. You don't need thread support
57the threads created by this module will not be visible to perl. In the 58in perl, and the threads created by this module will not be visible
58future, this module might make use of the native aio functions available 59to perl. In the future, this module might make use of the native aio
59on many operating systems. However, they are often not well-supported 60functions available on many operating systems. However, they are often
60(Linux doesn't allow them on normal files currently, for example), 61not well-supported or restricted (Linux doesn't allow them on normal
61and they would only support aio_read and aio_write, so the remaining 62files currently, for example), and they would only support aio_read and
62functionality would have to be implemented using threads anyway. 63aio_write, so the remaining functionality would have to be implemented
64using threads anyway.
63 65
64Although the module will work with in the presence of other threads, it is 66Although the module will work with in the presence of other (Perl-)
65currently not reentrant, so use appropriate locking yourself, always call 67threads, it is currently not reentrant in any way, so use appropriate
66C<poll_cb> from within the same thread, or never call C<poll_cb> (or other 68locking yourself, always call C<poll_cb> from within the same thread, or
67C<aio_> functions) recursively. 69never call C<poll_cb> (or other C<aio_> functions) recursively.
70
71=head1 REQUEST ANATOMY AND LIFETIME
72
73Every C<aio_*> function creates a request. which is a C data structure not
74directly visible to Perl.
75
76If called in non-void context, every request function returns a Perl
77object representing the request. In void context, nothing is returned,
78which saves a bit of memory.
79
80The perl object is a fairly standard ref-to-hash object. The hash contents
81are not used by IO::AIO so you are free to store anything you like in it.
82
83During their existance, aio requests travel through the following states,
84in order:
85
86=over 4
87
88=item ready
89
90Immediately after a request is created it is put into the ready state,
91waiting for a thread to execute it.
92
93=item execute
94
95A thread has accepted the request for processing and is currently
96executing it (e.g. blocking in read).
97
98=item pending
99
100The request has been executed and is waiting for result processing.
101
102While request submission and execution is fully asynchronous, result
103processing is not and relies on the perl interpreter calling C<poll_cb>
104(or another function with the same effect).
105
106=item result
107
108The request results are processed synchronously by C<poll_cb>.
109
110The C<poll_cb> function will process all outstanding aio requests by
111calling their callbacks, freeing memory associated with them and managing
112any groups they are contained in.
113
114=item done
115
116Request has reached the end of its lifetime and holds no resources anymore
117(except possibly for the Perl object, but its connection to the actual
118aio request is severed and calling its methods will either do nothing or
119result in a runtime error).
68 120
69=cut 121=cut
70 122
71package IO::AIO; 123package IO::AIO;
72 124
80 132
81 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat 133 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat
82 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink 134 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink
83 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move 135 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move
84 aio_group aio_nop); 136 aio_group aio_nop);
85 our @EXPORT = (@AIO_REQ, qw(aioreq_pri)); 137 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice));
86 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush 138 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
87 min_parallel max_parallel max_outstanding nreqs); 139 min_parallel max_parallel max_outstanding nreqs);
88 140
89 @IO::AIO::GRP::ISA = 'IO::AIO::REQ'; 141 @IO::AIO::GRP::ISA = 'IO::AIO::REQ';
90 142
123environment, d) use Glib::filename_from_unicode on unicode filenames or e) 175environment, d) use Glib::filename_from_unicode on unicode filenames or e)
124use something else. 176use something else.
125 177
126=over 4 178=over 4
127 179
180=item aioreq_pri $pri
181
182Sets the priority for the next aio request. The default priority
183is C<0>, the minimum and maximum priorities are C<-4> and C<4>,
184respectively. Requests with higher priority will be serviced first.
185
186The priority will be reset to C<0> after each call to one of the C<aio_>
187functions.
188
189Example: open a file with low priority, then read something from it with
190higher priority so the read request is serviced before other low priority
191open requests (potentially spamming the cache):
192
193 aioreq_pri -3;
194 aio_open ..., sub {
195 return unless $_[0];
196
197 aioreq_pri -2;
198 aio_read $_[0], ..., sub {
199 ...
200 };
201 };
202
203=item aioreq_nice $pri_adjust
204
205Similar to C<aioreq_pri>, but subtracts the given value from the current
206priority, so effects are cumulative.
207
128=item aio_open $pathname, $flags, $mode, $callback->($fh) 208=item aio_open $pathname, $flags, $mode, $callback->($fh)
129 209
130Asynchronously open or create a file and call the callback with a newly 210Asynchronously open or create a file and call the callback with a newly
131created filehandle for the file. 211created filehandle for the file.
132 212
183 $_[0] > 0 or die "read error: $!"; 263 $_[0] > 0 or die "read error: $!";
184 print "read $_[0] bytes: <$buffer>\n"; 264 print "read $_[0] bytes: <$buffer>\n";
185 }; 265 };
186 266
187=item aio_move $srcpath, $dstpath, $callback->($status) 267=item aio_move $srcpath, $dstpath, $callback->($status)
188
189[EXPERIMENTAL due to internal aio_group use]
190 268
191Try to move the I<file> (directories not supported as either source or 269Try to move the I<file> (directories not supported as either source or
192destination) from C<$srcpath> to C<$dstpath> and call the callback with 270destination) from C<$srcpath> to C<$dstpath> and call the callback with
193the C<0> (error) or C<-1> ok. 271the C<0> (error) or C<-1> ok.
194 272
346 424
347The callback a single argument which is either C<undef> or an array-ref 425The callback a single argument which is either C<undef> or an array-ref
348with the filenames. 426with the filenames.
349 427
350=item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs) 428=item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs)
351
352[EXPERIMENTAL due to internal aio_group use]
353 429
354Scans a directory (similar to C<aio_readdir>) but additionally tries to 430Scans a directory (similar to C<aio_readdir>) but additionally tries to
355separate the entries of directory C<$path> into two sets of names, ones 431separate the entries of directory C<$path> into two sets of names, ones
356you can recurse into (directories or links to them), and ones you cannot 432you can recurse into (directories or links to them), and ones you cannot
357recurse into (everything else). 433recurse into (everything else).
519 595
520[EXPERIMENTAL] 596[EXPERIMENTAL]
521 597
522This is a very special aio request: Instead of doing something, it is a 598This is a very special aio request: Instead of doing something, it is a
523container for other aio requests, which is useful if you want to bundle 599container for other aio requests, which is useful if you want to bundle
524many requests into a single, composite, request. 600many requests into a single, composite, request with a definite callback
601and the ability to cancel the whole request with its subrequests.
525 602
526Returns an object of class L<IO::AIO::GRP>. See its documentation below 603Returns an object of class L<IO::AIO::GRP>. See its documentation below
527for more info. 604for more info.
528 605
529Example: 606Example:
548phase and still requires a worker thread. Thus, the callback will not 625phase and still requires a worker thread. Thus, the callback will not
549be executed immediately but only after other requests in the queue have 626be executed immediately but only after other requests in the queue have
550entered their execution phase. This can be used to measure request 627entered their execution phase. This can be used to measure request
551latency. 628latency.
552 629
553=item IO::AIO::aio_sleep $fractional_seconds, $callback->() *NOT EXPORTED* 630=item IO::AIO::aio_busy $fractional_seconds, $callback->() *NOT EXPORTED*
554 631
555Mainly used for debugging and benchmarking, this aio request puts one of 632Mainly used for debugging and benchmarking, this aio request puts one of
556the request workers to sleep for the given time. 633the request workers to sleep for the given time.
557 634
558While it is theoretically handy to have simple I/O scheduling requests 635While it is theoretically handy to have simple I/O scheduling requests
559like sleep and file handle readable/writable, the overhead this creates 636like sleep and file handle readable/writable, the overhead this creates is
560is immense, so do not use this function except to put your application 637immense (it blocks a thread for a long time) so do not use this function
561under artificial I/O pressure. 638except to put your application under artificial I/O pressure.
562 639
563=back 640=back
564 641
565=head2 IO::AIO::REQ CLASS 642=head2 IO::AIO::REQ CLASS
566 643
567All non-aggregate C<aio_*> functions return an object of this class when 644All non-aggregate C<aio_*> functions return an object of this class when
568called in non-void context. 645called in non-void context.
569
570A request always moves through the following five states in its lifetime,
571in order: B<ready> (request has been created, but has not been executed
572yet), B<execute> (request is currently being executed), B<pending>
573(request has been executed but callback has not been called yet),
574B<result> (results are being processed synchronously, includes calling the
575callback) and B<done> (request has reached the end of its lifetime and
576holds no resources anymore).
577 646
578=over 4 647=over 4
579 648
580=item cancel $req 649=item cancel $req
581 650
681example, C<aio_scandir> might generate hundreds of thousands C<aio_stat> 750example, C<aio_scandir> might generate hundreds of thousands C<aio_stat>
682requests, delaying any later requests for a long time. 751requests, delaying any later requests for a long time.
683 752
684To avoid this, and allow incremental generation of requests, you can 753To avoid this, and allow incremental generation of requests, you can
685instead a group and set a feeder on it that generates those requests. The 754instead a group and set a feeder on it that generates those requests. The
686feed callback will be called whenever there are few enough (see C<feed_limit>, 755feed callback will be called whenever there are few enough (see C<limit>,
687below) requests active in the group itself and is expected to queue more 756below) requests active in the group itself and is expected to queue more
688requests. 757requests.
689 758
690The feed can queue as many requests as it likes (i.e. C<add> does not 759The feed callback can queue as many requests as it likes (i.e. C<add> does
691impose any limits). 760not impose any limits).
692 761
693If the feed does not queue more requests when called, it will be 762If the feed does not queue more requests when called, it will be
694automatically removed from the group. 763automatically removed from the group.
695 764
696If the feed limit is C<0>, it will be set to C<2> automatically. 765If the feed limit is C<0>, it will be set to C<2> automatically.
698Example: 767Example:
699 768
700 # stat all files in @files, but only ever use four aio requests concurrently: 769 # stat all files in @files, but only ever use four aio requests concurrently:
701 770
702 my $grp = aio_group sub { print "finished\n" }; 771 my $grp = aio_group sub { print "finished\n" };
703 feed_limit $grp 4; 772 limit $grp 4;
704 feed $grp sub { 773 feed $grp sub {
705 my $file = pop @files 774 my $file = pop @files
706 or return; 775 or return;
707 776
708 add $grp aio_stat $file, sub { ... }; 777 add $grp aio_stat $file, sub { ... };
709 }; 778 };
710 779
711=item feed_limit $grp $num 780=item limit $grp $num
712 781
713Sets the feeder limit for the group: The feeder will be called whenever 782Sets the feeder limit for the group: The feeder will be called whenever
714the group contains less than this many requests. 783the group contains less than this many requests.
715 784
716Setting the limit to C<0> will pause the feeding process. 785Setting the limit to C<0> will pause the feeding process.
862This module should do "the right thing" when the process using it forks: 931This module should do "the right thing" when the process using it forks:
863 932
864Before the fork, IO::AIO enters a quiescent state where no requests 933Before the fork, IO::AIO enters a quiescent state where no requests
865can be added in other threads and no results will be processed. After 934can be added in other threads and no results will be processed. After
866the fork the parent simply leaves the quiescent state and continues 935the fork the parent simply leaves the quiescent state and continues
867request/result processing, while the child clears the request/result 936request/result processing, while the child frees the request/result queue
868queue (so the requests started before the fork will only be handled in 937(so that the requests started before the fork will only be handled in the
869the parent). Threads will be started on demand until the limit ste in the 938parent). Threads will be started on demand until the limit set in the
870parent process has been reached again. 939parent process has been reached again.
940
941Temporary memory that was allocated for request processing is not
942reclaimed in the child, however. While this is possible in some cases, it
943is almost impossible in others (threads are evil you know), so you will
944have to live with it. This is around 64k buffer (for sendfile, readahead
945emulation) + the size of the directory being scanned (readdir).
871 946
872In short: the parent will, after a short pause, continue as if fork had 947In short: the parent will, after a short pause, continue as if fork had
873not been called, while the child will act as if IO::AIO has not been used 948not been called, while the child will act as if IO::AIO has not been used
874yet. 949yet.
875 950
876=head2 MEMORY USAGE 951=head2 MEMORY USAGE
877 952
953Per-request usage:
954
878Each aio request uses - depending on your architecture - around 128 bytes 955Each aio request uses - depending on your architecture - around 100-200
879of memory. In addition, stat requests need a stat buffer (possibly a few 956bytes of memory. In addition, stat requests need a stat buffer (possibly
880hundred bytes). Perl scalars and other data passed into aio requests will 957a few hundred bytes), readdir requires a result buffer and so on. Perl
881also be locked. 958scalars and other data passed into aio requests will also be locked and
959will consume memory till the request has entered the done state.
882 960
883This is now awfully much, so queuing lots of requests is not usually a 961This is now awfully much, so queuing lots of requests is not usually a
884problem. 962problem.
885 963
886Each thread needs a stack area which is usually around 16k, sometimes much 964Per-thread usage:
887larger, depending on the OS. 965
966In the execution phase, some aio requests require more memory for
967temporary buffers, and each thread requires a stack and other data
968structures (usually around 16k-128k, depending on the OS).
969
970=head1 KNOWN BUGS
971
972See FORK BEHAVIOUR, above.
888 973
889=head1 SEE ALSO 974=head1 SEE ALSO
890 975
891L<Coro>, L<Linux::AIO> (obsolete). 976L<Coro::AIO>.
892 977
893=head1 AUTHOR 978=head1 AUTHOR
894 979
895 Marc Lehmann <schmorp@schmorp.de> 980 Marc Lehmann <schmorp@schmorp.de>
896 http://home.schmorp.de/ 981 http://home.schmorp.de/

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines