ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
Revision: 1.63
Committed: Mon Oct 23 00:34:36 2006 UTC (17 years, 7 months ago) by root
Branch: MAIN
Changes since 1.62: +8 -1 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3     IO::AIO - Asynchronous Input/Output
4    
5     =head1 SYNOPSIS
6    
7     use IO::AIO;
8    
9 root 1.6 aio_open "/etc/passwd", O_RDONLY, 0, sub {
10     my ($fh) = @_;
11     ...
12     };
13    
14     aio_unlink "/tmp/file", sub { };
15    
16     aio_read $fh, 30000, 1024, $buffer, 0, sub {
17 root 1.8 $_[0] > 0 or die "read error: $!";
18 root 1.6 };
19    
20 root 1.56 # version 2+ has request and group objects
21     use IO::AIO 2;
22 root 1.52
23     my $req = aio_unlink "/tmp/file", sub { };
24     $req->cancel; # cancel request if still in queue
25    
26 root 1.56 my $grp = aio_group sub { print "all stats done\n" };
27     add $grp aio_stat "..." for ...;
28    
29     # AnyEvent integration
30 root 1.42 open my $fh, "<&=" . IO::AIO::poll_fileno or die "$!";
31     my $w = AnyEvent->io (fh => $fh, poll => 'r', cb => sub { IO::AIO::poll_cb });
32    
33 root 1.56 # Event integration
34 root 1.6 Event->io (fd => IO::AIO::poll_fileno,
35 root 1.7 poll => 'r',
36 root 1.6 cb => \&IO::AIO::poll_cb);
37    
38 root 1.56 # Glib/Gtk2 integration
39 root 1.6 add_watch Glib::IO IO::AIO::poll_fileno,
40 root 1.22 in => sub { IO::AIO::poll_cb; 1 };
41 root 1.6
42 root 1.56 # Tk integration
43 root 1.6 Tk::Event::IO->fileevent (IO::AIO::poll_fileno, "",
44     readable => \&IO::AIO::poll_cb);
45    
46 root 1.56 # Danga::Socket integration
47 root 1.11 Danga::Socket->AddOtherFds (IO::AIO::poll_fileno =>
48     \&IO::AIO::poll_cb);
49    
50 root 1.1 =head1 DESCRIPTION
51    
52     This module implements asynchronous I/O using whatever means your
53 root 1.2 operating system supports.
54 root 1.1
55 root 1.2 Currently, a number of threads are started that execute your read/writes
56     and signal their completion. You don't need thread support in your libc or
57     perl, and the threads created by this module will not be visible to the
58     pthreads library. In the future, this module might make use of the native
59     aio functions available on many operating systems. However, they are often
60     not well-supported (Linux doesn't allow them on normal files currently,
61     for example), and they would only support aio_read and aio_write, so the
62     remaining functionality would have to be implemented using threads anyway.
63 root 1.1
64     Although the module will work with in the presence of other threads, it is
65 root 1.22 currently not reentrant, so use appropriate locking yourself, always call
66     C<poll_cb> from within the same thread, or never call C<poll_cb> (or other
67     C<aio_> functions) recursively.
68 root 1.1
69     =cut
70    
71     package IO::AIO;
72    
73 root 1.23 no warnings;
74 root 1.51 use strict 'vars';
75 root 1.23
76 root 1.1 use base 'Exporter';
77    
78     BEGIN {
79 root 1.55 our $VERSION = '2.0';
80 root 1.1
81 root 1.51 our @EXPORT = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat
82     aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink
83 root 1.54 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move
84 root 1.63 aio_group aio_nop);
85 root 1.51 our @EXPORT_OK = qw(poll_fileno poll_cb min_parallel max_parallel max_outstanding nreqs);
86 root 1.1
87 root 1.54 @IO::AIO::GRP::ISA = 'IO::AIO::REQ';
88    
89 root 1.1 require XSLoader;
90 root 1.51 XSLoader::load ("IO::AIO", $VERSION);
91 root 1.1 }
92    
93 root 1.5 =head1 FUNCTIONS
94 root 1.1
95 root 1.5 =head2 AIO FUNCTIONS
96 root 1.1
97 root 1.5 All the C<aio_*> calls are more or less thin wrappers around the syscall
98     with the same name (sans C<aio_>). The arguments are similar or identical,
99 root 1.14 and they all accept an additional (and optional) C<$callback> argument
100     which must be a code reference. This code reference will get called with
101     the syscall return code (e.g. most syscalls return C<-1> on error, unlike
102     perl, which usually delivers "false") as it's sole argument when the given
103     syscall has been executed asynchronously.
104 root 1.1
105 root 1.23 All functions expecting a filehandle keep a copy of the filehandle
106     internally until the request has finished.
107 root 1.1
108 root 1.55 All requests return objects of type L<IO::AIO::REQ> that allow further
109     manipulation of those requests while they are in-flight.
110 root 1.52
111 root 1.28 The pathnames you pass to these routines I<must> be absolute and
112     encoded in byte form. The reason for the former is that at the time the
113     request is being executed, the current working directory could have
114     changed. Alternatively, you can make sure that you never change the
115     current working directory.
116    
117     To encode pathnames to byte form, either make sure you either: a)
118     always pass in filenames you got from outside (command line, readdir
119     etc.), b) are ASCII or ISO 8859-1, c) use the Encode module and encode
120     your pathnames to the locale (or other) encoding in effect in the user
121     environment, d) use Glib::filename_from_unicode on unicode filenames or e)
122     use something else.
123 root 1.1
124 root 1.5 =over 4
125 root 1.1
126 root 1.40 =item aio_open $pathname, $flags, $mode, $callback->($fh)
127 root 1.1
128 root 1.2 Asynchronously open or create a file and call the callback with a newly
129     created filehandle for the file.
130 root 1.1
131     The pathname passed to C<aio_open> must be absolute. See API NOTES, above,
132     for an explanation.
133    
134 root 1.20 The C<$flags> argument is a bitmask. See the C<Fcntl> module for a
135     list. They are the same as used by C<sysopen>.
136    
137     Likewise, C<$mode> specifies the mode of the newly created file, if it
138     didn't exist and C<O_CREAT> has been given, just like perl's C<sysopen>,
139     except that it is mandatory (i.e. use C<0> if you don't create new files,
140     and C<0666> or C<0777> if you do).
141 root 1.1
142     Example:
143    
144     aio_open "/etc/passwd", O_RDONLY, 0, sub {
145 root 1.2 if ($_[0]) {
146     print "open successful, fh is $_[0]\n";
147 root 1.1 ...
148     } else {
149     die "open failed: $!\n";
150     }
151     };
152    
153 root 1.40 =item aio_close $fh, $callback->($status)
154 root 1.1
155 root 1.2 Asynchronously close a file and call the callback with the result
156     code. I<WARNING:> although accepted, you should not pass in a perl
157 root 1.20 filehandle here, as perl will likely close the file descriptor another
158     time when the filehandle is destroyed. Normally, you can safely call perls
159     C<close> or just let filehandles go out of scope.
160    
161     This is supposed to be a bug in the API, so that might change. It's
162     therefore best to avoid this function.
163 root 1.1
164 root 1.40 =item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
165 root 1.1
166 root 1.40 =item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
167 root 1.1
168     Reads or writes C<length> bytes from the specified C<fh> and C<offset>
169     into the scalar given by C<data> and offset C<dataoffset> and calls the
170     callback without the actual number of bytes read (or -1 on error, just
171     like the syscall).
172    
173 root 1.31 The C<$data> scalar I<MUST NOT> be modified in any way while the request
174     is outstanding. Modifying it can result in segfaults or WW3 (if the
175     necessary/optional hardware is installed).
176    
177 root 1.17 Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at
178 root 1.1 offset C<0> within the scalar:
179    
180     aio_read $fh, 7, 15, $buffer, 0, sub {
181 root 1.9 $_[0] > 0 or die "read error: $!";
182     print "read $_[0] bytes: <$buffer>\n";
183 root 1.1 };
184    
185 root 1.50 =item aio_move $srcpath, $dstpath, $callback->($status)
186    
187 root 1.58 [EXPERIMENTAL due to internal aio_group use]
188    
189 root 1.52 Try to move the I<file> (directories not supported as either source or
190     destination) from C<$srcpath> to C<$dstpath> and call the callback with
191     the C<0> (error) or C<-1> ok.
192 root 1.50
193     This is a composite request that tries to rename(2) the file first. If
194     rename files with C<EXDEV>, it creates the destination file with mode 0200
195     and copies the contents of the source file into it using C<aio_sendfile>,
196     followed by restoring atime, mtime, access mode and uid/gid, in that
197     order, and unlinking the C<$srcpath>.
198    
199     If an error occurs, the partial destination file will be unlinked, if
200     possible, except when setting atime, mtime, access mode and uid/gid, where
201     errors are being ignored.
202    
203     =cut
204    
205     sub aio_move($$$) {
206     my ($src, $dst, $cb) = @_;
207    
208 root 1.58 my $grp = aio_group $cb;
209 root 1.55
210     add $grp aio_rename $src, $dst, sub {
211 root 1.51 if ($_[0] && $! == EXDEV) {
212 root 1.55 add $grp aio_open $src, O_RDONLY, 0, sub {
213 root 1.50 if (my $src_fh = $_[0]) {
214     my @stat = stat $src_fh;
215    
216 root 1.55 add $grp aio_open $dst, O_WRONLY, 0200, sub {
217 root 1.50 if (my $dst_fh = $_[0]) {
218 root 1.55 add $grp aio_sendfile $dst_fh, $src_fh, 0, $stat[7], sub {
219 root 1.50 close $src_fh;
220    
221     if ($_[0] == $stat[7]) {
222     utime $stat[8], $stat[9], $dst;
223     chmod $stat[2] & 07777, $dst_fh;
224     chown $stat[4], $stat[5], $dst_fh;
225     close $dst_fh;
226    
227 root 1.55 add $grp aio_unlink $src, sub {
228 root 1.58 $grp->result ($_[0]);
229 root 1.50 };
230     } else {
231     my $errno = $!;
232 root 1.55 add $grp aio_unlink $dst, sub {
233 root 1.50 $! = $errno;
234 root 1.58 $grp->result (-1);
235 root 1.50 };
236     }
237     };
238     } else {
239 root 1.58 $grp->result (-1);
240 root 1.50 }
241     },
242    
243     } else {
244 root 1.58 $grp->result (-1);
245 root 1.50 }
246     };
247     } else {
248 root 1.58 $grp->result ($_[0]);
249 root 1.50 }
250     };
251 root 1.55
252     $grp
253 root 1.50 }
254    
255 root 1.40 =item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval)
256 root 1.35
257     Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts
258     reading at byte offset C<$in_offset>, and starts writing at the current
259     file offset of C<$out_fh>. Because of that, it is not safe to issue more
260     than one C<aio_sendfile> per C<$out_fh>, as they will interfere with each
261     other.
262    
263     This call tries to make use of a native C<sendfile> syscall to provide
264     zero-copy operation. For this to work, C<$out_fh> should refer to a
265     socket, and C<$in_fh> should refer to mmap'able file.
266    
267     If the native sendfile call fails or is not implemented, it will be
268 root 1.36 emulated, so you can call C<aio_sendfile> on any type of filehandle
269     regardless of the limitations of the operating system.
270 root 1.35
271     Please note, however, that C<aio_sendfile> can read more bytes from
272     C<$in_fh> than are written, and there is no way to find out how many
273 root 1.36 bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only
274     provides the number of bytes written to C<$out_fh>. Only if the result
275     value equals C<$length> one can assume that C<$length> bytes have been
276     read.
277 root 1.35
278 root 1.40 =item aio_readahead $fh,$offset,$length, $callback->($retval)
279 root 1.1
280 root 1.20 C<aio_readahead> populates the page cache with data from a file so that
281 root 1.1 subsequent reads from that file will not block on disk I/O. The C<$offset>
282     argument specifies the starting point from which data is to be read and
283     C<$length> specifies the number of bytes to be read. I/O is performed in
284     whole pages, so that offset is effectively rounded down to a page boundary
285     and bytes are read up to the next page boundary greater than or equal to
286 root 1.20 (off-set+length). C<aio_readahead> does not read beyond the end of the
287 root 1.1 file. The current file offset of the file is left unchanged.
288    
289 root 1.26 If that syscall doesn't exist (likely if your OS isn't Linux) it will be
290     emulated by simply reading the data, which would have a similar effect.
291    
292 root 1.40 =item aio_stat $fh_or_path, $callback->($status)
293 root 1.1
294 root 1.40 =item aio_lstat $fh, $callback->($status)
295 root 1.1
296     Works like perl's C<stat> or C<lstat> in void context. The callback will
297     be called after the stat and the results will be available using C<stat _>
298     or C<-s _> etc...
299    
300     The pathname passed to C<aio_stat> must be absolute. See API NOTES, above,
301     for an explanation.
302    
303     Currently, the stats are always 64-bit-stats, i.e. instead of returning an
304     error when stat'ing a large file, the results will be silently truncated
305     unless perl itself is compiled with large file support.
306    
307     Example: Print the length of F</etc/passwd>:
308    
309     aio_stat "/etc/passwd", sub {
310     $_[0] and die "stat failed: $!";
311     print "size is ", -s _, "\n";
312     };
313    
314 root 1.40 =item aio_unlink $pathname, $callback->($status)
315 root 1.1
316     Asynchronously unlink (delete) a file and call the callback with the
317     result code.
318    
319 root 1.50 =item aio_link $srcpath, $dstpath, $callback->($status)
320    
321     Asynchronously create a new link to the existing object at C<$srcpath> at
322     the path C<$dstpath> and call the callback with the result code.
323    
324     =item aio_symlink $srcpath, $dstpath, $callback->($status)
325    
326     Asynchronously create a new symbolic link to the existing object at C<$srcpath> at
327     the path C<$dstpath> and call the callback with the result code.
328    
329     =item aio_rename $srcpath, $dstpath, $callback->($status)
330    
331     Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as
332     rename(2) and call the callback with the result code.
333    
334 root 1.40 =item aio_rmdir $pathname, $callback->($status)
335 root 1.27
336     Asynchronously rmdir (delete) a directory and call the callback with the
337     result code.
338    
339 root 1.46 =item aio_readdir $pathname, $callback->($entries)
340 root 1.37
341     Unlike the POSIX call of the same name, C<aio_readdir> reads an entire
342     directory (i.e. opendir + readdir + closedir). The entries will not be
343     sorted, and will B<NOT> include the C<.> and C<..> entries.
344    
345     The callback a single argument which is either C<undef> or an array-ref
346     with the filenames.
347    
348 root 1.40 =item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs)
349    
350 root 1.58 [EXPERIMENTAL due to internal aio_group use]
351    
352 root 1.52 Scans a directory (similar to C<aio_readdir>) but additionally tries to
353     separate the entries of directory C<$path> into two sets of names, ones
354     you can recurse into (directories or links to them), and ones you cannot
355     recurse into (everything else).
356    
357 root 1.61 C<aio_scandir> is a composite request that creates of many sub requests_
358     C<$maxreq> specifies the maximum number of outstanding aio requests that
359     this function generates. If it is C<< <= 0 >>, then a suitable default
360     will be chosen (currently 6).
361 root 1.40
362     On error, the callback is called without arguments, otherwise it receives
363     two array-refs with path-relative entry names.
364    
365     Example:
366    
367     aio_scandir $dir, 0, sub {
368     my ($dirs, $nondirs) = @_;
369     print "real directories: @$dirs\n";
370     print "everything else: @$nondirs\n";
371     };
372    
373     Implementation notes.
374    
375     The C<aio_readdir> cannot be avoided, but C<stat()>'ing every entry can.
376    
377     After reading the directory, the modification time, size etc. of the
378 root 1.52 directory before and after the readdir is checked, and if they match (and
379     isn't the current time), the link count will be used to decide how many
380     entries are directories (if >= 2). Otherwise, no knowledge of the number
381     of subdirectories will be assumed.
382    
383     Then entries will be sorted into likely directories (everything without
384     a non-initial dot currently) and likely non-directories (everything
385     else). Then every entry plus an appended C</.> will be C<stat>'ed,
386     likely directories first. If that succeeds, it assumes that the entry
387     is a directory or a symlink to directory (which will be checked
388     seperately). This is often faster than stat'ing the entry itself because
389     filesystems might detect the type of the entry without reading the inode
390     data (e.g. ext2fs filetype feature).
391    
392     If the known number of directories (link count - 2) has been reached, the
393     rest of the entries is assumed to be non-directories.
394    
395     This only works with certainty on POSIX (= UNIX) filesystems, which
396     fortunately are the vast majority of filesystems around.
397    
398     It will also likely work on non-POSIX filesystems with reduced efficiency
399     as those tend to return 0 or 1 as link counts, which disables the
400     directory counting heuristic.
401 root 1.40
402     =cut
403    
404     sub aio_scandir($$$) {
405     my ($path, $maxreq, $cb) = @_;
406    
407 root 1.58 my $grp = aio_group $cb;
408 root 1.55
409 root 1.61 $maxreq = 6 if $maxreq <= 0;
410 root 1.40
411     # stat once
412 root 1.55 add $grp aio_stat $path, sub {
413 root 1.58 return $grp->result () if $_[0];
414 root 1.52 my $now = time;
415 root 1.40 my $hash1 = join ":", (stat _)[0,1,3,7,9];
416    
417     # read the directory entries
418 root 1.55 add $grp aio_readdir $path, sub {
419 root 1.40 my $entries = shift
420 root 1.58 or return $grp->result ();
421 root 1.40
422     # stat the dir another time
423 root 1.55 add $grp aio_stat $path, sub {
424 root 1.40 my $hash2 = join ":", (stat _)[0,1,3,7,9];
425    
426     my $ndirs;
427    
428     # take the slow route if anything looks fishy
429 root 1.52 if ($hash1 ne $hash2 or (stat _)[9] == $now) {
430 root 1.40 $ndirs = -1;
431     } else {
432     # if nlink == 2, we are finished
433     # on non-posix-fs's, we rely on nlink < 2
434     $ndirs = (stat _)[3] - 2
435 root 1.58 or return $grp->result ([], $entries);
436 root 1.40 }
437    
438     # sort into likely dirs and likely nondirs
439     # dirs == files without ".", short entries first
440     $entries = [map $_->[0],
441     sort { $b->[1] cmp $a->[1] }
442     map [$_, sprintf "%s%04d", (/.\./ ? "1" : "0"), length],
443     @$entries];
444    
445     my (@dirs, @nondirs);
446    
447     my ($statcb, $schedcb);
448     my $nreq = 0;
449    
450 root 1.60 my $statgrp = add $grp aio_group;
451    
452 root 1.40 $schedcb = sub {
453     if (@$entries) {
454     if ($nreq < $maxreq) {
455     my $ent = pop @$entries;
456     $nreq++;
457 root 1.60 add $statgrp aio_stat "$path/$ent/.", sub { $statcb->($_[0], $ent) };
458 root 1.40 }
459     } elsif (!$nreq) {
460     # finished
461 root 1.60 $statgrp->cancel;
462 root 1.40 undef $statcb;
463     undef $schedcb;
464 root 1.60 $grp->result (\@dirs, \@nondirs);
465 root 1.40 }
466     };
467     $statcb = sub {
468     my ($status, $entry) = @_;
469    
470     if ($status < 0) {
471     $nreq--;
472     push @nondirs, $entry;
473     &$schedcb;
474     } else {
475     # need to check for real directory
476 root 1.55 add $grp aio_lstat "$path/$entry", sub {
477 root 1.40 $nreq--;
478    
479     if (-d _) {
480     push @dirs, $entry;
481    
482     if (!--$ndirs) {
483     push @nondirs, @$entries;
484     $entries = [];
485     }
486     } else {
487     push @nondirs, $entry;
488     }
489    
490     &$schedcb;
491     }
492     }
493     };
494    
495     &$schedcb while @$entries && $nreq < $maxreq;
496     };
497     };
498     };
499 root 1.55
500     $grp
501 root 1.40 }
502    
503     =item aio_fsync $fh, $callback->($status)
504 root 1.1
505     Asynchronously call fsync on the given filehandle and call the callback
506     with the fsync result code.
507    
508 root 1.40 =item aio_fdatasync $fh, $callback->($status)
509 root 1.1
510     Asynchronously call fdatasync on the given filehandle and call the
511 root 1.26 callback with the fdatasync result code.
512    
513     If this call isn't available because your OS lacks it or it couldn't be
514     detected, it will be emulated by calling C<fsync> instead.
515 root 1.1
516 root 1.58 =item aio_group $callback->(...)
517 root 1.54
518 root 1.55 [EXPERIMENTAL]
519    
520     This is a very special aio request: Instead of doing something, it is a
521     container for other aio requests, which is useful if you want to bundle
522     many requests into a single, composite, request.
523    
524     Returns an object of class L<IO::AIO::GRP>. See its documentation below
525     for more info.
526    
527     Example:
528    
529     my $grp = aio_group sub {
530     print "all stats done\n";
531     };
532    
533     add $grp
534     (aio_stat ...),
535     (aio_stat ...),
536     ...;
537    
538 root 1.63 =item aio_nop $callback->()
539    
540     This is a special request - it does nothing in itself and is only used for
541     side effects, such as when you want to add a dummy request to a group so
542     that finishing the requests in the group depends on executing the given
543     code.
544    
545 root 1.56 =item IO::AIO::aio_sleep $fractional_seconds, $callback->() *NOT EXPORTED*
546 root 1.54
547     Mainly used for debugging and benchmarking, this aio request puts one of
548     the request workers to sleep for the given time.
549    
550 root 1.56 While it is theoretically handy to have simple I/O scheduling requests
551     like sleep and file handle readable/writable, the overhead this creates
552     is immense, so do not use this function except to put your application
553     under artificial I/O pressure.
554    
555 root 1.5 =back
556    
557 root 1.53 =head2 IO::AIO::REQ CLASS
558 root 1.52
559     All non-aggregate C<aio_*> functions return an object of this class when
560     called in non-void context.
561    
562     A request always moves through the following five states in its lifetime,
563     in order: B<ready> (request has been created, but has not been executed
564     yet), B<execute> (request is currently being executed), B<pending>
565     (request has been executed but callback has not been called yet),
566     B<result> (results are being processed synchronously, includes calling the
567     callback) and B<done> (request has reached the end of its lifetime and
568     holds no resources anymore).
569    
570     =over 4
571    
572     =item $req->cancel
573    
574     Cancels the request, if possible. Has the effect of skipping execution
575     when entering the B<execute> state and skipping calling the callback when
576     entering the the B<result> state, but will leave the request otherwise
577     untouched. That means that requests that currently execute will not be
578     stopped and resources held by the request will not be freed prematurely.
579    
580     =back
581    
582 root 1.55 =head2 IO::AIO::GRP CLASS
583    
584     This class is a subclass of L<IO::AIO::REQ>, so all its methods apply to
585     objects of this class, too.
586    
587     A IO::AIO::GRP object is a special request that can contain multiple other
588     aio requests.
589    
590     You create one by calling the C<aio_group> constructing function with a
591     callback that will be called when all contained requests have entered the
592     C<done> state:
593    
594     my $grp = aio_group sub {
595     print "all requests are done\n";
596     };
597    
598     You add requests by calling the C<add> method with one or more
599     C<IO::AIO::REQ> objects:
600    
601     $grp->add (aio_unlink "...");
602    
603 root 1.58 add $grp aio_stat "...", sub {
604     $_[0] or return $grp->result ("error");
605    
606     # add another request dynamically, if first succeeded
607     add $grp aio_open "...", sub {
608     $grp->result ("ok");
609     };
610     };
611 root 1.55
612     This makes it very easy to create composite requests (see the source of
613     C<aio_move> for an application) that work and feel like simple requests.
614    
615 root 1.62 =over 4
616    
617     =item * The IO::AIO::GRP objects will be cleaned up during calls to
618 root 1.55 C<IO::AIO::poll_cb>, just like any other request.
619    
620 root 1.62 =item * They can be canceled like any other request. Canceling will cancel not
621 root 1.59 only the request itself, but also all requests it contains.
622 root 1.55
623 root 1.62 =item * They can also can also be added to other IO::AIO::GRP objects.
624 root 1.55
625 root 1.62 =item * You must not add requests to a group from within the group callback (or
626 root 1.60 any later time).
627    
628 root 1.62 =item * This does not harmonise well with C<max_outstanding>, so best do
629     not combine C<aio_group> with it. Groups and feeders are recommended for
630     this kind of concurrency-limiting.
631    
632     =back
633    
634 root 1.55 Their lifetime, simplified, looks like this: when they are empty, they
635     will finish very quickly. If they contain only requests that are in the
636     C<done> state, they will also finish. Otherwise they will continue to
637     exist.
638    
639 root 1.57 That means after creating a group you have some time to add requests. And
640     in the callbacks of those requests, you can add further requests to the
641     group. And only when all those requests have finished will the the group
642     itself finish.
643    
644 root 1.55 =over 4
645    
646     =item $grp->add (...)
647    
648     =item add $grp ...
649    
650 root 1.57 Add one or more requests to the group. Any type of L<IO::AIO::REQ> can
651     be added, including other groups, as long as you do not create circular
652     dependencies.
653    
654     Returns all its arguments.
655 root 1.55
656 root 1.58 =item $grp->result (...)
657    
658     Set the result value(s) that will be passed to the group callback when all
659     subrequests have finished. By default, no argument will be passed.
660    
661 root 1.60 =item $grp->set_feeder ($callback->($grp))
662    
663     [VERY EXPERIMENTAL]
664    
665     Sets a feeder/generator on this group: every group can have an attached
666     generator that generates requests if idle. The idea behind this is that,
667     although you could just queue as many requests as you want in a group,
668     this might starve other requests for a potentially long time. For
669     example, C<aio_scandir> might generate hundreds of thousands C<aio_stat>
670     requests, delaying any later requests for a long time.
671    
672     To avoid this, and allow incremental generation of requests, you can
673     instead a group and set a feeder on it that generates those requests. The
674     feeder will be called whenever there are few enough (see C<feeder_limit>,
675     below) requests active in the group itself and is expected to queue more
676     requests.
677    
678     The feeder can queue as many requests as it likes (i.e. C<add> does not
679     impose any limits).
680    
681     If the feeder does not queue more requests when called, it will be
682     automatically removed from the group.
683    
684     If the feeder limit is C<0>, it will be set to C<2> automatically.
685    
686     Example:
687    
688     # stat all files in @files, but only ever use four aio requests concurrently:
689    
690     my $grp = aio_group sub { print "finished\n" };
691     $grp->feeder_limit (4);
692     $grp->set_feeder (sub {
693     my $file = pop @files
694     or return;
695    
696     add $grp aio_stat $file, sub { ... };
697     });
698    
699     =item $grp->feeder_limit ($num)
700    
701     Sets the feeder limit for the group: The feeder will be called whenever
702     the group contains less than this many requests.
703    
704     Setting the limit to C<0> will pause the feeding process.
705    
706 root 1.55 =back
707    
708 root 1.5 =head2 SUPPORT FUNCTIONS
709    
710     =over 4
711    
712     =item $fileno = IO::AIO::poll_fileno
713    
714 root 1.20 Return the I<request result pipe file descriptor>. This filehandle must be
715     polled for reading by some mechanism outside this module (e.g. Event or
716     select, see below or the SYNOPSIS). If the pipe becomes readable you have
717     to call C<poll_cb> to check the results.
718 root 1.5
719     See C<poll_cb> for an example.
720    
721     =item IO::AIO::poll_cb
722    
723     Process all outstanding events on the result pipe. You have to call this
724     regularly. Returns the number of events processed. Returns immediately
725     when no events are outstanding.
726    
727 root 1.20 Example: Install an Event watcher that automatically calls
728     IO::AIO::poll_cb with high priority:
729 root 1.5
730     Event->io (fd => IO::AIO::poll_fileno,
731     poll => 'r', async => 1,
732     cb => \&IO::AIO::poll_cb);
733    
734     =item IO::AIO::poll_wait
735    
736     Wait till the result filehandle becomes ready for reading (simply does a
737 root 1.20 C<select> on the filehandle. This is useful if you want to synchronously wait
738 root 1.5 for some requests to finish).
739    
740     See C<nreqs> for an example.
741    
742     =item IO::AIO::nreqs
743    
744 root 1.20 Returns the number of requests currently outstanding (i.e. for which their
745     callback has not been invoked yet).
746 root 1.5
747     Example: wait till there are no outstanding requests anymore:
748    
749     IO::AIO::poll_wait, IO::AIO::poll_cb
750     while IO::AIO::nreqs;
751    
752 root 1.12 =item IO::AIO::flush
753    
754     Wait till all outstanding AIO requests have been handled.
755    
756 root 1.13 Strictly equivalent to:
757    
758     IO::AIO::poll_wait, IO::AIO::poll_cb
759     while IO::AIO::nreqs;
760    
761     =item IO::AIO::poll
762    
763     Waits until some requests have been handled.
764    
765     Strictly equivalent to:
766    
767     IO::AIO::poll_wait, IO::AIO::poll_cb
768     if IO::AIO::nreqs;
769    
770 root 1.5 =item IO::AIO::min_parallel $nthreads
771    
772 root 1.61 Set the minimum number of AIO threads to C<$nthreads>. The current
773     default is C<8>, which means eight asynchronous operations can execute
774     concurrently at any one time (the number of outstanding requests,
775     however, is unlimited).
776 root 1.5
777 root 1.34 IO::AIO starts threads only on demand, when an AIO request is queued and
778     no free thread exists.
779    
780 root 1.61 It is recommended to keep the number of threads relatively low, as some
781     Linux kernel versions will scale negatively with the number of threads
782     (higher parallelity => MUCH higher latency). With current Linux 2.6
783     versions, 4-32 threads should be fine.
784 root 1.5
785 root 1.34 Under most circumstances you don't need to call this function, as the
786     module selects a default that is suitable for low to moderate load.
787 root 1.5
788     =item IO::AIO::max_parallel $nthreads
789    
790 root 1.34 Sets the maximum number of AIO threads to C<$nthreads>. If more than the
791     specified number of threads are currently running, this function kills
792     them. This function blocks until the limit is reached.
793    
794     While C<$nthreads> are zero, aio requests get queued but not executed
795     until the number of threads has been increased again.
796 root 1.5
797     This module automatically runs C<max_parallel 0> at program end, to ensure
798     that all threads are killed and that there are no outstanding requests.
799    
800     Under normal circumstances you don't need to call this function.
801    
802     =item $oldnreqs = IO::AIO::max_outstanding $nreqs
803    
804 root 1.62 [DEPRECATED]
805    
806 root 1.5 Sets the maximum number of outstanding requests to C<$nreqs>. If you
807     try to queue up more than this number of requests, the caller will block until
808     some requests have been handled.
809    
810     The default is very large, so normally there is no practical limit. If you
811 root 1.34 queue up many requests in a loop it often improves speed if you set
812 root 1.5 this to a relatively low number, such as C<100>.
813    
814 root 1.62 This function does not work well together with C<aio_group>'s, and their
815     feeder interface is better suited to limiting concurrency, so do not use
816     this function.
817    
818 root 1.5 Under normal circumstances you don't need to call this function.
819    
820     =back
821    
822 root 1.1 =cut
823    
824 root 1.2 # support function to convert a fd into a perl filehandle
825     sub _fd2fh {
826     return undef if $_[0] < 0;
827    
828 root 1.23 # try to generate nice filehandles
829     my $sym = "IO::AIO::fd#$_[0]";
830     local *$sym;
831 root 1.25
832 root 1.27 open *$sym, "+<&=$_[0]" # usually works under any unix
833     or open *$sym, "<&=$_[0]" # cygwin needs this
834     or open *$sym, ">&=$_[0]" # or this
835 root 1.2 or return undef;
836    
837 root 1.23 *$sym
838 root 1.2 }
839    
840 root 1.61 min_parallel 8;
841 root 1.1
842     END {
843     max_parallel 0;
844     }
845    
846     1;
847    
848 root 1.27 =head2 FORK BEHAVIOUR
849    
850 root 1.52 This module should do "the right thing" when the process using it forks:
851    
852 root 1.34 Before the fork, IO::AIO enters a quiescent state where no requests
853     can be added in other threads and no results will be processed. After
854     the fork the parent simply leaves the quiescent state and continues
855     request/result processing, while the child clears the request/result
856     queue (so the requests started before the fork will only be handled in
857 root 1.52 the parent). Threads will be started on demand until the limit ste in the
858 root 1.34 parent process has been reached again.
859 root 1.27
860 root 1.52 In short: the parent will, after a short pause, continue as if fork had
861     not been called, while the child will act as if IO::AIO has not been used
862     yet.
863    
864 root 1.60 =head2 MEMORY USAGE
865    
866     Each aio request uses - depending on your architecture - around 128 bytes
867     of memory. In addition, stat requests need a stat buffer (possibly a few
868     hundred bytes). Perl scalars and other data passed into aio requests will
869     also be locked.
870    
871     This is now awfully much, so queuing lots of requests is not usually a
872     problem.
873    
874     Each thread needs a stack area which is usually around 16k, sometimes much
875     larger, depending on the OS.
876    
877 root 1.1 =head1 SEE ALSO
878    
879 root 1.52 L<Coro>, L<Linux::AIO> (obsolete).
880 root 1.1
881     =head1 AUTHOR
882    
883     Marc Lehmann <schmorp@schmorp.de>
884     http://home.schmorp.de/
885    
886     =cut
887