ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/README
Revision: 1.30
Committed: Sat May 10 22:58:16 2008 UTC (16 years ago) by root
Branch: MAIN
CVS Tags: rel-3_0, rel-3_01, rel-3_02
Changes since 1.29: +6 -6 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 NAME
2     IO::AIO - Asynchronous Input/Output
3    
4     SYNOPSIS
5     use IO::AIO;
6    
7 root 1.5 aio_open "/etc/passwd", O_RDONLY, 0, sub {
8 root 1.21 my $fh = shift
9     or die "/etc/passwd: $!";
10 root 1.5 ...
11     };
12    
13     aio_unlink "/tmp/file", sub { };
14    
15     aio_read $fh, 30000, 1024, $buffer, 0, sub {
16     $_[0] > 0 or die "read error: $!";
17     };
18    
19 root 1.18 # version 2+ has request and group objects
20     use IO::AIO 2;
21    
22     aioreq_pri 4; # give next request a very high priority
23     my $req = aio_unlink "/tmp/file", sub { };
24     $req->cancel; # cancel request if still in queue
25    
26     my $grp = aio_group sub { print "all stats done\n" };
27     add $grp aio_stat "..." for ...;
28    
29 root 1.30 # AnyEvent integration (EV, Event, Glib, Tk, POE, urxvt, pureperl...)
30     use AnyEvent::AIO;
31 root 1.14
32 root 1.28 # EV integration
33     my $w = EV::io IO::AIO::poll_fileno, EV::READ, \&IO::AIO::poll_cb;
34    
35 root 1.18 # Event integration
36 root 1.5 Event->io (fd => IO::AIO::poll_fileno,
37     poll => 'r',
38     cb => \&IO::AIO::poll_cb);
39    
40 root 1.18 # Glib/Gtk2 integration
41 root 1.5 add_watch Glib::IO IO::AIO::poll_fileno,
42 root 1.7 in => sub { IO::AIO::poll_cb; 1 };
43 root 1.5
44 root 1.18 # Tk integration
45 root 1.5 Tk::Event::IO->fileevent (IO::AIO::poll_fileno, "",
46     readable => \&IO::AIO::poll_cb);
47    
48 root 1.18 # Danga::Socket integration
49 root 1.6 Danga::Socket->AddOtherFds (IO::AIO::poll_fileno =>
50     \&IO::AIO::poll_cb);
51    
52 root 1.1 DESCRIPTION
53     This module implements asynchronous I/O using whatever means your
54 root 1.2 operating system supports.
55 root 1.1
56 root 1.19 Asynchronous means that operations that can normally block your program
57     (e.g. reading from disk) will be done asynchronously: the operation will
58     still block, but you can do something else in the meantime. This is
59     extremely useful for programs that need to stay interactive even when
60     doing heavy I/O (GUI programs, high performance network servers etc.),
61     but can also be used to easily do operations in parallel that are
62     normally done sequentially, e.g. stat'ing many files, which is much
63     faster on a RAID volume or over NFS when you do a number of stat
64     operations concurrently.
65    
66 root 1.20 While most of this works on all types of file descriptors (for example
67     sockets), using these functions on file descriptors that support
68 root 1.24 nonblocking operation (again, sockets, pipes etc.) is very inefficient.
69     Use an event loop for that (such as the Event module): IO::AIO will
70     naturally fit into such an event loop itself.
71 root 1.19
72 root 1.18 In this version, a number of threads are started that execute your
73     requests and signal their completion. You don't need thread support in
74     perl, and the threads created by this module will not be visible to
75     perl. In the future, this module might make use of the native aio
76     functions available on many operating systems. However, they are often
77 root 1.19 not well-supported or restricted (GNU/Linux doesn't allow them on normal
78 root 1.18 files currently, for example), and they would only support aio_read and
79 root 1.2 aio_write, so the remaining functionality would have to be implemented
80     using threads anyway.
81 root 1.1
82 root 1.24 Although the module will work in the presence of other (Perl-) threads,
83     it is currently not reentrant in any way, so use appropriate locking
84     yourself, always call "poll_cb" from within the same thread, or never
85     call "poll_cb" (or other "aio_" functions) recursively.
86 root 1.18
87 root 1.19 EXAMPLE
88     This is a simple example that uses the Event module and loads
89     /etc/passwd asynchronously:
90    
91     use Fcntl;
92     use Event;
93     use IO::AIO;
94    
95     # register the IO::AIO callback with Event
96     Event->io (fd => IO::AIO::poll_fileno,
97     poll => 'r',
98     cb => \&IO::AIO::poll_cb);
99    
100     # queue the request to open /etc/passwd
101     aio_open "/etc/passwd", O_RDONLY, 0, sub {
102 root 1.21 my $fh = shift
103 root 1.19 or die "error while opening: $!";
104    
105     # stat'ing filehandles is generally non-blocking
106     my $size = -s $fh;
107    
108     # queue a request to read the file
109     my $contents;
110     aio_read $fh, 0, $size, $contents, 0, sub {
111     $_[0] == $size
112     or die "short read: $!";
113    
114     close $fh;
115    
116     # file contents now in $contents
117     print $contents;
118    
119     # exit event loop and program
120     Event::unloop;
121     };
122     };
123    
124     # possibly queue up other requests, or open GUI windows,
125     # check for sockets etc. etc.
126    
127     # process events as long as there are some:
128     Event::loop;
129    
130 root 1.18 REQUEST ANATOMY AND LIFETIME
131     Every "aio_*" function creates a request. which is a C data structure
132     not directly visible to Perl.
133    
134     If called in non-void context, every request function returns a Perl
135     object representing the request. In void context, nothing is returned,
136     which saves a bit of memory.
137    
138     The perl object is a fairly standard ref-to-hash object. The hash
139     contents are not used by IO::AIO so you are free to store anything you
140     like in it.
141    
142     During their existance, aio requests travel through the following
143     states, in order:
144    
145     ready
146     Immediately after a request is created it is put into the ready
147     state, waiting for a thread to execute it.
148    
149     execute
150     A thread has accepted the request for processing and is currently
151     executing it (e.g. blocking in read).
152    
153     pending
154     The request has been executed and is waiting for result processing.
155    
156     While request submission and execution is fully asynchronous, result
157     processing is not and relies on the perl interpreter calling
158     "poll_cb" (or another function with the same effect).
159    
160     result
161     The request results are processed synchronously by "poll_cb".
162    
163     The "poll_cb" function will process all outstanding aio requests by
164     calling their callbacks, freeing memory associated with them and
165     managing any groups they are contained in.
166    
167     done
168     Request has reached the end of its lifetime and holds no resources
169     anymore (except possibly for the Perl object, but its connection to
170     the actual aio request is severed and calling its methods will
171     either do nothing or result in a runtime error).
172 root 1.1
173 root 1.4 FUNCTIONS
174 root 1.19 AIO REQUEST FUNCTIONS
175 root 1.20 All the "aio_*" calls are more or less thin wrappers around the syscall
176     with the same name (sans "aio_"). The arguments are similar or
177     identical, and they all accept an additional (and optional) $callback
178     argument which must be a code reference. This code reference will get
179     called with the syscall return code (e.g. most syscalls return -1 on
180     error, unlike perl, which usually delivers "false") as it's sole
181     argument when the given syscall has been executed asynchronously.
182    
183     All functions expecting a filehandle keep a copy of the filehandle
184     internally until the request has finished.
185    
186     All functions return request objects of type IO::AIO::REQ that allow
187     further manipulation of those requests while they are in-flight.
188    
189     The pathnames you pass to these routines *must* be absolute and encoded
190     as octets. The reason for the former is that at the time the request is
191     being executed, the current working directory could have changed.
192     Alternatively, you can make sure that you never change the current
193     working directory anywhere in the program and then use relative paths.
194    
195     To encode pathnames as octets, either make sure you either: a) always
196     pass in filenames you got from outside (command line, readdir etc.)
197     without tinkering, b) are ASCII or ISO 8859-1, c) use the Encode module
198     and encode your pathnames to the locale (or other) encoding in effect in
199     the user environment, d) use Glib::filename_from_unicode on unicode
200     filenames or e) use something else to ensure your scalar has the correct
201     contents.
202    
203     This works, btw. independent of the internal UTF-8 bit, which IO::AIO
204     handles correctly wether it is set or not.
205    
206     $prev_pri = aioreq_pri [$pri]
207     Returns the priority value that would be used for the next request
208     and, if $pri is given, sets the priority for the next aio request.
209    
210     The default priority is 0, the minimum and maximum priorities are -4
211     and 4, respectively. Requests with higher priority will be serviced
212     first.
213    
214     The priority will be reset to 0 after each call to one of the
215     "aio_*" functions.
216    
217     Example: open a file with low priority, then read something from it
218     with higher priority so the read request is serviced before other
219     low priority open requests (potentially spamming the cache):
220    
221     aioreq_pri -3;
222     aio_open ..., sub {
223     return unless $_[0];
224    
225     aioreq_pri -2;
226     aio_read $_[0], ..., sub {
227     ...
228     };
229     };
230    
231     aioreq_nice $pri_adjust
232     Similar to "aioreq_pri", but subtracts the given value from the
233     current priority, so the effect is cumulative.
234    
235     aio_open $pathname, $flags, $mode, $callback->($fh)
236     Asynchronously open or create a file and call the callback with a
237     newly created filehandle for the file.
238    
239     The pathname passed to "aio_open" must be absolute. See API NOTES,
240     above, for an explanation.
241    
242     The $flags argument is a bitmask. See the "Fcntl" module for a list.
243     They are the same as used by "sysopen".
244    
245     Likewise, $mode specifies the mode of the newly created file, if it
246     didn't exist and "O_CREAT" has been given, just like perl's
247     "sysopen", except that it is mandatory (i.e. use 0 if you don't
248 root 1.23 create new files, and 0666 or 0777 if you do). Note that the $mode
249     will be modified by the umask in effect then the request is being
250     executed, so better never change the umask.
251 root 1.20
252     Example:
253    
254     aio_open "/etc/passwd", O_RDONLY, 0, sub {
255     if ($_[0]) {
256     print "open successful, fh is $_[0]\n";
257     ...
258     } else {
259     die "open failed: $!\n";
260     }
261     };
262    
263     aio_close $fh, $callback->($status)
264     Asynchronously close a file and call the callback with the result
265 root 1.26 code.
266 root 1.20
267 root 1.27 Unfortunately, you can't do this to perl. Perl *insists* very
268     strongly on closing the file descriptor associated with the
269 root 1.29 filehandle itself.
270 root 1.27
271 root 1.29 Therefore, "aio_close" will not close the filehandle - instead it
272     will use dup2 to overwrite the file descriptor with the write-end of
273     a pipe (the pipe fd will be created on demand and will be cached).
274 root 1.27
275 root 1.29 Or in other words: the file descriptor will be closed, but it will
276     not be free for reuse until the perl filehandle is closed.
277 root 1.20
278     aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
279     aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
280 root 1.24 Reads or writes $length bytes from the specified $fh and $offset
281     into the scalar given by $data and offset $dataoffset and calls the
282     callback without the actual number of bytes read (or -1 on error,
283     just like the syscall).
284    
285 root 1.25 If $offset is undefined, then the current file descriptor offset
286     will be used (and updated), otherwise the file descriptor offset
287     will not be changed by these calls.
288 root 1.24
289     If $length is undefined in "aio_write", use the remaining length of
290     $data.
291    
292     If $dataoffset is less than zero, it will be counted from the end of
293     $data.
294 root 1.20
295     The $data scalar *MUST NOT* be modified in any way while the request
296 root 1.24 is outstanding. Modifying it can result in segfaults or World War
297     III (if the necessary/optional hardware is installed).
298 root 1.20
299     Example: Read 15 bytes at offset 7 into scalar $buffer, starting at
300     offset 0 within the scalar:
301    
302     aio_read $fh, 7, 15, $buffer, 0, sub {
303     $_[0] > 0 or die "read error: $!";
304     print "read $_[0] bytes: <$buffer>\n";
305     };
306    
307     aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval)
308     Tries to copy $length bytes from $in_fh to $out_fh. It starts
309     reading at byte offset $in_offset, and starts writing at the current
310     file offset of $out_fh. Because of that, it is not safe to issue
311     more than one "aio_sendfile" per $out_fh, as they will interfere
312     with each other.
313    
314     This call tries to make use of a native "sendfile" syscall to
315     provide zero-copy operation. For this to work, $out_fh should refer
316     to a socket, and $in_fh should refer to mmap'able file.
317    
318     If the native sendfile call fails or is not implemented, it will be
319     emulated, so you can call "aio_sendfile" on any type of filehandle
320     regardless of the limitations of the operating system.
321    
322     Please note, however, that "aio_sendfile" can read more bytes from
323     $in_fh than are written, and there is no way to find out how many
324     bytes have been read from "aio_sendfile" alone, as "aio_sendfile"
325     only provides the number of bytes written to $out_fh. Only if the
326     result value equals $length one can assume that $length bytes have
327     been read.
328    
329     aio_readahead $fh,$offset,$length, $callback->($retval)
330     "aio_readahead" populates the page cache with data from a file so
331     that subsequent reads from that file will not block on disk I/O. The
332     $offset argument specifies the starting point from which data is to
333     be read and $length specifies the number of bytes to be read. I/O is
334     performed in whole pages, so that offset is effectively rounded down
335     to a page boundary and bytes are read up to the next page boundary
336     greater than or equal to (off-set+length). "aio_readahead" does not
337     read beyond the end of the file. The current file offset of the file
338     is left unchanged.
339    
340     If that syscall doesn't exist (likely if your OS isn't Linux) it
341     will be emulated by simply reading the data, which would have a
342     similar effect.
343    
344     aio_stat $fh_or_path, $callback->($status)
345     aio_lstat $fh, $callback->($status)
346     Works like perl's "stat" or "lstat" in void context. The callback
347     will be called after the stat and the results will be available
348     using "stat _" or "-s _" etc...
349    
350     The pathname passed to "aio_stat" must be absolute. See API NOTES,
351     above, for an explanation.
352    
353     Currently, the stats are always 64-bit-stats, i.e. instead of
354     returning an error when stat'ing a large file, the results will be
355     silently truncated unless perl itself is compiled with large file
356     support.
357    
358     Example: Print the length of /etc/passwd:
359    
360     aio_stat "/etc/passwd", sub {
361     $_[0] and die "stat failed: $!";
362     print "size is ", -s _, "\n";
363     };
364    
365 root 1.24 aio_utime $fh_or_path, $atime, $mtime, $callback->($status)
366     Works like perl's "utime" function (including the special case of
367     $atime and $mtime being undef). Fractional times are supported if
368     the underlying syscalls support them.
369    
370     When called with a pathname, uses utimes(2) if available, otherwise
371     utime(2). If called on a file descriptor, uses futimes(2) if
372     available, otherwise returns ENOSYS, so this is not portable.
373    
374     Examples:
375    
376     # set atime and mtime to current time (basically touch(1)):
377     aio_utime "path", undef, undef;
378     # set atime to current time and mtime to beginning of the epoch:
379     aio_utime "path", time, undef; # undef==0
380    
381     aio_chown $fh_or_path, $uid, $gid, $callback->($status)
382     Works like perl's "chown" function, except that "undef" for either
383     $uid or $gid is being interpreted as "do not change" (but -1 can
384     also be used).
385    
386     Examples:
387    
388     # same as "chown root path" in the shell:
389     aio_chown "path", 0, -1;
390     # same as above:
391     aio_chown "path", 0, undef;
392    
393     aio_truncate $fh_or_path, $offset, $callback->($status)
394     Works like truncate(2) or ftruncate(2).
395    
396     aio_chmod $fh_or_path, $mode, $callback->($status)
397     Works like perl's "chmod" function.
398    
399 root 1.20 aio_unlink $pathname, $callback->($status)
400     Asynchronously unlink (delete) a file and call the callback with the
401     result code.
402    
403     aio_mknod $path, $mode, $dev, $callback->($status)
404     [EXPERIMENTAL]
405    
406     Asynchronously create a device node (or fifo). See mknod(2).
407    
408     The only (POSIX-) portable way of calling this function is:
409    
410     aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ...
411    
412     aio_link $srcpath, $dstpath, $callback->($status)
413     Asynchronously create a new link to the existing object at $srcpath
414     at the path $dstpath and call the callback with the result code.
415    
416     aio_symlink $srcpath, $dstpath, $callback->($status)
417     Asynchronously create a new symbolic link to the existing object at
418     $srcpath at the path $dstpath and call the callback with the result
419     code.
420    
421     aio_readlink $path, $callback->($link)
422     Asynchronously read the symlink specified by $path and pass it to
423     the callback. If an error occurs, nothing or undef gets passed to
424     the callback.
425    
426     aio_rename $srcpath, $dstpath, $callback->($status)
427     Asynchronously rename the object at $srcpath to $dstpath, just as
428     rename(2) and call the callback with the result code.
429    
430 root 1.23 aio_mkdir $pathname, $mode, $callback->($status)
431     Asynchronously mkdir (create) a directory and call the callback with
432     the result code. $mode will be modified by the umask at the time the
433     request is executed, so do not change your umask.
434    
435 root 1.20 aio_rmdir $pathname, $callback->($status)
436     Asynchronously rmdir (delete) a directory and call the callback with
437     the result code.
438    
439     aio_readdir $pathname, $callback->($entries)
440     Unlike the POSIX call of the same name, "aio_readdir" reads an
441     entire directory (i.e. opendir + readdir + closedir). The entries
442     will not be sorted, and will NOT include the "." and ".." entries.
443    
444     The callback a single argument which is either "undef" or an
445     array-ref with the filenames.
446    
447 root 1.22 aio_load $path, $data, $callback->($status)
448     This is a composite request that tries to fully load the given file
449     into memory. Status is the same as with aio_read.
450    
451 root 1.20 aio_copy $srcpath, $dstpath, $callback->($status)
452     Try to copy the *file* (directories not supported as either source
453     or destination) from $srcpath to $dstpath and call the callback with
454     the 0 (error) or -1 ok.
455    
456     This is a composite request that it creates the destination file
457     with mode 0200 and copies the contents of the source file into it
458     using "aio_sendfile", followed by restoring atime, mtime, access
459     mode and uid/gid, in that order.
460    
461     If an error occurs, the partial destination file will be unlinked,
462     if possible, except when setting atime, mtime, access mode and
463     uid/gid, where errors are being ignored.
464    
465     aio_move $srcpath, $dstpath, $callback->($status)
466     Try to move the *file* (directories not supported as either source
467     or destination) from $srcpath to $dstpath and call the callback with
468     the 0 (error) or -1 ok.
469    
470     This is a composite request that tries to rename(2) the file first.
471     If rename files with "EXDEV", it copies the file with "aio_copy"
472     and, if that is successful, unlinking the $srcpath.
473    
474     aio_scandir $path, $maxreq, $callback->($dirs, $nondirs)
475     Scans a directory (similar to "aio_readdir") but additionally tries
476     to efficiently separate the entries of directory $path into two sets
477     of names, directories you can recurse into (directories), and ones
478     you cannot recurse into (everything else, including symlinks to
479     directories).
480    
481     "aio_scandir" is a composite request that creates of many sub
482     requests_ $maxreq specifies the maximum number of outstanding aio
483     requests that this function generates. If it is "<= 0", then a
484     suitable default will be chosen (currently 4).
485    
486     On error, the callback is called without arguments, otherwise it
487     receives two array-refs with path-relative entry names.
488    
489     Example:
490    
491     aio_scandir $dir, 0, sub {
492     my ($dirs, $nondirs) = @_;
493     print "real directories: @$dirs\n";
494     print "everything else: @$nondirs\n";
495     };
496    
497     Implementation notes.
498    
499     The "aio_readdir" cannot be avoided, but "stat()"'ing every entry
500     can.
501    
502     After reading the directory, the modification time, size etc. of the
503     directory before and after the readdir is checked, and if they match
504     (and isn't the current time), the link count will be used to decide
505     how many entries are directories (if >= 2). Otherwise, no knowledge
506     of the number of subdirectories will be assumed.
507    
508     Then entries will be sorted into likely directories (everything
509     without a non-initial dot currently) and likely non-directories
510     (everything else). Then every entry plus an appended "/." will be
511     "stat"'ed, likely directories first. If that succeeds, it assumes
512     that the entry is a directory or a symlink to directory (which will
513     be checked seperately). This is often faster than stat'ing the entry
514     itself because filesystems might detect the type of the entry
515     without reading the inode data (e.g. ext2fs filetype feature).
516    
517     If the known number of directories (link count - 2) has been
518     reached, the rest of the entries is assumed to be non-directories.
519    
520     This only works with certainty on POSIX (= UNIX) filesystems, which
521     fortunately are the vast majority of filesystems around.
522    
523     It will also likely work on non-POSIX filesystems with reduced
524     efficiency as those tend to return 0 or 1 as link counts, which
525     disables the directory counting heuristic.
526    
527 root 1.23 aio_rmtree $path, $callback->($status)
528     Delete a directory tree starting (and including) $path, return the
529     status of the final "rmdir" only. This is a composite request that
530     uses "aio_scandir" to recurse into and rmdir directories, and unlink
531     everything else.
532    
533 root 1.28 aio_sync $callback->($status)
534     Asynchronously call sync and call the callback when finished.
535    
536 root 1.20 aio_fsync $fh, $callback->($status)
537     Asynchronously call fsync on the given filehandle and call the
538     callback with the fsync result code.
539    
540     aio_fdatasync $fh, $callback->($status)
541     Asynchronously call fdatasync on the given filehandle and call the
542     callback with the fdatasync result code.
543    
544     If this call isn't available because your OS lacks it or it couldn't
545     be detected, it will be emulated by calling "fsync" instead.
546    
547 root 1.28 aio_pathsync $path, $callback->($status)
548     This request tries to open, fsync and close the given path. This is
549     a composite request intended tosync directories after directory
550     operations (E.g. rename). This might not work on all operating
551     systems or have any specific effect, but usually it makes sure that
552     directory changes get written to disc. It works for anything that
553     can be opened for read-only, not just directories.
554    
555     Passes 0 when everything went ok, and -1 on error.
556    
557 root 1.20 aio_group $callback->(...)
558     This is a very special aio request: Instead of doing something, it
559     is a container for other aio requests, which is useful if you want
560     to bundle many requests into a single, composite, request with a
561     definite callback and the ability to cancel the whole request with
562     its subrequests.
563    
564     Returns an object of class IO::AIO::GRP. See its documentation below
565     for more info.
566    
567     Example:
568    
569     my $grp = aio_group sub {
570     print "all stats done\n";
571     };
572    
573     add $grp
574     (aio_stat ...),
575     (aio_stat ...),
576     ...;
577    
578     aio_nop $callback->()
579     This is a special request - it does nothing in itself and is only
580     used for side effects, such as when you want to add a dummy request
581     to a group so that finishing the requests in the group depends on
582     executing the given code.
583    
584     While this request does nothing, it still goes through the execution
585     phase and still requires a worker thread. Thus, the callback will
586     not be executed immediately but only after other requests in the
587     queue have entered their execution phase. This can be used to
588     measure request latency.
589    
590     IO::AIO::aio_busy $fractional_seconds, $callback->() *NOT EXPORTED*
591     Mainly used for debugging and benchmarking, this aio request puts
592     one of the request workers to sleep for the given time.
593    
594     While it is theoretically handy to have simple I/O scheduling
595     requests like sleep and file handle readable/writable, the overhead
596     this creates is immense (it blocks a thread for a long time) so do
597     not use this function except to put your application under
598     artificial I/O pressure.
599 root 1.18
600     IO::AIO::REQ CLASS
601 root 1.20 All non-aggregate "aio_*" functions return an object of this class when
602     called in non-void context.
603 root 1.18
604 root 1.20 cancel $req
605     Cancels the request, if possible. Has the effect of skipping
606     execution when entering the execute state and skipping calling the
607     callback when entering the the result state, but will leave the
608     request otherwise untouched. That means that requests that currently
609     execute will not be stopped and resources held by the request will
610     not be freed prematurely.
611 root 1.18
612 root 1.20 cb $req $callback->(...)
613     Replace (or simply set) the callback registered to the request.
614 root 1.18
615     IO::AIO::GRP CLASS
616 root 1.20 This class is a subclass of IO::AIO::REQ, so all its methods apply to
617     objects of this class, too.
618 root 1.18
619 root 1.20 A IO::AIO::GRP object is a special request that can contain multiple
620     other aio requests.
621 root 1.18
622 root 1.20 You create one by calling the "aio_group" constructing function with a
623     callback that will be called when all contained requests have entered
624     the "done" state:
625 root 1.18
626 root 1.20 my $grp = aio_group sub {
627     print "all requests are done\n";
628     };
629    
630     You add requests by calling the "add" method with one or more
631     "IO::AIO::REQ" objects:
632    
633     $grp->add (aio_unlink "...");
634    
635     add $grp aio_stat "...", sub {
636     $_[0] or return $grp->result ("error");
637 root 1.1
638 root 1.20 # add another request dynamically, if first succeeded
639     add $grp aio_open "...", sub {
640     $grp->result ("ok");
641     };
642     };
643 root 1.18
644 root 1.20 This makes it very easy to create composite requests (see the source of
645     "aio_move" for an application) that work and feel like simple requests.
646 root 1.18
647 root 1.28 * The IO::AIO::GRP objects will be cleaned up during calls to
648     "IO::AIO::poll_cb", just like any other request.
649    
650     * They can be canceled like any other request. Canceling will cancel
651     not only the request itself, but also all requests it contains.
652    
653     * They can also can also be added to other IO::AIO::GRP objects.
654    
655     * You must not add requests to a group from within the group callback
656     (or any later time).
657 root 1.20
658     Their lifetime, simplified, looks like this: when they are empty, they
659     will finish very quickly. If they contain only requests that are in the
660     "done" state, they will also finish. Otherwise they will continue to
661     exist.
662    
663     That means after creating a group you have some time to add requests.
664     And in the callbacks of those requests, you can add further requests to
665     the group. And only when all those requests have finished will the the
666     group itself finish.
667    
668     add $grp ...
669     $grp->add (...)
670     Add one or more requests to the group. Any type of IO::AIO::REQ can
671     be added, including other groups, as long as you do not create
672     circular dependencies.
673    
674     Returns all its arguments.
675    
676     $grp->cancel_subs
677     Cancel all subrequests and clears any feeder, but not the group
678     request itself. Useful when you queued a lot of events but got a
679     result early.
680    
681     $grp->result (...)
682     Set the result value(s) that will be passed to the group callback
683 root 1.28 when all subrequests have finished and set the groups errno to the
684 root 1.20 current value of errno (just like calling "errno" without an error
685     number). By default, no argument will be passed and errno is zero.
686    
687     $grp->errno ([$errno])
688     Sets the group errno value to $errno, or the current value of errno
689     when the argument is missing.
690    
691     Every aio request has an associated errno value that is restored
692     when the callback is invoked. This method lets you change this value
693     from its default (0).
694    
695     Calling "result" will also set errno, so make sure you either set $!
696     before the call to "result", or call c<errno> after it.
697    
698     feed $grp $callback->($grp)
699     Sets a feeder/generator on this group: every group can have an
700     attached generator that generates requests if idle. The idea behind
701     this is that, although you could just queue as many requests as you
702     want in a group, this might starve other requests for a potentially
703     long time. For example, "aio_scandir" might generate hundreds of
704     thousands "aio_stat" requests, delaying any later requests for a
705     long time.
706    
707     To avoid this, and allow incremental generation of requests, you can
708     instead a group and set a feeder on it that generates those
709     requests. The feed callback will be called whenever there are few
710     enough (see "limit", below) requests active in the group itself and
711     is expected to queue more requests.
712    
713     The feed callback can queue as many requests as it likes (i.e. "add"
714     does not impose any limits).
715    
716     If the feed does not queue more requests when called, it will be
717     automatically removed from the group.
718    
719     If the feed limit is 0, it will be set to 2 automatically.
720    
721     Example:
722    
723     # stat all files in @files, but only ever use four aio requests concurrently:
724    
725     my $grp = aio_group sub { print "finished\n" };
726     limit $grp 4;
727     feed $grp sub {
728     my $file = pop @files
729     or return;
730 root 1.18
731 root 1.20 add $grp aio_stat $file, sub { ... };
732 root 1.1 };
733    
734 root 1.20 limit $grp $num
735     Sets the feeder limit for the group: The feeder will be called
736     whenever the group contains less than this many requests.
737 root 1.18
738 root 1.20 Setting the limit to 0 will pause the feeding process.
739 root 1.17
740 root 1.18 SUPPORT FUNCTIONS
741 root 1.19 EVENT PROCESSING AND EVENT LOOP INTEGRATION
742 root 1.20 $fileno = IO::AIO::poll_fileno
743     Return the *request result pipe file descriptor*. This filehandle
744     must be polled for reading by some mechanism outside this module
745     (e.g. Event or select, see below or the SYNOPSIS). If the pipe
746     becomes readable you have to call "poll_cb" to check the results.
747    
748     See "poll_cb" for an example.
749    
750     IO::AIO::poll_cb
751     Process some outstanding events on the result pipe. You have to call
752     this regularly. Returns the number of events processed. Returns
753     immediately when no events are outstanding. The amount of events
754     processed depends on the settings of "IO::AIO::max_poll_req" and
755     "IO::AIO::max_poll_time".
756    
757     If not all requests were processed for whatever reason, the
758     filehandle will still be ready when "poll_cb" returns.
759    
760     Example: Install an Event watcher that automatically calls
761     IO::AIO::poll_cb with high priority:
762    
763     Event->io (fd => IO::AIO::poll_fileno,
764     poll => 'r', async => 1,
765     cb => \&IO::AIO::poll_cb);
766    
767     IO::AIO::max_poll_reqs $nreqs
768     IO::AIO::max_poll_time $seconds
769     These set the maximum number of requests (default 0, meaning
770     infinity) that are being processed by "IO::AIO::poll_cb" in one
771     call, respectively the maximum amount of time (default 0, meaning
772     infinity) spent in "IO::AIO::poll_cb" to process requests (more
773     correctly the mininum amount of time "poll_cb" is allowed to use).
774    
775     Setting "max_poll_time" to a non-zero value creates an overhead of
776     one syscall per request processed, which is not normally a problem
777     unless your callbacks are really really fast or your OS is really
778     really slow (I am not mentioning Solaris here). Using
779     "max_poll_reqs" incurs no overhead.
780    
781     Setting these is useful if you want to ensure some level of
782     interactiveness when perl is not fast enough to process all requests
783     in time.
784    
785     For interactive programs, values such as 0.01 to 0.1 should be fine.
786 root 1.4
787 root 1.20 Example: Install an Event watcher that automatically calls
788     IO::AIO::poll_cb with low priority, to ensure that other parts of
789     the program get the CPU sometimes even under high AIO load.
790 root 1.4
791 root 1.20 # try not to spend much more than 0.1s in poll_cb
792     IO::AIO::max_poll_time 0.1;
793 root 1.4
794 root 1.20 # use a low priority so other tasks have priority
795     Event->io (fd => IO::AIO::poll_fileno,
796     poll => 'r', nice => 1,
797     cb => &IO::AIO::poll_cb);
798    
799     IO::AIO::poll_wait
800 root 1.21 If there are any outstanding requests and none of them in the result
801     phase, wait till the result filehandle becomes ready for reading
802     (simply does a "select" on the filehandle. This is useful if you
803     want to synchronously wait for some requests to finish).
804 root 1.20
805     See "nreqs" for an example.
806    
807     IO::AIO::poll
808     Waits until some requests have been handled.
809    
810 root 1.21 Returns the number of requests processed, but is otherwise strictly
811     equivalent to:
812 root 1.20
813     IO::AIO::poll_wait, IO::AIO::poll_cb
814    
815     IO::AIO::flush
816     Wait till all outstanding AIO requests have been handled.
817    
818     Strictly equivalent to:
819    
820     IO::AIO::poll_wait, IO::AIO::poll_cb
821     while IO::AIO::nreqs;
822 root 1.6
823 root 1.19 CONTROLLING THE NUMBER OF THREADS
824 root 1.20 IO::AIO::min_parallel $nthreads
825     Set the minimum number of AIO threads to $nthreads. The current
826     default is 8, which means eight asynchronous operations can execute
827     concurrently at any one time (the number of outstanding requests,
828     however, is unlimited).
829    
830     IO::AIO starts threads only on demand, when an AIO request is queued
831     and no free thread exists. Please note that queueing up a hundred
832     requests can create demand for a hundred threads, even if it turns
833     out that everything is in the cache and could have been processed
834     faster by a single thread.
835    
836     It is recommended to keep the number of threads relatively low, as
837     some Linux kernel versions will scale negatively with the number of
838     threads (higher parallelity => MUCH higher latency). With current
839     Linux 2.6 versions, 4-32 threads should be fine.
840    
841     Under most circumstances you don't need to call this function, as
842     the module selects a default that is suitable for low to moderate
843     load.
844    
845     IO::AIO::max_parallel $nthreads
846     Sets the maximum number of AIO threads to $nthreads. If more than
847     the specified number of threads are currently running, this function
848     kills them. This function blocks until the limit is reached.
849    
850     While $nthreads are zero, aio requests get queued but not executed
851     until the number of threads has been increased again.
852    
853     This module automatically runs "max_parallel 0" at program end, to
854     ensure that all threads are killed and that there are no outstanding
855     requests.
856    
857     Under normal circumstances you don't need to call this function.
858    
859     IO::AIO::max_idle $nthreads
860     Limit the number of threads (default: 4) that are allowed to idle
861     (i.e., threads that did not get a request to process within 10
862     seconds). That means if a thread becomes idle while $nthreads other
863     threads are also idle, it will free its resources and exit.
864    
865     This is useful when you allow a large number of threads (e.g. 100 or
866     1000) to allow for extremely high load situations, but want to free
867     resources under normal circumstances (1000 threads can easily
868     consume 30MB of RAM).
869    
870     The default is probably ok in most situations, especially if thread
871     creation is fast. If thread creation is very slow on your system you
872     might want to use larger values.
873    
874 root 1.30 IO::AIO::max_outstanding $maxreqs
875 root 1.20 This is a very bad function to use in interactive programs because
876     it blocks, and a bad way to reduce concurrency because it is
877     inexact: Better use an "aio_group" together with a feed callback.
878    
879 root 1.25 Sets the maximum number of outstanding requests to $nreqs. If you do
880 root 1.20 queue up more than this number of requests, the next call to the
881     "poll_cb" (and "poll_some" and other functions calling "poll_cb")
882     function will block until the limit is no longer exceeded.
883    
884     The default value is very large, so there is no practical limit on
885     the number of outstanding requests.
886    
887     You can still queue as many requests as you want. Therefore,
888 root 1.30 "max_outstanding" is mainly useful in simple scripts (with low
889 root 1.20 values) or as a stop gap to shield against fatal memory overflow
890     (with large values).
891 root 1.1
892 root 1.19 STATISTICAL INFORMATION
893 root 1.20 IO::AIO::nreqs
894     Returns the number of requests currently in the ready, execute or
895     pending states (i.e. for which their callback has not been invoked
896     yet).
897    
898     Example: wait till there are no outstanding requests anymore:
899    
900     IO::AIO::poll_wait, IO::AIO::poll_cb
901     while IO::AIO::nreqs;
902    
903     IO::AIO::nready
904     Returns the number of requests currently in the ready state (not yet
905     executed).
906    
907     IO::AIO::npending
908     Returns the number of requests currently in the pending state
909     (executed, but not yet processed by poll_cb).
910 root 1.19
911 root 1.9 FORK BEHAVIOUR
912 root 1.20 This module should do "the right thing" when the process using it forks:
913 root 1.18
914 root 1.20 Before the fork, IO::AIO enters a quiescent state where no requests can
915     be added in other threads and no results will be processed. After the
916     fork the parent simply leaves the quiescent state and continues
917     request/result processing, while the child frees the request/result
918     queue (so that the requests started before the fork will only be handled
919     in the parent). Threads will be started on demand until the limit set in
920     the parent process has been reached again.
921    
922     In short: the parent will, after a short pause, continue as if fork had
923     not been called, while the child will act as if IO::AIO has not been
924     used yet.
925 root 1.18
926     MEMORY USAGE
927 root 1.20 Per-request usage:
928 root 1.18
929 root 1.20 Each aio request uses - depending on your architecture - around 100-200
930     bytes of memory. In addition, stat requests need a stat buffer (possibly
931     a few hundred bytes), readdir requires a result buffer and so on. Perl
932     scalars and other data passed into aio requests will also be locked and
933     will consume memory till the request has entered the done state.
934    
935 root 1.25 This is not awfully much, so queuing lots of requests is not usually a
936 root 1.20 problem.
937    
938     Per-thread usage:
939    
940     In the execution phase, some aio requests require more memory for
941     temporary buffers, and each thread requires a stack and other data
942     structures (usually around 16k-128k, depending on the OS).
943 root 1.18
944     KNOWN BUGS
945 root 1.20 Known bugs will be fixed in the next release.
946 root 1.9
947 root 1.1 SEE ALSO
948 root 1.30 AnyEvent::AIO for easy integration into event loops, Coro::AIO for a
949     more natural syntax.
950 root 1.1
951     AUTHOR
952 root 1.20 Marc Lehmann <schmorp@schmorp.de>
953     http://home.schmorp.de/
954 root 1.1