ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.85 by root, Sat Oct 28 01:40:30 2006 UTC vs.
Revision 1.118 by root, Thu Nov 29 20:57:02 2007 UTC

5=head1 SYNOPSIS 5=head1 SYNOPSIS
6 6
7 use IO::AIO; 7 use IO::AIO;
8 8
9 aio_open "/etc/passwd", O_RDONLY, 0, sub { 9 aio_open "/etc/passwd", O_RDONLY, 0, sub {
10 my ($fh) = @_; 10 my $fh = shift
11 or die "/etc/passwd: $!";
11 ... 12 ...
12 }; 13 };
13 14
14 aio_unlink "/tmp/file", sub { }; 15 aio_unlink "/tmp/file", sub { };
15 16
25 $req->cancel; # cancel request if still in queue 26 $req->cancel; # cancel request if still in queue
26 27
27 my $grp = aio_group sub { print "all stats done\n" }; 28 my $grp = aio_group sub { print "all stats done\n" };
28 add $grp aio_stat "..." for ...; 29 add $grp aio_stat "..." for ...;
29 30
30 # AnyEvent integration 31 # AnyEvent integration (EV, Event, Glib, Tk, urxvt, pureperl...)
31 open my $fh, "<&=" . IO::AIO::poll_fileno or die "$!"; 32 open my $fh, "<&=" . IO::AIO::poll_fileno or die "$!";
32 my $w = AnyEvent->io (fh => $fh, poll => 'r', cb => sub { IO::AIO::poll_cb }); 33 my $w = AnyEvent->io (fh => $fh, poll => 'r', cb => sub { IO::AIO::poll_cb });
34
35 # EV integration
36 my $w = EV::io IO::AIO::poll_fileno, EV::READ, \&IO::AIO::poll_cb;
33 37
34 # Event integration 38 # Event integration
35 Event->io (fd => IO::AIO::poll_fileno, 39 Event->io (fd => IO::AIO::poll_fileno,
36 poll => 'r', 40 poll => 'r',
37 cb => \&IO::AIO::poll_cb); 41 cb => \&IO::AIO::poll_cb);
61etc.), but can also be used to easily do operations in parallel that are 65etc.), but can also be used to easily do operations in parallel that are
62normally done sequentially, e.g. stat'ing many files, which is much faster 66normally done sequentially, e.g. stat'ing many files, which is much faster
63on a RAID volume or over NFS when you do a number of stat operations 67on a RAID volume or over NFS when you do a number of stat operations
64concurrently. 68concurrently.
65 69
66While this works on all types of file descriptors (for example sockets), 70While most of this works on all types of file descriptors (for
67using these functions on file descriptors that support nonblocking 71example sockets), using these functions on file descriptors that
68operation (again, sockets, pipes etc.) is very inefficient. Use an event 72support nonblocking operation (again, sockets, pipes etc.) is very
69loop for that (such as the L<Event|Event> module): IO::AIO will naturally 73inefficient. Use an event loop for that (such as the L<Event|Event>
70fit into such an event loop itself. 74module): IO::AIO will naturally fit into such an event loop itself.
71 75
72In this version, a number of threads are started that execute your 76In this version, a number of threads are started that execute your
73requests and signal their completion. You don't need thread support 77requests and signal their completion. You don't need thread support
74in perl, and the threads created by this module will not be visible 78in perl, and the threads created by this module will not be visible
75to perl. In the future, this module might make use of the native aio 79to perl. In the future, this module might make use of the native aio
77not well-supported or restricted (GNU/Linux doesn't allow them on normal 81not well-supported or restricted (GNU/Linux doesn't allow them on normal
78files currently, for example), and they would only support aio_read and 82files currently, for example), and they would only support aio_read and
79aio_write, so the remaining functionality would have to be implemented 83aio_write, so the remaining functionality would have to be implemented
80using threads anyway. 84using threads anyway.
81 85
82Although the module will work with in the presence of other (Perl-) 86Although the module will work in the presence of other (Perl-) threads,
83threads, it is currently not reentrant in any way, so use appropriate 87it is currently not reentrant in any way, so use appropriate locking
84locking yourself, always call C<poll_cb> from within the same thread, or 88yourself, always call C<poll_cb> from within the same thread, or never
85never call C<poll_cb> (or other C<aio_> functions) recursively. 89call C<poll_cb> (or other C<aio_> functions) recursively.
90
91=head2 EXAMPLE
92
93This is a simple example that uses the Event module and loads
94F</etc/passwd> asynchronously:
95
96 use Fcntl;
97 use Event;
98 use IO::AIO;
99
100 # register the IO::AIO callback with Event
101 Event->io (fd => IO::AIO::poll_fileno,
102 poll => 'r',
103 cb => \&IO::AIO::poll_cb);
104
105 # queue the request to open /etc/passwd
106 aio_open "/etc/passwd", O_RDONLY, 0, sub {
107 my $fh = shift
108 or die "error while opening: $!";
109
110 # stat'ing filehandles is generally non-blocking
111 my $size = -s $fh;
112
113 # queue a request to read the file
114 my $contents;
115 aio_read $fh, 0, $size, $contents, 0, sub {
116 $_[0] == $size
117 or die "short read: $!";
118
119 close $fh;
120
121 # file contents now in $contents
122 print $contents;
123
124 # exit event loop and program
125 Event::unloop;
126 };
127 };
128
129 # possibly queue up other requests, or open GUI windows,
130 # check for sockets etc. etc.
131
132 # process events as long as there are some:
133 Event::loop;
86 134
87=head1 REQUEST ANATOMY AND LIFETIME 135=head1 REQUEST ANATOMY AND LIFETIME
88 136
89Every C<aio_*> function creates a request. which is a C data structure not 137Every C<aio_*> function creates a request. which is a C data structure not
90directly visible to Perl. 138directly visible to Perl.
132Request has reached the end of its lifetime and holds no resources anymore 180Request has reached the end of its lifetime and holds no resources anymore
133(except possibly for the Perl object, but its connection to the actual 181(except possibly for the Perl object, but its connection to the actual
134aio request is severed and calling its methods will either do nothing or 182aio request is severed and calling its methods will either do nothing or
135result in a runtime error). 183result in a runtime error).
136 184
185=back
186
137=cut 187=cut
138 188
139package IO::AIO; 189package IO::AIO;
190
191use Carp ();
140 192
141no warnings; 193no warnings;
142use strict 'vars'; 194use strict 'vars';
143 195
144use base 'Exporter'; 196use base 'Exporter';
145 197
146BEGIN { 198BEGIN {
147 our $VERSION = '2.0'; 199 our $VERSION = '2.51';
148 200
149 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat 201 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat
150 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink 202 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink
151 aio_fsync aio_fdatasync aio_readahead aio_rename aio_link aio_move 203 aio_readlink aio_fsync aio_fdatasync aio_readahead aio_rename aio_link
152 aio_copy aio_group aio_nop aio_mknod); 204 aio_move aio_copy aio_group aio_nop aio_mknod aio_load aio_rmtree aio_mkdir
205 aio_chown aio_chmod aio_utime aio_truncate);
153 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice)); 206 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice aio_block));
154 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush 207 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
155 min_parallel max_parallel nreqs nready npending); 208 min_parallel max_parallel max_idle
209 nreqs nready npending nthreads
210 max_poll_time max_poll_reqs);
156 211
157 @IO::AIO::GRP::ISA = 'IO::AIO::REQ'; 212 @IO::AIO::GRP::ISA = 'IO::AIO::REQ';
158 213
159 require XSLoader; 214 require XSLoader;
160 XSLoader::load ("IO::AIO", $VERSION); 215 XSLoader::load ("IO::AIO", $VERSION);
161} 216}
162 217
163=head1 FUNCTIONS 218=head1 FUNCTIONS
164 219
165=head2 AIO FUNCTIONS 220=head2 AIO REQUEST FUNCTIONS
166 221
167All the C<aio_*> calls are more or less thin wrappers around the syscall 222All the C<aio_*> calls are more or less thin wrappers around the syscall
168with the same name (sans C<aio_>). The arguments are similar or identical, 223with the same name (sans C<aio_>). The arguments are similar or identical,
169and they all accept an additional (and optional) C<$callback> argument 224and they all accept an additional (and optional) C<$callback> argument
170which must be a code reference. This code reference will get called with 225which must be a code reference. This code reference will get called with
173syscall has been executed asynchronously. 228syscall has been executed asynchronously.
174 229
175All functions expecting a filehandle keep a copy of the filehandle 230All functions expecting a filehandle keep a copy of the filehandle
176internally until the request has finished. 231internally until the request has finished.
177 232
178All requests return objects of type L<IO::AIO::REQ> that allow further 233All functions return request objects of type L<IO::AIO::REQ> that allow
179manipulation of those requests while they are in-flight. 234further manipulation of those requests while they are in-flight.
180 235
181The pathnames you pass to these routines I<must> be absolute and 236The pathnames you pass to these routines I<must> be absolute and
182encoded in byte form. The reason for the former is that at the time the 237encoded as octets. The reason for the former is that at the time the
183request is being executed, the current working directory could have 238request is being executed, the current working directory could have
184changed. Alternatively, you can make sure that you never change the 239changed. Alternatively, you can make sure that you never change the
185current working directory. 240current working directory anywhere in the program and then use relative
241paths.
186 242
187To encode pathnames to byte form, either make sure you either: a) 243To encode pathnames as octets, either make sure you either: a) always pass
188always pass in filenames you got from outside (command line, readdir 244in filenames you got from outside (command line, readdir etc.) without
189etc.), b) are ASCII or ISO 8859-1, c) use the Encode module and encode 245tinkering, b) are ASCII or ISO 8859-1, c) use the Encode module and encode
190your pathnames to the locale (or other) encoding in effect in the user 246your pathnames to the locale (or other) encoding in effect in the user
191environment, d) use Glib::filename_from_unicode on unicode filenames or e) 247environment, d) use Glib::filename_from_unicode on unicode filenames or e)
192use something else. 248use something else to ensure your scalar has the correct contents.
249
250This works, btw. independent of the internal UTF-8 bit, which IO::AIO
251handles correctly wether it is set or not.
193 252
194=over 4 253=over 4
195 254
196=item $prev_pri = aioreq_pri [$pri] 255=item $prev_pri = aioreq_pri [$pri]
197 256
217 aio_read $_[0], ..., sub { 276 aio_read $_[0], ..., sub {
218 ... 277 ...
219 }; 278 };
220 }; 279 };
221 280
281
222=item aioreq_nice $pri_adjust 282=item aioreq_nice $pri_adjust
223 283
224Similar to C<aioreq_pri>, but subtracts the given value from the current 284Similar to C<aioreq_pri>, but subtracts the given value from the current
225priority, so effects are cumulative. 285priority, so the effect is cumulative.
286
226 287
227=item aio_open $pathname, $flags, $mode, $callback->($fh) 288=item aio_open $pathname, $flags, $mode, $callback->($fh)
228 289
229Asynchronously open or create a file and call the callback with a newly 290Asynchronously open or create a file and call the callback with a newly
230created filehandle for the file. 291created filehandle for the file.
236list. They are the same as used by C<sysopen>. 297list. They are the same as used by C<sysopen>.
237 298
238Likewise, C<$mode> specifies the mode of the newly created file, if it 299Likewise, C<$mode> specifies the mode of the newly created file, if it
239didn't exist and C<O_CREAT> has been given, just like perl's C<sysopen>, 300didn't exist and C<O_CREAT> has been given, just like perl's C<sysopen>,
240except that it is mandatory (i.e. use C<0> if you don't create new files, 301except that it is mandatory (i.e. use C<0> if you don't create new files,
241and C<0666> or C<0777> if you do). 302and C<0666> or C<0777> if you do). Note that the C<$mode> will be modified
303by the umask in effect then the request is being executed, so better never
304change the umask.
242 305
243Example: 306Example:
244 307
245 aio_open "/etc/passwd", O_RDONLY, 0, sub { 308 aio_open "/etc/passwd", O_RDONLY, 0, sub {
246 if ($_[0]) { 309 if ($_[0]) {
249 } else { 312 } else {
250 die "open failed: $!\n"; 313 die "open failed: $!\n";
251 } 314 }
252 }; 315 };
253 316
317
254=item aio_close $fh, $callback->($status) 318=item aio_close $fh, $callback->($status)
255 319
256Asynchronously close a file and call the callback with the result 320Asynchronously close a file and call the callback with the result
257code. I<WARNING:> although accepted, you should not pass in a perl 321code.
258filehandle here, as perl will likely close the file descriptor another
259time when the filehandle is destroyed. Normally, you can safely call perls
260C<close> or just let filehandles go out of scope.
261 322
262This is supposed to be a bug in the API, so that might change. It's 323Unfortunately, you can't do this to perl. Perl I<insists> very strongly on
263therefore best to avoid this function. 324closing the file descriptor associated with the filehandle itself. Here is
325what aio_close will try:
326
327 1. dup()licate the fd
328 2. asynchronously close() the duplicated fd
329 3. dup()licate the fd once more
330 4. let perl close() the filehandle
331 5. asynchronously close the duplicated fd
332
333The idea is that the first close() flushes stuff to disk that closing an
334fd will flush, so when perl closes the fd, nothing much will need to be
335flushed. The second async. close() will then flush stuff to disk that
336closing the last fd to the file will flush.
337
338Just FYI, SuSv3 has this to say on close:
339
340 All outstanding record locks owned by the process on the file
341 associated with the file descriptor shall be removed.
342
343 If fildes refers to a socket, close() shall cause the socket to be
344 destroyed. ... close() shall block for up to the current linger
345 interval until all data is transmitted.
346 [this actually sounds like a specification bug, but who knows]
347
348And at least Linux additionally actually flushes stuff on every close,
349even when the file itself is still open.
350
351Sounds enourmously inefficient and complicated? Yes... please show me how
352to nuke perl's fd out of existence...
353
354=cut
355
356sub aio_close($;$) {
357 aio_block {
358 my ($fh, $cb) = @_;
359
360 my $pri = aioreq_pri;
361 my $grp = aio_group $cb;
362
363 my $fd = fileno $fh;
364
365 defined $fd or Carp::croak "aio_close called with fd-less filehandle";
366
367 # if the dups fail we will simply get EBADF
368 my $fd2 = _dup $fd;
369 aioreq_pri $pri;
370 add $grp _aio_close $fd2, sub {
371 my $fd2 = _dup $fd;
372 close $fh;
373 aioreq_pri $pri;
374 add $grp _aio_close $fd2, sub {
375 $grp->result ($_[0]);
376 };
377 };
378
379 $grp
380 }
381}
382
264 383
265=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 384=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
266 385
267=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 386=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
268 387
269Reads or writes C<length> bytes from the specified C<fh> and C<offset> 388Reads or writes C<$length> bytes from the specified C<$fh> and C<$offset>
270into the scalar given by C<data> and offset C<dataoffset> and calls the 389into the scalar given by C<$data> and offset C<$dataoffset> and calls the
271callback without the actual number of bytes read (or -1 on error, just 390callback without the actual number of bytes read (or -1 on error, just
272like the syscall). 391like the syscall).
273 392
393If C<$offset> is undefined, then the current file descriptor offset will
394be used (and updated), otherwise the file descriptor offset will not be
395changed by these calls.
396
397If C<$length> is undefined in C<aio_write>, use the remaining length of C<$data>.
398
399If C<$dataoffset> is less than zero, it will be counted from the end of
400C<$data>.
401
274The C<$data> scalar I<MUST NOT> be modified in any way while the request 402The C<$data> scalar I<MUST NOT> be modified in any way while the request
275is outstanding. Modifying it can result in segfaults or WW3 (if the 403is outstanding. Modifying it can result in segfaults or World War III (if
276necessary/optional hardware is installed). 404the necessary/optional hardware is installed).
277 405
278Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at 406Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at
279offset C<0> within the scalar: 407offset C<0> within the scalar:
280 408
281 aio_read $fh, 7, 15, $buffer, 0, sub { 409 aio_read $fh, 7, 15, $buffer, 0, sub {
282 $_[0] > 0 or die "read error: $!"; 410 $_[0] > 0 or die "read error: $!";
283 print "read $_[0] bytes: <$buffer>\n"; 411 print "read $_[0] bytes: <$buffer>\n";
284 }; 412 };
413
285 414
286=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval) 415=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval)
287 416
288Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts 417Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts
289reading at byte offset C<$in_offset>, and starts writing at the current 418reading at byte offset C<$in_offset>, and starts writing at the current
303C<$in_fh> than are written, and there is no way to find out how many 432C<$in_fh> than are written, and there is no way to find out how many
304bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only 433bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only
305provides the number of bytes written to C<$out_fh>. Only if the result 434provides the number of bytes written to C<$out_fh>. Only if the result
306value equals C<$length> one can assume that C<$length> bytes have been 435value equals C<$length> one can assume that C<$length> bytes have been
307read. 436read.
437
308 438
309=item aio_readahead $fh,$offset,$length, $callback->($retval) 439=item aio_readahead $fh,$offset,$length, $callback->($retval)
310 440
311C<aio_readahead> populates the page cache with data from a file so that 441C<aio_readahead> populates the page cache with data from a file so that
312subsequent reads from that file will not block on disk I/O. The C<$offset> 442subsequent reads from that file will not block on disk I/O. The C<$offset>
318file. The current file offset of the file is left unchanged. 448file. The current file offset of the file is left unchanged.
319 449
320If that syscall doesn't exist (likely if your OS isn't Linux) it will be 450If that syscall doesn't exist (likely if your OS isn't Linux) it will be
321emulated by simply reading the data, which would have a similar effect. 451emulated by simply reading the data, which would have a similar effect.
322 452
453
323=item aio_stat $fh_or_path, $callback->($status) 454=item aio_stat $fh_or_path, $callback->($status)
324 455
325=item aio_lstat $fh, $callback->($status) 456=item aio_lstat $fh, $callback->($status)
326 457
327Works like perl's C<stat> or C<lstat> in void context. The callback will 458Works like perl's C<stat> or C<lstat> in void context. The callback will
340 aio_stat "/etc/passwd", sub { 471 aio_stat "/etc/passwd", sub {
341 $_[0] and die "stat failed: $!"; 472 $_[0] and die "stat failed: $!";
342 print "size is ", -s _, "\n"; 473 print "size is ", -s _, "\n";
343 }; 474 };
344 475
476
477=item aio_utime $fh_or_path, $atime, $mtime, $callback->($status)
478
479Works like perl's C<utime> function (including the special case of $atime
480and $mtime being undef). Fractional times are supported if the underlying
481syscalls support them.
482
483When called with a pathname, uses utimes(2) if available, otherwise
484utime(2). If called on a file descriptor, uses futimes(2) if available,
485otherwise returns ENOSYS, so this is not portable.
486
487Examples:
488
489 # set atime and mtime to current time (basically touch(1)):
490 aio_utime "path", undef, undef;
491 # set atime to current time and mtime to beginning of the epoch:
492 aio_utime "path", time, undef; # undef==0
493
494
495=item aio_chown $fh_or_path, $uid, $gid, $callback->($status)
496
497Works like perl's C<chown> function, except that C<undef> for either $uid
498or $gid is being interpreted as "do not change" (but -1 can also be used).
499
500Examples:
501
502 # same as "chown root path" in the shell:
503 aio_chown "path", 0, -1;
504 # same as above:
505 aio_chown "path", 0, undef;
506
507
508=item aio_truncate $fh_or_path, $offset, $callback->($status)
509
510Works like truncate(2) or ftruncate(2).
511
512
513=item aio_chmod $fh_or_path, $mode, $callback->($status)
514
515Works like perl's C<chmod> function.
516
517
345=item aio_unlink $pathname, $callback->($status) 518=item aio_unlink $pathname, $callback->($status)
346 519
347Asynchronously unlink (delete) a file and call the callback with the 520Asynchronously unlink (delete) a file and call the callback with the
348result code. 521result code.
349 522
523
350=item aio_mknod $path, $mode, $dev, $callback->($status) 524=item aio_mknod $path, $mode, $dev, $callback->($status)
351 525
526[EXPERIMENTAL]
527
352Asynchronously create a device node (or fifo). See mknod(2). 528Asynchronously create a device node (or fifo). See mknod(2).
353 529
354The only portable (POSIX) way of calling this function is: 530The only (POSIX-) portable way of calling this function is:
355 531
356 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ... 532 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ...
533
357 534
358=item aio_link $srcpath, $dstpath, $callback->($status) 535=item aio_link $srcpath, $dstpath, $callback->($status)
359 536
360Asynchronously create a new link to the existing object at C<$srcpath> at 537Asynchronously create a new link to the existing object at C<$srcpath> at
361the path C<$dstpath> and call the callback with the result code. 538the path C<$dstpath> and call the callback with the result code.
362 539
540
363=item aio_symlink $srcpath, $dstpath, $callback->($status) 541=item aio_symlink $srcpath, $dstpath, $callback->($status)
364 542
365Asynchronously create a new symbolic link to the existing object at C<$srcpath> at 543Asynchronously create a new symbolic link to the existing object at C<$srcpath> at
366the path C<$dstpath> and call the callback with the result code. 544the path C<$dstpath> and call the callback with the result code.
367 545
546
547=item aio_readlink $path, $callback->($link)
548
549Asynchronously read the symlink specified by C<$path> and pass it to
550the callback. If an error occurs, nothing or undef gets passed to the
551callback.
552
553
368=item aio_rename $srcpath, $dstpath, $callback->($status) 554=item aio_rename $srcpath, $dstpath, $callback->($status)
369 555
370Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as 556Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as
371rename(2) and call the callback with the result code. 557rename(2) and call the callback with the result code.
372 558
559
560=item aio_mkdir $pathname, $mode, $callback->($status)
561
562Asynchronously mkdir (create) a directory and call the callback with
563the result code. C<$mode> will be modified by the umask at the time the
564request is executed, so do not change your umask.
565
566
373=item aio_rmdir $pathname, $callback->($status) 567=item aio_rmdir $pathname, $callback->($status)
374 568
375Asynchronously rmdir (delete) a directory and call the callback with the 569Asynchronously rmdir (delete) a directory and call the callback with the
376result code. 570result code.
571
377 572
378=item aio_readdir $pathname, $callback->($entries) 573=item aio_readdir $pathname, $callback->($entries)
379 574
380Unlike the POSIX call of the same name, C<aio_readdir> reads an entire 575Unlike the POSIX call of the same name, C<aio_readdir> reads an entire
381directory (i.e. opendir + readdir + closedir). The entries will not be 576directory (i.e. opendir + readdir + closedir). The entries will not be
382sorted, and will B<NOT> include the C<.> and C<..> entries. 577sorted, and will B<NOT> include the C<.> and C<..> entries.
383 578
384The callback a single argument which is either C<undef> or an array-ref 579The callback a single argument which is either C<undef> or an array-ref
385with the filenames. 580with the filenames.
581
582
583=item aio_load $path, $data, $callback->($status)
584
585This is a composite request that tries to fully load the given file into
586memory. Status is the same as with aio_read.
587
588=cut
589
590sub aio_load($$;$) {
591 aio_block {
592 my ($path, undef, $cb) = @_;
593 my $data = \$_[1];
594
595 my $pri = aioreq_pri;
596 my $grp = aio_group $cb;
597
598 aioreq_pri $pri;
599 add $grp aio_open $path, O_RDONLY, 0, sub {
600 my $fh = shift
601 or return $grp->result (-1);
602
603 aioreq_pri $pri;
604 add $grp aio_read $fh, 0, (-s $fh), $$data, 0, sub {
605 $grp->result ($_[0]);
606 };
607 };
608
609 $grp
610 }
611}
386 612
387=item aio_copy $srcpath, $dstpath, $callback->($status) 613=item aio_copy $srcpath, $dstpath, $callback->($status)
388 614
389Try to copy the I<file> (directories not supported as either source or 615Try to copy the I<file> (directories not supported as either source or
390destination) from C<$srcpath> to C<$dstpath> and call the callback with 616destination) from C<$srcpath> to C<$dstpath> and call the callback with
400errors are being ignored. 626errors are being ignored.
401 627
402=cut 628=cut
403 629
404sub aio_copy($$;$) { 630sub aio_copy($$;$) {
631 aio_block {
405 my ($src, $dst, $cb) = @_; 632 my ($src, $dst, $cb) = @_;
406 633
407 my $pri = aioreq_pri; 634 my $pri = aioreq_pri;
408 my $grp = aio_group $cb; 635 my $grp = aio_group $cb;
409 636
410 aioreq_pri $pri; 637 aioreq_pri $pri;
411 add $grp aio_open $src, O_RDONLY, 0, sub { 638 add $grp aio_open $src, O_RDONLY, 0, sub {
412 if (my $src_fh = $_[0]) { 639 if (my $src_fh = $_[0]) {
413 my @stat = stat $src_fh; 640 my @stat = stat $src_fh;
414 641
415 aioreq_pri $pri; 642 aioreq_pri $pri;
416 add $grp aio_open $dst, O_CREAT | O_WRONLY | O_TRUNC, 0200, sub { 643 add $grp aio_open $dst, O_CREAT | O_WRONLY | O_TRUNC, 0200, sub {
417 if (my $dst_fh = $_[0]) { 644 if (my $dst_fh = $_[0]) {
418 aioreq_pri $pri; 645 aioreq_pri $pri;
419 add $grp aio_sendfile $dst_fh, $src_fh, 0, $stat[7], sub { 646 add $grp aio_sendfile $dst_fh, $src_fh, 0, $stat[7], sub {
420 if ($_[0] == $stat[7]) { 647 if ($_[0] == $stat[7]) {
421 $grp->result (0); 648 $grp->result (0);
422 close $src_fh; 649 close $src_fh;
423 650
424 # those should not normally block. should. should. 651 # those should not normally block. should. should.
425 utime $stat[8], $stat[9], $dst; 652 utime $stat[8], $stat[9], $dst;
426 chmod $stat[2] & 07777, $dst_fh; 653 chmod $stat[2] & 07777, $dst_fh;
427 chown $stat[4], $stat[5], $dst_fh; 654 chown $stat[4], $stat[5], $dst_fh;
428 close $dst_fh; 655 close $dst_fh;
429 } else { 656 } else {
430 $grp->result (-1); 657 $grp->result (-1);
431 close $src_fh; 658 close $src_fh;
432 close $dst_fh; 659 close $dst_fh;
433 660
434 aioreq $pri; 661 aioreq $pri;
435 add $grp aio_unlink $dst; 662 add $grp aio_unlink $dst;
663 }
436 } 664 };
665 } else {
666 $grp->result (-1);
437 }; 667 }
438 } else {
439 $grp->result (-1);
440 } 668 },
669
670 } else {
671 $grp->result (-1);
441 }, 672 }
442
443 } else {
444 $grp->result (-1);
445 } 673 };
674
675 $grp
446 }; 676 }
447
448 $grp
449} 677}
450 678
451=item aio_move $srcpath, $dstpath, $callback->($status) 679=item aio_move $srcpath, $dstpath, $callback->($status)
452 680
453Try to move the I<file> (directories not supported as either source or 681Try to move the I<file> (directories not supported as either source or
459that is successful, unlinking the C<$srcpath>. 687that is successful, unlinking the C<$srcpath>.
460 688
461=cut 689=cut
462 690
463sub aio_move($$;$) { 691sub aio_move($$;$) {
692 aio_block {
464 my ($src, $dst, $cb) = @_; 693 my ($src, $dst, $cb) = @_;
465 694
466 my $pri = aioreq_pri; 695 my $pri = aioreq_pri;
467 my $grp = aio_group $cb; 696 my $grp = aio_group $cb;
468 697
469 aioreq_pri $pri; 698 aioreq_pri $pri;
470 add $grp aio_rename $src, $dst, sub { 699 add $grp aio_rename $src, $dst, sub {
471 if ($_[0] && $! == EXDEV) { 700 if ($_[0] && $! == EXDEV) {
472 aioreq_pri $pri; 701 aioreq_pri $pri;
473 add $grp aio_copy $src, $dst, sub { 702 add $grp aio_copy $src, $dst, sub {
703 $grp->result ($_[0]);
704
705 if (!$_[0]) {
706 aioreq_pri $pri;
707 add $grp aio_unlink $src;
708 }
709 };
710 } else {
474 $grp->result ($_[0]); 711 $grp->result ($_[0]);
475
476 if (!$_[0]) {
477 aioreq_pri $pri;
478 add $grp aio_unlink $src;
479 }
480 }; 712 }
481 } else {
482 $grp->result ($_[0]);
483 } 713 };
714
715 $grp
484 }; 716 }
485
486 $grp
487} 717}
488 718
489=item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs) 719=item aio_scandir $path, $maxreq, $callback->($dirs, $nondirs)
490 720
491Scans a directory (similar to C<aio_readdir>) but additionally tries to 721Scans a directory (similar to C<aio_readdir>) but additionally tries to
538as those tend to return 0 or 1 as link counts, which disables the 768as those tend to return 0 or 1 as link counts, which disables the
539directory counting heuristic. 769directory counting heuristic.
540 770
541=cut 771=cut
542 772
543sub aio_scandir($$$) { 773sub aio_scandir($$;$) {
774 aio_block {
544 my ($path, $maxreq, $cb) = @_; 775 my ($path, $maxreq, $cb) = @_;
545 776
546 my $pri = aioreq_pri; 777 my $pri = aioreq_pri;
547 778
548 my $grp = aio_group $cb; 779 my $grp = aio_group $cb;
549 780
550 $maxreq = 4 if $maxreq <= 0; 781 $maxreq = 4 if $maxreq <= 0;
551 782
552 # stat once 783 # stat once
553 aioreq_pri $pri;
554 add $grp aio_stat $path, sub {
555 return $grp->result () if $_[0];
556 my $now = time;
557 my $hash1 = join ":", (stat _)[0,1,3,7,9];
558
559 # read the directory entries
560 aioreq_pri $pri; 784 aioreq_pri $pri;
561 add $grp aio_readdir $path, sub { 785 add $grp aio_stat $path, sub {
562 my $entries = shift
563 or return $grp->result (); 786 return $grp->result () if $_[0];
787 my $now = time;
788 my $hash1 = join ":", (stat _)[0,1,3,7,9];
564 789
565 # stat the dir another time 790 # read the directory entries
566 aioreq_pri $pri; 791 aioreq_pri $pri;
792 add $grp aio_readdir $path, sub {
793 my $entries = shift
794 or return $grp->result ();
795
796 # stat the dir another time
797 aioreq_pri $pri;
567 add $grp aio_stat $path, sub { 798 add $grp aio_stat $path, sub {
568 my $hash2 = join ":", (stat _)[0,1,3,7,9]; 799 my $hash2 = join ":", (stat _)[0,1,3,7,9];
569 800
570 my $ndirs; 801 my $ndirs;
571 802
572 # take the slow route if anything looks fishy 803 # take the slow route if anything looks fishy
573 if ($hash1 ne $hash2 or (stat _)[9] == $now) { 804 if ($hash1 ne $hash2 or (stat _)[9] == $now) {
574 $ndirs = -1; 805 $ndirs = -1;
575 } else { 806 } else {
576 # if nlink == 2, we are finished 807 # if nlink == 2, we are finished
577 # on non-posix-fs's, we rely on nlink < 2 808 # on non-posix-fs's, we rely on nlink < 2
578 $ndirs = (stat _)[3] - 2 809 $ndirs = (stat _)[3] - 2
579 or return $grp->result ([], $entries); 810 or return $grp->result ([], $entries);
580 } 811 }
581 812
582 # sort into likely dirs and likely nondirs 813 # sort into likely dirs and likely nondirs
583 # dirs == files without ".", short entries first 814 # dirs == files without ".", short entries first
584 $entries = [map $_->[0], 815 $entries = [map $_->[0],
585 sort { $b->[1] cmp $a->[1] } 816 sort { $b->[1] cmp $a->[1] }
586 map [$_, sprintf "%s%04d", (/.\./ ? "1" : "0"), length], 817 map [$_, sprintf "%s%04d", (/.\./ ? "1" : "0"), length],
587 @$entries]; 818 @$entries];
588 819
589 my (@dirs, @nondirs); 820 my (@dirs, @nondirs);
590 821
591 my $statgrp = add $grp aio_group sub { 822 my $statgrp = add $grp aio_group sub {
592 $grp->result (\@dirs, \@nondirs); 823 $grp->result (\@dirs, \@nondirs);
593 }; 824 };
594 825
595 limit $statgrp $maxreq; 826 limit $statgrp $maxreq;
596 feed $statgrp sub { 827 feed $statgrp sub {
597 return unless @$entries; 828 return unless @$entries;
598 my $entry = pop @$entries; 829 my $entry = pop @$entries;
599 830
600 aioreq_pri $pri; 831 aioreq_pri $pri;
601 add $statgrp aio_stat "$path/$entry/.", sub { 832 add $statgrp aio_stat "$path/$entry/.", sub {
602 if ($_[0] < 0) { 833 if ($_[0] < 0) {
603 push @nondirs, $entry; 834 push @nondirs, $entry;
604 } else { 835 } else {
605 # need to check for real directory 836 # need to check for real directory
606 aioreq_pri $pri; 837 aioreq_pri $pri;
607 add $statgrp aio_lstat "$path/$entry", sub { 838 add $statgrp aio_lstat "$path/$entry", sub {
608 if (-d _) { 839 if (-d _) {
609 push @dirs, $entry; 840 push @dirs, $entry;
610 841
611 unless (--$ndirs) { 842 unless (--$ndirs) {
612 push @nondirs, @$entries; 843 push @nondirs, @$entries;
613 feed $statgrp; 844 feed $statgrp;
845 }
846 } else {
847 push @nondirs, $entry;
614 } 848 }
615 } else {
616 push @nondirs, $entry;
617 } 849 }
618 } 850 }
619 } 851 };
620 }; 852 };
621 }; 853 };
622 }; 854 };
623 }; 855 };
856
857 $grp
624 }; 858 }
859}
625 860
861=item aio_rmtree $path, $callback->($status)
862
863Delete a directory tree starting (and including) C<$path>, return the
864status of the final C<rmdir> only. This is a composite request that
865uses C<aio_scandir> to recurse into and rmdir directories, and unlink
866everything else.
867
868=cut
869
870sub aio_rmtree;
871sub aio_rmtree($;$) {
872 aio_block {
873 my ($path, $cb) = @_;
874
875 my $pri = aioreq_pri;
876 my $grp = aio_group $cb;
877
878 aioreq_pri $pri;
879 add $grp aio_scandir $path, 0, sub {
880 my ($dirs, $nondirs) = @_;
881
882 my $dirgrp = aio_group sub {
883 add $grp aio_rmdir $path, sub {
884 $grp->result ($_[0]);
885 };
886 };
887
888 (aioreq_pri $pri), add $dirgrp aio_rmtree "$path/$_" for @$dirs;
889 (aioreq_pri $pri), add $dirgrp aio_unlink "$path/$_" for @$nondirs;
890
891 add $grp $dirgrp;
892 };
893
626 $grp 894 $grp
895 }
627} 896}
628 897
629=item aio_fsync $fh, $callback->($status) 898=item aio_fsync $fh, $callback->($status)
630 899
631Asynchronously call fsync on the given filehandle and call the callback 900Asynchronously call fsync on the given filehandle and call the callback
845 1114
846=back 1115=back
847 1116
848=head2 SUPPORT FUNCTIONS 1117=head2 SUPPORT FUNCTIONS
849 1118
1119=head3 EVENT PROCESSING AND EVENT LOOP INTEGRATION
1120
850=over 4 1121=over 4
851 1122
852=item $fileno = IO::AIO::poll_fileno 1123=item $fileno = IO::AIO::poll_fileno
853 1124
854Return the I<request result pipe file descriptor>. This filehandle must be 1125Return the I<request result pipe file descriptor>. This filehandle must be
858 1129
859See C<poll_cb> for an example. 1130See C<poll_cb> for an example.
860 1131
861=item IO::AIO::poll_cb 1132=item IO::AIO::poll_cb
862 1133
863Process all outstanding events on the result pipe. You have to call this 1134Process some outstanding events on the result pipe. You have to call this
864regularly. Returns the number of events processed. Returns immediately 1135regularly. Returns the number of events processed. Returns immediately
865when no events are outstanding. 1136when no events are outstanding. The amount of events processed depends on
1137the settings of C<IO::AIO::max_poll_req> and C<IO::AIO::max_poll_time>.
866 1138
867If not all requests were processed for whatever reason, the filehandle 1139If not all requests were processed for whatever reason, the filehandle
868will still be ready when C<poll_cb> returns. 1140will still be ready when C<poll_cb> returns.
869 1141
870Example: Install an Event watcher that automatically calls 1142Example: Install an Event watcher that automatically calls
872 1144
873 Event->io (fd => IO::AIO::poll_fileno, 1145 Event->io (fd => IO::AIO::poll_fileno,
874 poll => 'r', async => 1, 1146 poll => 'r', async => 1,
875 cb => \&IO::AIO::poll_cb); 1147 cb => \&IO::AIO::poll_cb);
876 1148
877=item IO::AIO::poll_some $max_requests 1149=item IO::AIO::max_poll_reqs $nreqs
878 1150
879Similar to C<poll_cb>, but only processes up to C<$max_requests> requests 1151=item IO::AIO::max_poll_time $seconds
880at a time.
881 1152
882Useful if you want to ensure some level of interactiveness when perl is 1153These set the maximum number of requests (default C<0>, meaning infinity)
883not fast enough to process all requests in time. 1154that are being processed by C<IO::AIO::poll_cb> in one call, respectively
1155the maximum amount of time (default C<0>, meaning infinity) spent in
1156C<IO::AIO::poll_cb> to process requests (more correctly the mininum amount
1157of time C<poll_cb> is allowed to use).
1158
1159Setting C<max_poll_time> to a non-zero value creates an overhead of one
1160syscall per request processed, which is not normally a problem unless your
1161callbacks are really really fast or your OS is really really slow (I am
1162not mentioning Solaris here). Using C<max_poll_reqs> incurs no overhead.
1163
1164Setting these is useful if you want to ensure some level of
1165interactiveness when perl is not fast enough to process all requests in
1166time.
1167
1168For interactive programs, values such as C<0.01> to C<0.1> should be fine.
884 1169
885Example: Install an Event watcher that automatically calls 1170Example: Install an Event watcher that automatically calls
886IO::AIO::poll_some with low priority, to ensure that other parts of the 1171IO::AIO::poll_cb with low priority, to ensure that other parts of the
887program get the CPU sometimes even under high AIO load. 1172program get the CPU sometimes even under high AIO load.
888 1173
1174 # try not to spend much more than 0.1s in poll_cb
1175 IO::AIO::max_poll_time 0.1;
1176
1177 # use a low priority so other tasks have priority
889 Event->io (fd => IO::AIO::poll_fileno, 1178 Event->io (fd => IO::AIO::poll_fileno,
890 poll => 'r', nice => 1, 1179 poll => 'r', nice => 1,
891 cb => sub { IO::AIO::poll_some 256 }); 1180 cb => &IO::AIO::poll_cb);
892 1181
893=item IO::AIO::poll_wait 1182=item IO::AIO::poll_wait
894 1183
1184If there are any outstanding requests and none of them in the result
895Wait till the result filehandle becomes ready for reading (simply does a 1185phase, wait till the result filehandle becomes ready for reading (simply
896C<select> on the filehandle. This is useful if you want to synchronously wait 1186does a C<select> on the filehandle. This is useful if you want to
897for some requests to finish). 1187synchronously wait for some requests to finish).
898 1188
899See C<nreqs> for an example. 1189See C<nreqs> for an example.
900 1190
1191=item IO::AIO::poll
1192
1193Waits until some requests have been handled.
1194
1195Returns the number of requests processed, but is otherwise strictly
1196equivalent to:
1197
1198 IO::AIO::poll_wait, IO::AIO::poll_cb
1199
901=item IO::AIO::nreqs 1200=item IO::AIO::flush
902 1201
903Returns the number of requests currently in the ready, execute or pending 1202Wait till all outstanding AIO requests have been handled.
904states (i.e. for which their callback has not been invoked yet).
905 1203
906Example: wait till there are no outstanding requests anymore: 1204Strictly equivalent to:
907 1205
908 IO::AIO::poll_wait, IO::AIO::poll_cb 1206 IO::AIO::poll_wait, IO::AIO::poll_cb
909 while IO::AIO::nreqs; 1207 while IO::AIO::nreqs;
910 1208
911=item IO::AIO::nready 1209=back
912 1210
913Returns the number of requests currently in the ready state (not yet 1211=head3 CONTROLLING THE NUMBER OF THREADS
914executed).
915 1212
916=item IO::AIO::npending 1213=over
917
918Returns the number of requests currently in the pending state (executed,
919but not yet processed by poll_cb).
920
921=item IO::AIO::flush
922
923Wait till all outstanding AIO requests have been handled.
924
925Strictly equivalent to:
926
927 IO::AIO::poll_wait, IO::AIO::poll_cb
928 while IO::AIO::nreqs;
929
930=item IO::AIO::poll
931
932Waits until some requests have been handled.
933
934Strictly equivalent to:
935
936 IO::AIO::poll_wait, IO::AIO::poll_cb
937 if IO::AIO::nreqs;
938 1214
939=item IO::AIO::min_parallel $nthreads 1215=item IO::AIO::min_parallel $nthreads
940 1216
941Set the minimum number of AIO threads to C<$nthreads>. The current 1217Set the minimum number of AIO threads to C<$nthreads>. The current
942default is C<8>, which means eight asynchronous operations can execute 1218default is C<8>, which means eight asynchronous operations can execute
943concurrently at any one time (the number of outstanding requests, 1219concurrently at any one time (the number of outstanding requests,
944however, is unlimited). 1220however, is unlimited).
945 1221
946IO::AIO starts threads only on demand, when an AIO request is queued and 1222IO::AIO starts threads only on demand, when an AIO request is queued and
947no free thread exists. 1223no free thread exists. Please note that queueing up a hundred requests can
1224create demand for a hundred threads, even if it turns out that everything
1225is in the cache and could have been processed faster by a single thread.
948 1226
949It is recommended to keep the number of threads relatively low, as some 1227It is recommended to keep the number of threads relatively low, as some
950Linux kernel versions will scale negatively with the number of threads 1228Linux kernel versions will scale negatively with the number of threads
951(higher parallelity => MUCH higher latency). With current Linux 2.6 1229(higher parallelity => MUCH higher latency). With current Linux 2.6
952versions, 4-32 threads should be fine. 1230versions, 4-32 threads should be fine.
966This module automatically runs C<max_parallel 0> at program end, to ensure 1244This module automatically runs C<max_parallel 0> at program end, to ensure
967that all threads are killed and that there are no outstanding requests. 1245that all threads are killed and that there are no outstanding requests.
968 1246
969Under normal circumstances you don't need to call this function. 1247Under normal circumstances you don't need to call this function.
970 1248
1249=item IO::AIO::max_idle $nthreads
1250
1251Limit the number of threads (default: 4) that are allowed to idle (i.e.,
1252threads that did not get a request to process within 10 seconds). That
1253means if a thread becomes idle while C<$nthreads> other threads are also
1254idle, it will free its resources and exit.
1255
1256This is useful when you allow a large number of threads (e.g. 100 or 1000)
1257to allow for extremely high load situations, but want to free resources
1258under normal circumstances (1000 threads can easily consume 30MB of RAM).
1259
1260The default is probably ok in most situations, especially if thread
1261creation is fast. If thread creation is very slow on your system you might
1262want to use larger values.
1263
971=item $oldmaxreqs = IO::AIO::max_outstanding $maxreqs 1264=item $oldmaxreqs = IO::AIO::max_outstanding $maxreqs
972 1265
973This is a very bad function to use in interactive programs because it 1266This is a very bad function to use in interactive programs because it
974blocks, and a bad way to reduce concurrency because it is inexact: Better 1267blocks, and a bad way to reduce concurrency because it is inexact: Better
975use an C<aio_group> together with a feed callback. 1268use an C<aio_group> together with a feed callback.
976 1269
977Sets the maximum number of outstanding requests to C<$nreqs>. If you 1270Sets the maximum number of outstanding requests to C<$nreqs>. If you
978to queue up more than this number of requests, the next call to the 1271do queue up more than this number of requests, the next call to the
979C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>) 1272C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
980function will block until the limit is no longer exceeded. 1273function will block until the limit is no longer exceeded.
981 1274
982The default value is very large, so there is no practical limit on the 1275The default value is very large, so there is no practical limit on the
983number of outstanding requests. 1276number of outstanding requests.
986C<max_oustsanding> is mainly useful in simple scripts (with low values) or 1279C<max_oustsanding> is mainly useful in simple scripts (with low values) or
987as a stop gap to shield against fatal memory overflow (with large values). 1280as a stop gap to shield against fatal memory overflow (with large values).
988 1281
989=back 1282=back
990 1283
1284=head3 STATISTICAL INFORMATION
1285
1286=over
1287
1288=item IO::AIO::nreqs
1289
1290Returns the number of requests currently in the ready, execute or pending
1291states (i.e. for which their callback has not been invoked yet).
1292
1293Example: wait till there are no outstanding requests anymore:
1294
1295 IO::AIO::poll_wait, IO::AIO::poll_cb
1296 while IO::AIO::nreqs;
1297
1298=item IO::AIO::nready
1299
1300Returns the number of requests currently in the ready state (not yet
1301executed).
1302
1303=item IO::AIO::npending
1304
1305Returns the number of requests currently in the pending state (executed,
1306but not yet processed by poll_cb).
1307
1308=back
1309
991=cut 1310=cut
992 1311
993# support function to convert a fd into a perl filehandle
994sub _fd2fh {
995 return undef if $_[0] < 0;
996
997 # try to generate nice filehandles
998 my $sym = "IO::AIO::fd#$_[0]";
999 local *$sym;
1000
1001 open *$sym, "+<&=$_[0]" # usually works under any unix
1002 or open *$sym, "<&=$_[0]" # cygwin needs this
1003 or open *$sym, ">&=$_[0]" # or this
1004 or return undef;
1005
1006 *$sym
1007}
1008
1009min_parallel 8; 1312min_parallel 8;
1010 1313
1011END { 1314END { flush }
1012 min_parallel 1;
1013 flush;
1014};
1015 1315
10161; 13161;
1017 1317
1018=head2 FORK BEHAVIOUR 1318=head2 FORK BEHAVIOUR
1019 1319
1039bytes of memory. In addition, stat requests need a stat buffer (possibly 1339bytes of memory. In addition, stat requests need a stat buffer (possibly
1040a few hundred bytes), readdir requires a result buffer and so on. Perl 1340a few hundred bytes), readdir requires a result buffer and so on. Perl
1041scalars and other data passed into aio requests will also be locked and 1341scalars and other data passed into aio requests will also be locked and
1042will consume memory till the request has entered the done state. 1342will consume memory till the request has entered the done state.
1043 1343
1044This is now awfully much, so queuing lots of requests is not usually a 1344This is not awfully much, so queuing lots of requests is not usually a
1045problem. 1345problem.
1046 1346
1047Per-thread usage: 1347Per-thread usage:
1048 1348
1049In the execution phase, some aio requests require more memory for 1349In the execution phase, some aio requests require more memory for

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines