ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.105 by root, Sun Mar 25 00:20:27 2007 UTC vs.
Revision 1.117 by root, Sat Oct 6 14:05:19 2007 UTC

62etc.), but can also be used to easily do operations in parallel that are 62etc.), but can also be used to easily do operations in parallel that are
63normally done sequentially, e.g. stat'ing many files, which is much faster 63normally done sequentially, e.g. stat'ing many files, which is much faster
64on a RAID volume or over NFS when you do a number of stat operations 64on a RAID volume or over NFS when you do a number of stat operations
65concurrently. 65concurrently.
66 66
67While most of this works on all types of file descriptors (for example 67While most of this works on all types of file descriptors (for
68sockets), using these functions on file descriptors that support 68example sockets), using these functions on file descriptors that
69nonblocking operation (again, sockets, pipes etc.) is very inefficient or 69support nonblocking operation (again, sockets, pipes etc.) is very
70might not work (aio_read fails on sockets/pipes/fifos). Use an event loop 70inefficient. Use an event loop for that (such as the L<Event|Event>
71for that (such as the L<Event|Event> module): IO::AIO will naturally fit 71module): IO::AIO will naturally fit into such an event loop itself.
72into such an event loop itself.
73 72
74In this version, a number of threads are started that execute your 73In this version, a number of threads are started that execute your
75requests and signal their completion. You don't need thread support 74requests and signal their completion. You don't need thread support
76in perl, and the threads created by this module will not be visible 75in perl, and the threads created by this module will not be visible
77to perl. In the future, this module might make use of the native aio 76to perl. In the future, this module might make use of the native aio
79not well-supported or restricted (GNU/Linux doesn't allow them on normal 78not well-supported or restricted (GNU/Linux doesn't allow them on normal
80files currently, for example), and they would only support aio_read and 79files currently, for example), and they would only support aio_read and
81aio_write, so the remaining functionality would have to be implemented 80aio_write, so the remaining functionality would have to be implemented
82using threads anyway. 81using threads anyway.
83 82
84Although the module will work with in the presence of other (Perl-) 83Although the module will work in the presence of other (Perl-) threads,
85threads, it is currently not reentrant in any way, so use appropriate 84it is currently not reentrant in any way, so use appropriate locking
86locking yourself, always call C<poll_cb> from within the same thread, or 85yourself, always call C<poll_cb> from within the same thread, or never
87never call C<poll_cb> (or other C<aio_> functions) recursively. 86call C<poll_cb> (or other C<aio_> functions) recursively.
88 87
89=head2 EXAMPLE 88=head2 EXAMPLE
90 89
91This is a simple example that uses the Event module and loads 90This is a simple example that uses the Event module and loads
92F</etc/passwd> asynchronously: 91F</etc/passwd> asynchronously:
184 183
185=cut 184=cut
186 185
187package IO::AIO; 186package IO::AIO;
188 187
188use Carp ();
189
189no warnings; 190no warnings;
190use strict 'vars'; 191use strict 'vars';
191 192
192use base 'Exporter'; 193use base 'Exporter';
193 194
194BEGIN { 195BEGIN {
195 our $VERSION = '2.33'; 196 our $VERSION = '2.51';
196 197
197 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat 198 our @AIO_REQ = qw(aio_sendfile aio_read aio_write aio_open aio_close aio_stat
198 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink 199 aio_lstat aio_unlink aio_rmdir aio_readdir aio_scandir aio_symlink
199 aio_readlink aio_fsync aio_fdatasync aio_readahead aio_rename aio_link 200 aio_readlink aio_fsync aio_fdatasync aio_readahead aio_rename aio_link
200 aio_move aio_copy aio_group aio_nop aio_mknod aio_load aio_rmtree aio_mkdir); 201 aio_move aio_copy aio_group aio_nop aio_mknod aio_load aio_rmtree aio_mkdir
202 aio_chown aio_chmod aio_utime aio_truncate);
201 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice aio_block)); 203 our @EXPORT = (@AIO_REQ, qw(aioreq_pri aioreq_nice aio_block));
202 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush 204 our @EXPORT_OK = qw(poll_fileno poll_cb poll_wait flush
203 min_parallel max_parallel max_idle 205 min_parallel max_parallel max_idle
204 nreqs nready npending nthreads 206 nreqs nready npending nthreads
205 max_poll_time max_poll_reqs); 207 max_poll_time max_poll_reqs);
271 aio_read $_[0], ..., sub { 273 aio_read $_[0], ..., sub {
272 ... 274 ...
273 }; 275 };
274 }; 276 };
275 277
278
276=item aioreq_nice $pri_adjust 279=item aioreq_nice $pri_adjust
277 280
278Similar to C<aioreq_pri>, but subtracts the given value from the current 281Similar to C<aioreq_pri>, but subtracts the given value from the current
279priority, so the effect is cumulative. 282priority, so the effect is cumulative.
283
280 284
281=item aio_open $pathname, $flags, $mode, $callback->($fh) 285=item aio_open $pathname, $flags, $mode, $callback->($fh)
282 286
283Asynchronously open or create a file and call the callback with a newly 287Asynchronously open or create a file and call the callback with a newly
284created filehandle for the file. 288created filehandle for the file.
305 } else { 309 } else {
306 die "open failed: $!\n"; 310 die "open failed: $!\n";
307 } 311 }
308 }; 312 };
309 313
314
310=item aio_close $fh, $callback->($status) 315=item aio_close $fh, $callback->($status)
311 316
312Asynchronously close a file and call the callback with the result 317Asynchronously close a file and call the callback with the result
313code. I<WARNING:> although accepted, you should not pass in a perl 318code.
314filehandle here, as perl will likely close the file descriptor another
315time when the filehandle is destroyed. Normally, you can safely call perls
316C<close> or just let filehandles go out of scope.
317 319
318This is supposed to be a bug in the API, so that might change. It's 320Unfortunately, you can't do this to perl. Perl I<insists> very strongly on
319therefore best to avoid this function. 321closing the file descriptor associated with the filehandle itself. Here is
322what aio_close will try:
323
324 1. dup()licate the fd
325 2. asynchronously close() the duplicated fd
326 3. dup()licate the fd once more
327 4. let perl close() the filehandle
328 5. asynchronously close the duplicated fd
329
330The idea is that the first close() flushes stuff to disk that closing an
331fd will flush, so when perl closes the fd, nothing much will need to be
332flushed. The second async. close() will then flush stuff to disk that
333closing the last fd to the file will flush.
334
335Just FYI, SuSv3 has this to say on close:
336
337 All outstanding record locks owned by the process on the file
338 associated with the file descriptor shall be removed.
339
340 If fildes refers to a socket, close() shall cause the socket to be
341 destroyed. ... close() shall block for up to the current linger
342 interval until all data is transmitted.
343 [this actually sounds like a specification bug, but who knows]
344
345And at least Linux additionally actually flushes stuff on every close,
346even when the file itself is still open.
347
348Sounds enourmously inefficient and complicated? Yes... please show me how
349to nuke perl's fd out of existence...
350
351=cut
352
353sub aio_close($;$) {
354 aio_block {
355 my ($fh, $cb) = @_;
356
357 my $pri = aioreq_pri;
358 my $grp = aio_group $cb;
359
360 my $fd = fileno $fh;
361
362 defined $fd or Carp::croak "aio_close called with fd-less filehandle";
363
364 # if the dups fail we will simply get EBADF
365 my $fd2 = _dup $fd;
366 aioreq_pri $pri;
367 add $grp _aio_close $fd2, sub {
368 my $fd2 = _dup $fd;
369 close $fh;
370 aioreq_pri $pri;
371 add $grp _aio_close $fd2, sub {
372 $grp->result ($_[0]);
373 };
374 };
375
376 $grp
377 }
378}
379
320 380
321=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 381=item aio_read $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
322 382
323=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval) 383=item aio_write $fh,$offset,$length, $data,$dataoffset, $callback->($retval)
324 384
325Reads or writes C<length> bytes from the specified C<fh> and C<offset> 385Reads or writes C<$length> bytes from the specified C<$fh> and C<$offset>
326into the scalar given by C<data> and offset C<dataoffset> and calls the 386into the scalar given by C<$data> and offset C<$dataoffset> and calls the
327callback without the actual number of bytes read (or -1 on error, just 387callback without the actual number of bytes read (or -1 on error, just
328like the syscall). 388like the syscall).
329 389
390If C<$offset> is undefined, then the current file descriptor offset will
391be used (and updated), otherwise the file descriptor offset will not be
392changed by these calls.
393
394If C<$length> is undefined in C<aio_write>, use the remaining length of C<$data>.
395
396If C<$dataoffset> is less than zero, it will be counted from the end of
397C<$data>.
398
330The C<$data> scalar I<MUST NOT> be modified in any way while the request 399The C<$data> scalar I<MUST NOT> be modified in any way while the request
331is outstanding. Modifying it can result in segfaults or WW3 (if the 400is outstanding. Modifying it can result in segfaults or World War III (if
332necessary/optional hardware is installed). 401the necessary/optional hardware is installed).
333 402
334Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at 403Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at
335offset C<0> within the scalar: 404offset C<0> within the scalar:
336 405
337 aio_read $fh, 7, 15, $buffer, 0, sub { 406 aio_read $fh, 7, 15, $buffer, 0, sub {
338 $_[0] > 0 or die "read error: $!"; 407 $_[0] > 0 or die "read error: $!";
339 print "read $_[0] bytes: <$buffer>\n"; 408 print "read $_[0] bytes: <$buffer>\n";
340 }; 409 };
410
341 411
342=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval) 412=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback->($retval)
343 413
344Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts 414Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts
345reading at byte offset C<$in_offset>, and starts writing at the current 415reading at byte offset C<$in_offset>, and starts writing at the current
359C<$in_fh> than are written, and there is no way to find out how many 429C<$in_fh> than are written, and there is no way to find out how many
360bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only 430bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only
361provides the number of bytes written to C<$out_fh>. Only if the result 431provides the number of bytes written to C<$out_fh>. Only if the result
362value equals C<$length> one can assume that C<$length> bytes have been 432value equals C<$length> one can assume that C<$length> bytes have been
363read. 433read.
434
364 435
365=item aio_readahead $fh,$offset,$length, $callback->($retval) 436=item aio_readahead $fh,$offset,$length, $callback->($retval)
366 437
367C<aio_readahead> populates the page cache with data from a file so that 438C<aio_readahead> populates the page cache with data from a file so that
368subsequent reads from that file will not block on disk I/O. The C<$offset> 439subsequent reads from that file will not block on disk I/O. The C<$offset>
374file. The current file offset of the file is left unchanged. 445file. The current file offset of the file is left unchanged.
375 446
376If that syscall doesn't exist (likely if your OS isn't Linux) it will be 447If that syscall doesn't exist (likely if your OS isn't Linux) it will be
377emulated by simply reading the data, which would have a similar effect. 448emulated by simply reading the data, which would have a similar effect.
378 449
450
379=item aio_stat $fh_or_path, $callback->($status) 451=item aio_stat $fh_or_path, $callback->($status)
380 452
381=item aio_lstat $fh, $callback->($status) 453=item aio_lstat $fh, $callback->($status)
382 454
383Works like perl's C<stat> or C<lstat> in void context. The callback will 455Works like perl's C<stat> or C<lstat> in void context. The callback will
396 aio_stat "/etc/passwd", sub { 468 aio_stat "/etc/passwd", sub {
397 $_[0] and die "stat failed: $!"; 469 $_[0] and die "stat failed: $!";
398 print "size is ", -s _, "\n"; 470 print "size is ", -s _, "\n";
399 }; 471 };
400 472
473
474=item aio_utime $fh_or_path, $atime, $mtime, $callback->($status)
475
476Works like perl's C<utime> function (including the special case of $atime
477and $mtime being undef). Fractional times are supported if the underlying
478syscalls support them.
479
480When called with a pathname, uses utimes(2) if available, otherwise
481utime(2). If called on a file descriptor, uses futimes(2) if available,
482otherwise returns ENOSYS, so this is not portable.
483
484Examples:
485
486 # set atime and mtime to current time (basically touch(1)):
487 aio_utime "path", undef, undef;
488 # set atime to current time and mtime to beginning of the epoch:
489 aio_utime "path", time, undef; # undef==0
490
491
492=item aio_chown $fh_or_path, $uid, $gid, $callback->($status)
493
494Works like perl's C<chown> function, except that C<undef> for either $uid
495or $gid is being interpreted as "do not change" (but -1 can also be used).
496
497Examples:
498
499 # same as "chown root path" in the shell:
500 aio_chown "path", 0, -1;
501 # same as above:
502 aio_chown "path", 0, undef;
503
504
505=item aio_truncate $fh_or_path, $offset, $callback->($status)
506
507Works like truncate(2) or ftruncate(2).
508
509
510=item aio_chmod $fh_or_path, $mode, $callback->($status)
511
512Works like perl's C<chmod> function.
513
514
401=item aio_unlink $pathname, $callback->($status) 515=item aio_unlink $pathname, $callback->($status)
402 516
403Asynchronously unlink (delete) a file and call the callback with the 517Asynchronously unlink (delete) a file and call the callback with the
404result code. 518result code.
405 519
520
406=item aio_mknod $path, $mode, $dev, $callback->($status) 521=item aio_mknod $path, $mode, $dev, $callback->($status)
407 522
408[EXPERIMENTAL] 523[EXPERIMENTAL]
409 524
410Asynchronously create a device node (or fifo). See mknod(2). 525Asynchronously create a device node (or fifo). See mknod(2).
411 526
412The only (POSIX-) portable way of calling this function is: 527The only (POSIX-) portable way of calling this function is:
413 528
414 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ... 529 aio_mknod $path, IO::AIO::S_IFIFO | $mode, 0, sub { ...
530
415 531
416=item aio_link $srcpath, $dstpath, $callback->($status) 532=item aio_link $srcpath, $dstpath, $callback->($status)
417 533
418Asynchronously create a new link to the existing object at C<$srcpath> at 534Asynchronously create a new link to the existing object at C<$srcpath> at
419the path C<$dstpath> and call the callback with the result code. 535the path C<$dstpath> and call the callback with the result code.
420 536
537
421=item aio_symlink $srcpath, $dstpath, $callback->($status) 538=item aio_symlink $srcpath, $dstpath, $callback->($status)
422 539
423Asynchronously create a new symbolic link to the existing object at C<$srcpath> at 540Asynchronously create a new symbolic link to the existing object at C<$srcpath> at
424the path C<$dstpath> and call the callback with the result code. 541the path C<$dstpath> and call the callback with the result code.
542
425 543
426=item aio_readlink $path, $callback->($link) 544=item aio_readlink $path, $callback->($link)
427 545
428Asynchronously read the symlink specified by C<$path> and pass it to 546Asynchronously read the symlink specified by C<$path> and pass it to
429the callback. If an error occurs, nothing or undef gets passed to the 547the callback. If an error occurs, nothing or undef gets passed to the
430callback. 548callback.
431 549
550
432=item aio_rename $srcpath, $dstpath, $callback->($status) 551=item aio_rename $srcpath, $dstpath, $callback->($status)
433 552
434Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as 553Asynchronously rename the object at C<$srcpath> to C<$dstpath>, just as
435rename(2) and call the callback with the result code. 554rename(2) and call the callback with the result code.
555
436 556
437=item aio_mkdir $pathname, $mode, $callback->($status) 557=item aio_mkdir $pathname, $mode, $callback->($status)
438 558
439Asynchronously mkdir (create) a directory and call the callback with 559Asynchronously mkdir (create) a directory and call the callback with
440the result code. C<$mode> will be modified by the umask at the time the 560the result code. C<$mode> will be modified by the umask at the time the
441request is executed, so do not change your umask. 561request is executed, so do not change your umask.
442 562
563
443=item aio_rmdir $pathname, $callback->($status) 564=item aio_rmdir $pathname, $callback->($status)
444 565
445Asynchronously rmdir (delete) a directory and call the callback with the 566Asynchronously rmdir (delete) a directory and call the callback with the
446result code. 567result code.
568
447 569
448=item aio_readdir $pathname, $callback->($entries) 570=item aio_readdir $pathname, $callback->($entries)
449 571
450Unlike the POSIX call of the same name, C<aio_readdir> reads an entire 572Unlike the POSIX call of the same name, C<aio_readdir> reads an entire
451directory (i.e. opendir + readdir + closedir). The entries will not be 573directory (i.e. opendir + readdir + closedir). The entries will not be
452sorted, and will B<NOT> include the C<.> and C<..> entries. 574sorted, and will B<NOT> include the C<.> and C<..> entries.
453 575
454The callback a single argument which is either C<undef> or an array-ref 576The callback a single argument which is either C<undef> or an array-ref
455with the filenames. 577with the filenames.
578
456 579
457=item aio_load $path, $data, $callback->($status) 580=item aio_load $path, $data, $callback->($status)
458 581
459This is a composite request that tries to fully load the given file into 582This is a composite request that tries to fully load the given file into
460memory. Status is the same as with aio_read. 583memory. Status is the same as with aio_read.
1140This is a very bad function to use in interactive programs because it 1263This is a very bad function to use in interactive programs because it
1141blocks, and a bad way to reduce concurrency because it is inexact: Better 1264blocks, and a bad way to reduce concurrency because it is inexact: Better
1142use an C<aio_group> together with a feed callback. 1265use an C<aio_group> together with a feed callback.
1143 1266
1144Sets the maximum number of outstanding requests to C<$nreqs>. If you 1267Sets the maximum number of outstanding requests to C<$nreqs>. If you
1145to queue up more than this number of requests, the next call to the 1268do queue up more than this number of requests, the next call to the
1146C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>) 1269C<poll_cb> (and C<poll_some> and other functions calling C<poll_cb>)
1147function will block until the limit is no longer exceeded. 1270function will block until the limit is no longer exceeded.
1148 1271
1149The default value is very large, so there is no practical limit on the 1272The default value is very large, so there is no practical limit on the
1150number of outstanding requests. 1273number of outstanding requests.
1180but not yet processed by poll_cb). 1303but not yet processed by poll_cb).
1181 1304
1182=back 1305=back
1183 1306
1184=cut 1307=cut
1185
1186# support function to convert a fd into a perl filehandle
1187sub _fd2fh {
1188 return undef if $_[0] < 0;
1189
1190 # try to generate nice filehandles
1191 my $sym = "IO::AIO::fd#$_[0]";
1192 local *$sym;
1193
1194 open *$sym, "+<&=$_[0]" # usually works under any unix
1195 or open *$sym, "<&=$_[0]" # cygwin needs this
1196 or open *$sym, ">&=$_[0]" # or this
1197 or return undef;
1198
1199 *$sym
1200}
1201 1308
1202min_parallel 8; 1309min_parallel 8;
1203 1310
1204END { flush } 1311END { flush }
1205 1312
1229bytes of memory. In addition, stat requests need a stat buffer (possibly 1336bytes of memory. In addition, stat requests need a stat buffer (possibly
1230a few hundred bytes), readdir requires a result buffer and so on. Perl 1337a few hundred bytes), readdir requires a result buffer and so on. Perl
1231scalars and other data passed into aio requests will also be locked and 1338scalars and other data passed into aio requests will also be locked and
1232will consume memory till the request has entered the done state. 1339will consume memory till the request has entered the done state.
1233 1340
1234This is now awfully much, so queuing lots of requests is not usually a 1341This is not awfully much, so queuing lots of requests is not usually a
1235problem. 1342problem.
1236 1343
1237Per-thread usage: 1344Per-thread usage:
1238 1345
1239In the execution phase, some aio requests require more memory for 1346In the execution phase, some aio requests require more memory for

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines