ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/IO-AIO/AIO.pm
(Generate patch)

Comparing IO-AIO/AIO.pm (file contents):
Revision 1.22 by root, Wed Jul 20 21:55:27 2005 UTC vs.
Revision 1.37 by root, Tue Aug 23 12:37:19 2005 UTC

56 56
57=cut 57=cut
58 58
59package IO::AIO; 59package IO::AIO;
60 60
61no warnings;
62
61use base 'Exporter'; 63use base 'Exporter';
62 64
63use Fcntl (); 65use Fcntl ();
64 66
65BEGIN { 67BEGIN {
66 $VERSION = 0.9; 68 $VERSION = 1.6;
67 69
68 @EXPORT = qw(aio_read aio_write aio_open aio_close aio_stat aio_lstat aio_unlink 70 @EXPORT = qw(aio_read aio_write aio_open aio_close aio_stat aio_lstat aio_unlink
69 aio_fsync aio_fdatasync aio_readahead); 71 aio_rmdir aio_readdir aio_symlink aio_fsync aio_fdatasync aio_readahead);
70 @EXPORT_OK = qw(poll_fileno poll_cb min_parallel max_parallel max_outstanding nreqs); 72 @EXPORT_OK = qw(poll_fileno poll_cb min_parallel max_parallel max_outstanding nreqs);
71 73
72 require XSLoader; 74 require XSLoader;
73 XSLoader::load IO::AIO, $VERSION; 75 XSLoader::load IO::AIO, $VERSION;
74} 76}
83which must be a code reference. This code reference will get called with 85which must be a code reference. This code reference will get called with
84the syscall return code (e.g. most syscalls return C<-1> on error, unlike 86the syscall return code (e.g. most syscalls return C<-1> on error, unlike
85perl, which usually delivers "false") as it's sole argument when the given 87perl, which usually delivers "false") as it's sole argument when the given
86syscall has been executed asynchronously. 88syscall has been executed asynchronously.
87 89
88All functions that expect a filehandle will also accept a file descriptor. 90All functions expecting a filehandle keep a copy of the filehandle
91internally until the request has finished.
89 92
90The filenames you pass to these routines I<must> be absolute. The reason 93The pathnames you pass to these routines I<must> be absolute and
91for this is that at the time the request is being executed, the current 94encoded in byte form. The reason for the former is that at the time the
92working directory could have changed. Alternatively, you can make sure 95request is being executed, the current working directory could have
93that you never change the current working directory. 96changed. Alternatively, you can make sure that you never change the
97current working directory.
98
99To encode pathnames to byte form, either make sure you either: a)
100always pass in filenames you got from outside (command line, readdir
101etc.), b) are ASCII or ISO 8859-1, c) use the Encode module and encode
102your pathnames to the locale (or other) encoding in effect in the user
103environment, d) use Glib::filename_from_unicode on unicode filenames or e)
104use something else.
94 105
95=over 4 106=over 4
96 107
97=item aio_open $pathname, $flags, $mode, $callback 108=item aio_open $pathname, $flags, $mode, $callback
98 109
139Reads or writes C<length> bytes from the specified C<fh> and C<offset> 150Reads or writes C<length> bytes from the specified C<fh> and C<offset>
140into the scalar given by C<data> and offset C<dataoffset> and calls the 151into the scalar given by C<data> and offset C<dataoffset> and calls the
141callback without the actual number of bytes read (or -1 on error, just 152callback without the actual number of bytes read (or -1 on error, just
142like the syscall). 153like the syscall).
143 154
155The C<$data> scalar I<MUST NOT> be modified in any way while the request
156is outstanding. Modifying it can result in segfaults or WW3 (if the
157necessary/optional hardware is installed).
158
144Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at 159Example: Read 15 bytes at offset 7 into scalar C<$buffer>, starting at
145offset C<0> within the scalar: 160offset C<0> within the scalar:
146 161
147 aio_read $fh, 7, 15, $buffer, 0, sub { 162 aio_read $fh, 7, 15, $buffer, 0, sub {
148 $_[0] > 0 or die "read error: $!"; 163 $_[0] > 0 or die "read error: $!";
149 print "read $_[0] bytes: <$buffer>\n"; 164 print "read $_[0] bytes: <$buffer>\n";
150 }; 165 };
151 166
167=item aio_sendfile $out_fh, $in_fh, $in_offset, $length, $callback
168
169Tries to copy C<$length> bytes from C<$in_fh> to C<$out_fh>. It starts
170reading at byte offset C<$in_offset>, and starts writing at the current
171file offset of C<$out_fh>. Because of that, it is not safe to issue more
172than one C<aio_sendfile> per C<$out_fh>, as they will interfere with each
173other.
174
175This call tries to make use of a native C<sendfile> syscall to provide
176zero-copy operation. For this to work, C<$out_fh> should refer to a
177socket, and C<$in_fh> should refer to mmap'able file.
178
179If the native sendfile call fails or is not implemented, it will be
180emulated, so you can call C<aio_sendfile> on any type of filehandle
181regardless of the limitations of the operating system.
182
183Please note, however, that C<aio_sendfile> can read more bytes from
184C<$in_fh> than are written, and there is no way to find out how many
185bytes have been read from C<aio_sendfile> alone, as C<aio_sendfile> only
186provides the number of bytes written to C<$out_fh>. Only if the result
187value equals C<$length> one can assume that C<$length> bytes have been
188read.
189
152=item aio_readahead $fh,$offset,$length, $callback 190=item aio_readahead $fh,$offset,$length, $callback
153
154Asynchronously reads the specified byte range into the page cache, using
155the C<readahead> syscall. If that syscall doesn't exist (likely if your OS
156isn't Linux) the status will be C<-1> and C<$!> is set to C<ENOSYS>.
157 191
158C<aio_readahead> populates the page cache with data from a file so that 192C<aio_readahead> populates the page cache with data from a file so that
159subsequent reads from that file will not block on disk I/O. The C<$offset> 193subsequent reads from that file will not block on disk I/O. The C<$offset>
160argument specifies the starting point from which data is to be read and 194argument specifies the starting point from which data is to be read and
161C<$length> specifies the number of bytes to be read. I/O is performed in 195C<$length> specifies the number of bytes to be read. I/O is performed in
162whole pages, so that offset is effectively rounded down to a page boundary 196whole pages, so that offset is effectively rounded down to a page boundary
163and bytes are read up to the next page boundary greater than or equal to 197and bytes are read up to the next page boundary greater than or equal to
164(off-set+length). C<aio_readahead> does not read beyond the end of the 198(off-set+length). C<aio_readahead> does not read beyond the end of the
165file. The current file offset of the file is left unchanged. 199file. The current file offset of the file is left unchanged.
166 200
201If that syscall doesn't exist (likely if your OS isn't Linux) it will be
202emulated by simply reading the data, which would have a similar effect.
203
167=item aio_stat $fh_or_path, $callback 204=item aio_stat $fh_or_path, $callback
168 205
169=item aio_lstat $fh, $callback 206=item aio_lstat $fh, $callback
170 207
171Works like perl's C<stat> or C<lstat> in void context. The callback will 208Works like perl's C<stat> or C<lstat> in void context. The callback will
189=item aio_unlink $pathname, $callback 226=item aio_unlink $pathname, $callback
190 227
191Asynchronously unlink (delete) a file and call the callback with the 228Asynchronously unlink (delete) a file and call the callback with the
192result code. 229result code.
193 230
231=item aio_rmdir $pathname, $callback
232
233Asynchronously rmdir (delete) a directory and call the callback with the
234result code.
235
236=item aio_readdir $pathname $callback
237
238Unlike the POSIX call of the same name, C<aio_readdir> reads an entire
239directory (i.e. opendir + readdir + closedir). The entries will not be
240sorted, and will B<NOT> include the C<.> and C<..> entries.
241
242The callback a single argument which is either C<undef> or an array-ref
243with the filenames.
244
194=item aio_fsync $fh, $callback 245=item aio_fsync $fh, $callback
195 246
196Asynchronously call fsync on the given filehandle and call the callback 247Asynchronously call fsync on the given filehandle and call the callback
197with the fsync result code. 248with the fsync result code.
198 249
199=item aio_fdatasync $fh, $callback 250=item aio_fdatasync $fh, $callback
200 251
201Asynchronously call fdatasync on the given filehandle and call the 252Asynchronously call fdatasync on the given filehandle and call the
202callback with the fdatasync result code. Might set C<$!> to C<ENOSYS> if 253callback with the fdatasync result code.
203C<fdatasync> is not available. 254
255If this call isn't available because your OS lacks it or it couldn't be
256detected, it will be emulated by calling C<fsync> instead.
204 257
205=back 258=back
206 259
207=head2 SUPPORT FUNCTIONS 260=head2 SUPPORT FUNCTIONS
208 261
266 IO::AIO::poll_wait, IO::AIO::poll_cb 319 IO::AIO::poll_wait, IO::AIO::poll_cb
267 if IO::AIO::nreqs; 320 if IO::AIO::nreqs;
268 321
269=item IO::AIO::min_parallel $nthreads 322=item IO::AIO::min_parallel $nthreads
270 323
271Set the minimum number of AIO threads to C<$nthreads>. The default is 324Set the minimum number of AIO threads to C<$nthreads>. The current default
272C<1>, which means a single asynchronous operation can be done at one time 325is C<4>, which means four asynchronous operations can be done at one time
273(the number of outstanding operations, however, is unlimited). 326(the number of outstanding operations, however, is unlimited).
327
328IO::AIO starts threads only on demand, when an AIO request is queued and
329no free thread exists.
274 330
275It is recommended to keep the number of threads low, as some Linux 331It is recommended to keep the number of threads low, as some Linux
276kernel versions will scale negatively with the number of threads (higher 332kernel versions will scale negatively with the number of threads (higher
277parallelity => MUCH higher latency). With current Linux 2.6 versions, 4-32 333parallelity => MUCH higher latency). With current Linux 2.6 versions, 4-32
278threads should be fine. 334threads should be fine.
279 335
280Under normal circumstances you don't need to call this function, as this 336Under most circumstances you don't need to call this function, as the
281module automatically starts some threads (the exact number might change, 337module selects a default that is suitable for low to moderate load.
282and is currently 4).
283 338
284=item IO::AIO::max_parallel $nthreads 339=item IO::AIO::max_parallel $nthreads
285 340
286Sets the maximum number of AIO threads to C<$nthreads>. If more than 341Sets the maximum number of AIO threads to C<$nthreads>. If more than the
287the specified number of threads are currently running, kill them. This 342specified number of threads are currently running, this function kills
288function blocks until the limit is reached. 343them. This function blocks until the limit is reached.
344
345While C<$nthreads> are zero, aio requests get queued but not executed
346until the number of threads has been increased again.
289 347
290This module automatically runs C<max_parallel 0> at program end, to ensure 348This module automatically runs C<max_parallel 0> at program end, to ensure
291that all threads are killed and that there are no outstanding requests. 349that all threads are killed and that there are no outstanding requests.
292 350
293Under normal circumstances you don't need to call this function. 351Under normal circumstances you don't need to call this function.
297Sets the maximum number of outstanding requests to C<$nreqs>. If you 355Sets the maximum number of outstanding requests to C<$nreqs>. If you
298try to queue up more than this number of requests, the caller will block until 356try to queue up more than this number of requests, the caller will block until
299some requests have been handled. 357some requests have been handled.
300 358
301The default is very large, so normally there is no practical limit. If you 359The default is very large, so normally there is no practical limit. If you
302queue up many requests in a loop it it often improves speed if you set 360queue up many requests in a loop it often improves speed if you set
303this to a relatively low number, such as C<100>. 361this to a relatively low number, such as C<100>.
304 362
305Under normal circumstances you don't need to call this function. 363Under normal circumstances you don't need to call this function.
306 364
307=back 365=back
310 368
311# support function to convert a fd into a perl filehandle 369# support function to convert a fd into a perl filehandle
312sub _fd2fh { 370sub _fd2fh {
313 return undef if $_[0] < 0; 371 return undef if $_[0] < 0;
314 372
315 # try to be perl5.6-compatible 373 # try to generate nice filehandles
316 local *AIO_FH; 374 my $sym = "IO::AIO::fd#$_[0]";
317 open AIO_FH, "+<&=$_[0]" 375 local *$sym;
376
377 open *$sym, "+<&=$_[0]" # usually works under any unix
378 or open *$sym, "<&=$_[0]" # cygwin needs this
379 or open *$sym, ">&=$_[0]" # or this
318 or return undef; 380 or return undef;
319 381
320 *AIO_FH 382 *$sym
321} 383}
322 384
323min_parallel 4; 385min_parallel 4;
324 386
325END { 387END {
326 max_parallel 0; 388 max_parallel 0;
327} 389}
328 390
3291; 3911;
330 392
393=head2 FORK BEHAVIOUR
394
395Before the fork, IO::AIO enters a quiescent state where no requests
396can be added in other threads and no results will be processed. After
397the fork the parent simply leaves the quiescent state and continues
398request/result processing, while the child clears the request/result
399queue (so the requests started before the fork will only be handled in
400the parent). Threats will be started on demand until the limit ste in the
401parent process has been reached again.
402
331=head1 SEE ALSO 403=head1 SEE ALSO
332 404
333L<Coro>, L<Linux::AIO>. 405L<Coro>, L<Linux::AIO>.
334 406
335=head1 AUTHOR 407=head1 AUTHOR

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines