ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libeio/eio.pod
(Generate patch)

Comparing libeio/eio.pod (file contents):
Revision 1.8 by root, Sun Jun 5 22:45:49 2011 UTC vs.
Revision 1.36 by root, Sun Jan 24 16:36:20 2016 UTC

25similar functions, as well as less rarely ones such as C<mknod>, C<futime> 25similar functions, as well as less rarely ones such as C<mknod>, C<futime>
26or C<readlink>. 26or C<readlink>.
27 27
28It also offers wrappers around C<sendfile> (Solaris, Linux, HP-UX and 28It also offers wrappers around C<sendfile> (Solaris, Linux, HP-UX and
29FreeBSD, with emulation on other platforms) and C<readahead> (Linux, with 29FreeBSD, with emulation on other platforms) and C<readahead> (Linux, with
30emulation elsewhere>). 30emulation elsewhere).
31 31
32The goal is to enable you to write fully non-blocking programs. For 32The goal is to enable you to write fully non-blocking programs. For
33example, in a game server, you would not want to freeze for a few seconds 33example, in a game server, you would not want to freeze for a few seconds
34just because the server is running a backup and you happen to call 34just because the server is running a backup and you happen to call
35C<readdir>. 35C<readdir>.
45Unlike the name component C<stamp> might indicate, it is also used for 45Unlike the name component C<stamp> might indicate, it is also used for
46time differences throughout libeio. 46time differences throughout libeio.
47 47
48=head2 FORK SUPPORT 48=head2 FORK SUPPORT
49 49
50Calling C<fork ()> is fully supported by this module. It is implemented in these steps: 50Usage of pthreads in a program changes the semantics of fork
51considerably. Specifically, only async-safe functions can be called after
52fork. Libeio uses pthreads, so this applies, and makes using fork hard for
53anything but relatively fork + exec uses.
51 54
52 1. wait till all requests in "execute" state have been handled 55This library only works in the process that initialised it: Forking is
53 (basically requests that are already handed over to the kernel). 56fully supported, but using libeio in any other process than the one that
54 2. fork 57called C<eio_init> is not.
55 3. in the parent, continue business as usual, done
56 4. in the child, destroy all ready and pending requests and free the
57 memory used by the worker threads. This gives you a fully empty
58 libeio queue.
59 58
60Note, however, since libeio does use threads, thr above guarantee doesn't 59You might get around by not I<using> libeio before (or after) forking in
61cover your libc, for example, malloc and other libc functions are not 60the parent, and using it in the child afterwards. You could also try to
62fork-safe, so there is very little you can do after a fork, and in fatc, 61call the L<eio_init> function again in the child, which will brutally
63the above might crash, and thus change. 62reinitialise all data structures, which isn't POSIX conformant, but
63typically works.
64
65Otherwise, the only recommendation you should follow is: treat fork code
66the same way you treat signal handlers, and only ever call C<eio_init> in
67the process that uses it, and only once ever.
64 68
65=head1 INITIALISATION/INTEGRATION 69=head1 INITIALISATION/INTEGRATION
66 70
67Before you can call any eio functions you first have to initialise the 71Before you can call any eio functions you first have to initialise the
68library. The library integrates into any event loop, but can also be used 72library. The library integrates into any event loop, but can also be used
77This function initialises the library. On success it returns C<0>, on 81This function initialises the library. On success it returns C<0>, on
78failure it returns C<-1> and sets C<errno> appropriately. 82failure it returns C<-1> and sets C<errno> appropriately.
79 83
80It accepts two function pointers specifying callbacks as argument, both of 84It accepts two function pointers specifying callbacks as argument, both of
81which can be C<0>, in which case the callback isn't called. 85which can be C<0>, in which case the callback isn't called.
86
87There is currently no way to change these callbacks later, or to
88"uninitialise" the library again.
82 89
83=item want_poll callback 90=item want_poll callback
84 91
85The C<want_poll> callback is invoked whenever libeio wants attention (i.e. 92The C<want_poll> callback is invoked whenever libeio wants attention (i.e.
86it wants to be polled by calling C<eio_poll>). It is "edge-triggered", 93it wants to be polled by calling C<eio_poll>). It is "edge-triggered",
97This callback is invoked when libeio detects that all pending requests 104This callback is invoked when libeio detects that all pending requests
98have been handled. It is "edge-triggered", that is, it will only be 105have been handled. It is "edge-triggered", that is, it will only be
99called once after C<want_poll>. To put it differently, C<want_poll> and 106called once after C<want_poll>. To put it differently, C<want_poll> and
100C<done_poll> are invoked in pairs: after C<want_poll> you have to call 107C<done_poll> are invoked in pairs: after C<want_poll> you have to call
101C<eio_poll ()> until either C<eio_poll> indicates that everything has been 108C<eio_poll ()> until either C<eio_poll> indicates that everything has been
102handled or C<done_poll> has been called, which signals the same. 109handled or C<done_poll> has been called, which signals the same - only one
110method is needed.
103 111
104Note that C<eio_poll> might return after C<done_poll> and C<want_poll> 112Note that C<eio_poll> might return after C<done_poll> and C<want_poll>
105have been called again, so watch out for races in your code. 113have been called again, so watch out for races in your code.
106 114
115It is quite common to have an empty C<done_call> callback and only use
116the return value from C<eio_poll>, or, when C<eio_poll> is configured to
117handle all outstanding replies, it's enough to call C<eio_poll> once.
118
107As with C<want_poll>, this callback is called while locks are being held, 119As with C<want_poll>, this callback is called while locks are being held,
108so you I<must not call any libeio functions form within this callback>. 120so you I<must not call any libeio functions from within this callback>.
109 121
110=item int eio_poll () 122=item int eio_poll ()
111 123
112This function has to be called whenever there are pending requests that 124This function has to be called whenever there are pending requests that
113need finishing. You usually call this after C<want_poll> has indicated 125need finishing. You usually call this after C<want_poll> has indicated
124=back 136=back
125 137
126For libev, you would typically use an C<ev_async> watcher: the 138For libev, you would typically use an C<ev_async> watcher: the
127C<want_poll> callback would invoke C<ev_async_send> to wake up the event 139C<want_poll> callback would invoke C<ev_async_send> to wake up the event
128loop. Inside the callback set for the watcher, one would call C<eio_poll 140loop. Inside the callback set for the watcher, one would call C<eio_poll
129()> (followed by C<ev_async_send> again if C<eio_poll> indicates that not 141()>.
130all requests have been handled yet). The race is taken care of because 142
131libev resets/rearms the async watcher before calling your callback, 143If C<eio_poll ()> is configured to not handle all results in one go
132and therefore, before calling C<eio_poll>. This might result in (some) 144(i.e. it returns C<-1>) then you should start an idle watcher that calls
133spurious wake-ups, but is generally harmless. 145C<eio_poll> until it returns something C<!= -1>.
146
147A full-featured connector between libeio and libev would look as follows
148(if C<eio_poll> is handling all requests, it can of course be simplified a
149lot by removing the idle watcher logic):
150
151 static struct ev_loop *loop;
152 static ev_idle repeat_watcher;
153 static ev_async ready_watcher;
154
155 /* idle watcher callback, only used when eio_poll */
156 /* didn't handle all results in one call */
157 static void
158 repeat (EV_P_ ev_idle *w, int revents)
159 {
160 if (eio_poll () != -1)
161 ev_idle_stop (EV_A_ w);
162 }
163
164 /* eio has some results, process them */
165 static void
166 ready (EV_P_ ev_async *w, int revents)
167 {
168 if (eio_poll () == -1)
169 ev_idle_start (EV_A_ &repeat_watcher);
170 }
171
172 /* wake up the event loop */
173 static void
174 want_poll (void)
175 {
176 ev_async_send (loop, &ready_watcher)
177 }
178
179 void
180 my_init_eio ()
181 {
182 loop = EV_DEFAULT;
183
184 ev_idle_init (&repeat_watcher, repeat);
185 ev_async_init (&ready_watcher, ready);
186 ev_async_start (loop, &watcher);
187
188 eio_init (want_poll, 0);
189 }
134 190
135For most other event loops, you would typically use a pipe - the event 191For most other event loops, you would typically use a pipe - the event
136loop should be told to wait for read readiness on the read end. In 192loop should be told to wait for read readiness on the read end. In
137C<want_poll> you would write a single byte, in C<done_poll> you would try 193C<want_poll> you would write a single byte, in C<done_poll> you would try
138to read that byte, and in the callback for the read end, you would call 194to read that byte, and in the callback for the read end, you would call
139C<eio_poll>. The race is avoided here because the event loop should invoke 195C<eio_poll>.
140your callback again and again until the byte has been read (as the pipe 196
141read callback does not read it, only C<done_poll>). 197You don't have to take special care in the case C<eio_poll> doesn't handle
198all requests, as the done callback will not be invoked, so the event loop
199will still signal readiness for the pipe until I<all> results have been
200processed.
142 201
143 202
144=head1 HIGH LEVEL REQUEST API 203=head1 HIGH LEVEL REQUEST API
145 204
146Libeio has both a high-level API, which consists of calling a request 205Libeio has both a high-level API, which consists of calling a request
153 212
154You submit a request by calling the relevant C<eio_TYPE> function with the 213You submit a request by calling the relevant C<eio_TYPE> function with the
155required parameters, a callback of type C<int (*eio_cb)(eio_req *req)> 214required parameters, a callback of type C<int (*eio_cb)(eio_req *req)>
156(called C<eio_cb> below) and a freely usable C<void *data> argument. 215(called C<eio_cb> below) and a freely usable C<void *data> argument.
157 216
158The return value will either be 0 217The return value will either be 0, in case something went really wrong
218(which can basically only happen on very fatal errors, such as C<malloc>
219returning 0, which is rather unlikely), or a pointer to the newly-created
220and submitted C<eio_req *>.
159 221
160The callback will be called with an C<eio_req *> which contains the 222The callback will be called with an C<eio_req *> which contains the
161results of the request. The members you can access inside that structure 223results of the request. The members you can access inside that structure
162vary from request to request, except for: 224vary from request to request, except for:
163 225
175=item C<void *data> 237=item C<void *data>
176 238
177The C<void *data> member simply stores the value of the C<data> argument. 239The C<void *data> member simply stores the value of the C<data> argument.
178 240
179=back 241=back
242
243Members not explicitly described as accessible must not be
244accessed. Specifically, there is no guarantee that any members will still
245have the value they had when the request was submitted.
180 246
181The return value of the callback is normally C<0>, which tells libeio to 247The return value of the callback is normally C<0>, which tells libeio to
182continue normally. If a callback returns a nonzero value, libeio will 248continue normally. If a callback returns a nonzero value, libeio will
183stop processing results (in C<eio_poll>) and will return the value to its 249stop processing results (in C<eio_poll>) and will return the value to its
184caller. 250caller.
185 251
186Memory areas passed to libeio must stay valid as long as a request 252Memory areas passed to libeio wrappers must stay valid as long as a
187executes, with the exception of paths, which are being copied 253request executes, with the exception of paths, which are being copied
188internally. Any memory libeio itself allocates will be freed after the 254internally. Any memory libeio itself allocates will be freed after the
189finish callback has been called. If you want to manage all memory passed 255finish callback has been called. If you want to manage all memory passed
190to libeio yourself you can use the low-level API. 256to libeio yourself you can use the low-level API.
191 257
192For example, to open a file, you could do this: 258For example, to open a file, you could do this:
210 } 276 }
211 277
212 /* the first three arguments are passed to open(2) */ 278 /* the first three arguments are passed to open(2) */
213 /* the remaining are priority, callback and data */ 279 /* the remaining are priority, callback and data */
214 if (!eio_open ("/etc/passwd", O_RDONLY, 0, 0, file_open_done, 0)) 280 if (!eio_open ("/etc/passwd", O_RDONLY, 0, 0, file_open_done, 0))
215 abort (); /* something ent wrong, we will all die!!! */ 281 abort (); /* something went wrong, we will all die!!! */
216 282
217Note that you additionally need to call C<eio_poll> when the C<want_cb> 283Note that you additionally need to call C<eio_poll> when the C<want_cb>
218indicates that requests are ready to be processed. 284indicates that requests are ready to be processed.
285
286=head2 CANCELLING REQUESTS
287
288Sometimes the need for a request goes away before the request is
289finished. In that case, one can cancel the request by a call to
290C<eio_cancel>:
291
292=over 4
293
294=item eio_cancel (eio_req *req)
295
296Cancel the request (and all its subrequests). If the request is currently
297executing it might still continue to execute, and in other cases it might
298still take a while till the request is cancelled.
299
300When cancelled, the finish callback will not be invoked.
301
302C<EIO_CANCELLED> is still true for requests that have successfully
303executed, as long as C<eio_cancel> was called on them at some point.
304
305=back
219 306
220=head2 AVAILABLE REQUESTS 307=head2 AVAILABLE REQUESTS
221 308
222The following request functions are available. I<All> of them return the 309The following request functions are available. I<All> of them return the
223C<eio_req *> on success and C<0> on failure, and I<all> of them have the 310C<eio_req *> on success and C<0> on failure, and I<all> of them have the
226custom data value as C<data>. 313custom data value as C<data>.
227 314
228=head3 POSIX API WRAPPERS 315=head3 POSIX API WRAPPERS
229 316
230These requests simply wrap the POSIX call of the same name, with the same 317These requests simply wrap the POSIX call of the same name, with the same
231arguments: 318arguments. If a function is not implemented by the OS and cannot be emulated
319in some way, then all of these return C<-1> and set C<errorno> to C<ENOSYS>.
232 320
233=over 4 321=over 4
234 322
235=item eio_open (const char *path, int flags, mode_t mode, int pri, eio_cb cb, void *data) 323=item eio_open (const char *path, int flags, mode_t mode, int pri, eio_cb cb, void *data)
236 324
325=item eio_truncate (const char *path, off_t offset, int pri, eio_cb cb, void *data)
326
327=item eio_chown (const char *path, uid_t uid, gid_t gid, int pri, eio_cb cb, void *data)
328
329=item eio_chmod (const char *path, mode_t mode, int pri, eio_cb cb, void *data)
330
331=item eio_mkdir (const char *path, mode_t mode, int pri, eio_cb cb, void *data)
332
333=item eio_rmdir (const char *path, int pri, eio_cb cb, void *data)
334
335=item eio_unlink (const char *path, int pri, eio_cb cb, void *data)
336
237=item eio_utime (const char *path, eio_tstamp atime, eio_tstamp mtime, int pri, eio_cb cb, void *data) 337=item eio_utime (const char *path, eio_tstamp atime, eio_tstamp mtime, int pri, eio_cb cb, void *data)
238 338
239=item eio_truncate (const char *path, off_t offset, int pri, eio_cb cb, void *data)
240
241=item eio_chown (const char *path, uid_t uid, gid_t gid, int pri, eio_cb cb, void *data)
242
243=item eio_chmod (const char *path, mode_t mode, int pri, eio_cb cb, void *data)
244
245=item eio_mkdir (const char *path, mode_t mode, int pri, eio_cb cb, void *data)
246
247=item eio_rmdir (const char *path, int pri, eio_cb cb, void *data)
248
249=item eio_unlink (const char *path, int pri, eio_cb cb, void *data)
250
251=item eio_readlink (const char *path, int pri, eio_cb cb, void *data) /* result=ptr2 allocated dynamically */
252
253=item eio_stat (const char *path, int pri, eio_cb cb, void *data) /* stat buffer=ptr2 allocated dynamically */
254
255=item eio_lstat (const char *path, int pri, eio_cb cb, void *data) /* stat buffer=ptr2 allocated dynamically */
256
257=item eio_statvfs (const char *path, int pri, eio_cb cb, void *data) /* stat buffer=ptr2 allocated dynamically */
258
259=item eio_mknod (const char *path, mode_t mode, dev_t dev, int pri, eio_cb cb, void *data) 339=item eio_mknod (const char *path, mode_t mode, dev_t dev, int pri, eio_cb cb, void *data)
260 340
261=item eio_link (const char *path, const char *new_path, int pri, eio_cb cb, void *data) 341=item eio_link (const char *path, const char *new_path, int pri, eio_cb cb, void *data)
262 342
263=item eio_symlink (const char *path, const char *new_path, int pri, eio_cb cb, void *data) 343=item eio_symlink (const char *path, const char *new_path, int pri, eio_cb cb, void *data)
264 344
265=item eio_rename (const char *path, const char *new_path, int pri, eio_cb cb, void *data) 345=item eio_rename (const char *path, const char *new_path, int pri, eio_cb cb, void *data)
266 346
267=item eio_msync (void *addr, size_t length, int flags, int pri, eio_cb cb, void *data)
268
269=item eio_mlock (void *addr, size_t length, int pri, eio_cb cb, void *data) 347=item eio_mlock (void *addr, size_t length, int pri, eio_cb cb, void *data)
270
271=item eio_mlockall (int flags, int pri, eio_cb cb, void *data)
272 348
273=item eio_close (int fd, int pri, eio_cb cb, void *data) 349=item eio_close (int fd, int pri, eio_cb cb, void *data)
274 350
275=item eio_sync (int pri, eio_cb cb, void *data) 351=item eio_sync (int pri, eio_cb cb, void *data)
276 352
305 381
306Not surprisingly, pread and pwrite are not thread-safe on Darwin (OS/X), 382Not surprisingly, pread and pwrite are not thread-safe on Darwin (OS/X),
307so it is advised not to submit multiple requests on the same fd on this 383so it is advised not to submit multiple requests on the same fd on this
308horrible pile of garbage. 384horrible pile of garbage.
309 385
386=item eio_mlockall (int flags, int pri, eio_cb cb, void *data)
387
388Like C<mlockall>, but the flag value constants are called
389C<EIO_MCL_CURRENT> and C<EIO_MCL_FUTURE>.
390
391=item eio_msync (void *addr, size_t length, int flags, int pri, eio_cb cb, void *data)
392
393Just like msync, except that the flag values are called C<EIO_MS_ASYNC>,
394C<EIO_MS_INVALIDATE> and C<EIO_MS_SYNC>.
395
396=item eio_readlink (const char *path, int pri, eio_cb cb, void *data)
397
398If successful, the path read by C<readlink(2)> can be accessed via C<<
399req->ptr2 >> and is I<NOT> null-terminated, with the length specified as
400C<< req->result >>.
401
402 if (req->result >= 0)
403 {
404 char *target = strndup ((char *)req->ptr2, req->result);
405
406 free (target);
407 }
408
409=item eio_realpath (const char *path, int pri, eio_cb cb, void *data)
410
411Similar to the realpath libc function, but unlike that one, C<<
412req->result >> is C<-1> on failure. On success, the result is the length
413of the returned path in C<ptr2> (which is I<NOT> 0-terminated) - this is
414similar to readlink.
415
416=item eio_stat (const char *path, int pri, eio_cb cb, void *data)
417
418=item eio_lstat (const char *path, int pri, eio_cb cb, void *data)
419
310=item eio_fstat (int fd, int pri, eio_cb cb, void *data) 420=item eio_fstat (int fd, int pri, eio_cb cb, void *data)
311 421
312Stats a file - if C<< req->result >> indicates success, then you can 422Stats a file - if C<< req->result >> indicates success, then you can
313access the C<struct stat>-like structure via C<< req->ptr2 >>: 423access the C<struct stat>-like structure via C<< req->ptr2 >>:
314 424
315 EIO_STRUCT_STAT *statdata = (EIO_STRUCT_STAT *)req->ptr2; 425 EIO_STRUCT_STAT *statdata = (EIO_STRUCT_STAT *)req->ptr2;
316 426
317=item eio_fstatvfs (int fd, int pri, eio_cb cb, void *data) /* stat buffer=ptr2 allocated dynamically */ 427=item eio_statvfs (const char *path, int pri, eio_cb cb, void *data)
428
429=item eio_fstatvfs (int fd, int pri, eio_cb cb, void *data)
318 430
319Stats a filesystem - if C<< req->result >> indicates success, then you can 431Stats a filesystem - if C<< req->result >> indicates success, then you can
320access the C<struct statvfs>-like structure via C<< req->ptr2 >>: 432access the C<struct statvfs>-like structure via C<< req->ptr2 >>:
321 433
322 EIO_STRUCT_STATVFS *statdata = (EIO_STRUCT_STATVFS *)req->ptr2; 434 EIO_STRUCT_STATVFS *statdata = (EIO_STRUCT_STATVFS *)req->ptr2;
323 435
324=back 436=back
325 437
326=head3 READING DIRECTORIES 438=head3 READING DIRECTORIES
327 439
328Reading directories sounds simple, but can be rather demanding, especially 440Reading directories sounds simple, but can be rather demanding, especially
329if you want to do stuff such as traversing a diretcory hierarchy or 441if you want to do stuff such as traversing a directory hierarchy or
330processing all files in a directory. Libeio can assist thess complex tasks 442processing all files in a directory. Libeio can assist these complex tasks
331with it's C<eio_readdir> call. 443with it's C<eio_readdir> call.
332 444
333=over 4 445=over 4
334 446
335=item eio_readdir (const char *path, int flags, int pri, eio_cb cb, void *data) 447=item eio_readdir (const char *path, int flags, int pri, eio_cb cb, void *data)
338(via the C<opendir>, C<readdir> and C<closedir> calls) and returns either 450(via the C<opendir>, C<readdir> and C<closedir> calls) and returns either
339the names or an array of C<struct eio_dirent>, depending on the C<flags> 451the names or an array of C<struct eio_dirent>, depending on the C<flags>
340argument. 452argument.
341 453
342The C<< req->result >> indicates either the number of files found, or 454The C<< req->result >> indicates either the number of files found, or
343C<-1> on error. On success, zero-terminated names can be found as C<< req->ptr2 >>, 455C<-1> on error. On success, null-terminated names can be found as C<< req->ptr2 >>,
344and C<struct eio_dirents>, if requested by C<flags>, can be found via C<< 456and C<struct eio_dirents>, if requested by C<flags>, can be found via C<<
345req->ptr1 >>. 457req->ptr1 >>.
346 458
347Here is an example that prints all the names: 459Here is an example that prints all the names:
348 460
367 479
368If this flag is specified, then, in addition to the names in C<ptr2>, 480If this flag is specified, then, in addition to the names in C<ptr2>,
369also an array of C<struct eio_dirent> is returned, in C<ptr1>. A C<struct 481also an array of C<struct eio_dirent> is returned, in C<ptr1>. A C<struct
370eio_dirent> looks like this: 482eio_dirent> looks like this:
371 483
372 struct eio_dirent 484 struct eio_dirent
373 { 485 {
374 int nameofs; /* offset of null-terminated name string in (char *)req->ptr2 */ 486 int nameofs; /* offset of null-terminated name string in (char *)req->ptr2 */
375 unsigned short namelen; /* size of filename without trailing 0 */ 487 unsigned short namelen; /* size of filename without trailing 0 */
376 unsigned char type; /* one of EIO_DT_* */ 488 unsigned char type; /* one of EIO_DT_* */
377 signed char score; /* internal use */ 489 signed char score; /* internal use */
378 ino_t inode; /* the inode number, if available, otherwise unspecified */ 490 ino_t inode; /* the inode number, if available, otherwise unspecified */
379 }; 491 };
380 492
381The only members you normally would access are C<nameofs>, which is the 493The only members you normally would access are C<nameofs>, which is the
382byte-offset from C<ptr2> to the start of the name, C<namelen> and C<type>. 494byte-offset from C<ptr2> to the start of the name, C<namelen> and C<type>.
383 495
384C<type> can be one of: 496C<type> can be one of:
427When this flag is specified, then the names will be returned in an order 539When this flag is specified, then the names will be returned in an order
428suitable for stat()'ing each one. That is, when you plan to stat() 540suitable for stat()'ing each one. That is, when you plan to stat()
429all files in the given directory, then the returned order will likely 541all files in the given directory, then the returned order will likely
430be fastest. 542be fastest.
431 543
432If both this flag and C<EIO_READDIR_DIRS_FIRST> are specified, then 544If both this flag and C<EIO_READDIR_DIRS_FIRST> are specified, then the
433the likely dirs come first, resulting in a less optimal stat order. 545likely directories come first, resulting in a less optimal stat order.
434 546
435=item EIO_READDIR_FOUND_UNKNOWN 547=item EIO_READDIR_FOUND_UNKNOWN
436 548
437This flag should not be specified when calling C<eio_readdir>. Instead, 549This flag should not be specified when calling C<eio_readdir>. Instead,
438it is being set by C<eio_readdir> (you can access the C<flags> via C<< 550it is being set by C<eio_readdir> (you can access the C<flags> via C<<
439req->int1 >>, when any of the C<type>'s found were C<EIO_DT_UNKNOWN>. The 551req->int1 >>, when any of the C<type>'s found were C<EIO_DT_UNKNOWN>. The
440absense of this flag therefore indicates that all C<type>'s are known, 552absence of this flag therefore indicates that all C<type>'s are known,
441which can be used to speed up some algorithms. 553which can be used to speed up some algorithms.
442 554
443A typical use case would be to identify all subdirectories within a 555A typical use case would be to identify all subdirectories within a
444directory - you would ask C<eio_readdir> for C<EIO_READDIR_DIRS_FIRST>. If 556directory - you would ask C<eio_readdir> for C<EIO_READDIR_DIRS_FIRST>. If
445then this flag is I<NOT> set, then all the entries at the beginning of the 557then this flag is I<NOT> set, then all the entries at the beginning of the
475=item eio_readahead (int fd, off_t offset, size_t length, int pri, eio_cb cb, void *data) 587=item eio_readahead (int fd, off_t offset, size_t length, int pri, eio_cb cb, void *data)
476 588
477Calls C<readahead(2)>. If the syscall is missing, then the call is 589Calls C<readahead(2)>. If the syscall is missing, then the call is
478emulated by simply reading the data (currently in 64kiB chunks). 590emulated by simply reading the data (currently in 64kiB chunks).
479 591
592=item eio_syncfs (int fd, int pri, eio_cb cb, void *data)
593
594Calls Linux' C<syncfs> syscall, if available. Returns C<-1> and sets
595C<errno> to C<ENOSYS> if the call is missing I<but still calls sync()>,
596if the C<fd> is C<< >= 0 >>, so you can probe for the availability of the
597syscall with a negative C<fd> argument and checking for C<-1/ENOSYS>.
598
480=item eio_sync_file_range (int fd, off_t offset, size_t nbytes, unsigned int flags, int pri, eio_cb cb, void *data) 599=item eio_sync_file_range (int fd, off_t offset, size_t nbytes, unsigned int flags, int pri, eio_cb cb, void *data)
481 600
482Calls C<sync_file_range>. If the syscall is missing, then this is the same 601Calls C<sync_file_range>. If the syscall is missing, then this is the same
483as calling C<fdatasync>. 602as calling C<fdatasync>.
484 603
604Flags can be any combination of C<EIO_SYNC_FILE_RANGE_WAIT_BEFORE>,
605C<EIO_SYNC_FILE_RANGE_WRITE> and C<EIO_SYNC_FILE_RANGE_WAIT_AFTER>.
606
607=item eio_fallocate (int fd, int mode, off_t offset, off_t len, int pri, eio_cb cb, void *data)
608
609Calls C<fallocate> (note: I<NOT> C<posix_fallocate>!). If the syscall is
610missing, then it returns failure and sets C<errno> to C<ENOSYS>.
611
612The C<mode> argument can be C<0> (for behaviour similar to
613C<posix_fallocate>), or C<EIO_FALLOC_FL_KEEP_SIZE>, which keeps the size
614of the file unchanged (but still preallocates space beyond end of file).
615
485=back 616=back
486 617
487=head3 LIBEIO-SPECIFIC REQUESTS 618=head3 LIBEIO-SPECIFIC REQUESTS
488 619
489These requests are specific to libeio and do not correspond to any OS call. 620These requests are specific to libeio and do not correspond to any OS call.
490 621
491=over 4 622=over 4
492 623
493=item eio_mtouch (void *addr, size_t length, int flags, int pri, eio_cb cb, void *data) 624=item eio_mtouch (void *addr, size_t length, int flags, int pri, eio_cb cb, void *data)
494 625
626Reads (C<flags == 0>) or modifies (C<flags == EIO_MT_MODIFY>) the given
627memory area, page-wise, that is, it reads (or reads and writes back) the
628first octet of every page that spans the memory area.
629
630This can be used to page in some mmapped file, or dirty some pages. Note
631that dirtying is an unlocked read-write access, so races can ensue when
632the some other thread modifies the data stored in that memory area.
633
495=item eio_custom (void (*)(eio_req *) execute, int pri, eio_cb cb, void *data) 634=item eio_custom (void (*)(eio_req *) execute, int pri, eio_cb cb, void *data)
496 635
497Executes a custom request, i.e., a user-specified callback. 636Executes a custom request, i.e., a user-specified callback.
498 637
499The callback gets the C<eio_req *> as parameter and is expected to read 638The callback gets the C<eio_req *> as parameter and is expected to read
500and modify any request-specific members. Specifically, it should set C<< 639and modify any request-specific members. Specifically, it should set C<<
520 req->result = open (req->data, O_RDONLY); 659 req->result = open (req->data, O_RDONLY);
521 } 660 }
522 661
523 eio_custom (my_open, 0, my_open_done, "/etc/passwd"); 662 eio_custom (my_open, 0, my_open_done, "/etc/passwd");
524 663
525=item eio_busy (eio_tstamp delay, int pri, eio_cb cb, void *data) 664=item eio_busy (eio_tstamp delay, int pri, eio_cb cb, void *data)
526 665
527This is a a request that takes C<delay> seconds to execute, but otherwise 666This is a request that takes C<delay> seconds to execute, but otherwise
528does nothing - it simply puts one of the worker threads to sleep for this 667does nothing - it simply puts one of the worker threads to sleep for this
529long. 668long.
530 669
531This request can be used to artificially increase load, e.g. for debugging 670This request can be used to artificially increase load, e.g. for debugging
532or benchmarking reasons. 671or benchmarking reasons.
533 672
534=item eio_nop (int pri, eio_cb cb, void *data) 673=item eio_nop (int pri, eio_cb cb, void *data)
535 674
536This request does nothing, except go through the whole request cycle. This 675This request does nothing, except go through the whole request cycle. This
537can be used to measure latency or in some cases to simplify code, but is 676can be used to measure latency or in some cases to simplify code, but is
538not really of much use. 677not really of much use.
539 678
540=back 679=back
541 680
542=head3 GROUPING AND LIMITING REQUESTS 681=head3 GROUPING AND LIMITING REQUESTS
543 682
683There is one more rather special request, C<eio_grp>. It is a very special
684aio request: Instead of doing something, it is a container for other eio
685requests.
686
687There are two primary use cases for this: a) bundle many requests into a
688single, composite, request with a definite callback and the ability to
689cancel the whole request with its subrequests and b) limiting the number
690of "active" requests.
691
692Further below you will find more discussion of these topics - first
693follows the reference section detailing the request generator and other
694methods.
695
696=over 4
697
698=item eio_req *grp = eio_grp (eio_cb cb, void *data)
699
700Creates, submits and returns a group request. Note that it doesn't have a
701priority, unlike all other requests.
702
703=item eio_grp_add (eio_req *grp, eio_req *req)
704
705Adds a request to the request group.
706
707=item eio_grp_cancel (eio_req *grp)
708
709Cancels all requests I<in> the group, but I<not> the group request
710itself. You can cancel the group request I<and> all subrequests via a
711normal C<eio_cancel> call.
712
713=back
714
715=head4 GROUP REQUEST LIFETIME
716
717Left alone, a group request will instantly move to the pending state and
718will be finished at the next call of C<eio_poll>.
719
720The usefulness stems from the fact that, if a subrequest is added to a
721group I<before> a call to C<eio_poll>, via C<eio_grp_add>, then the group
722will not finish until all the subrequests have finished.
723
724So the usage cycle of a group request is like this: after it is created,
725you normally instantly add a subrequest. If none is added, the group
726request will finish on it's own. As long as subrequests are added before
727the group request is finished it will be kept from finishing, that is the
728callbacks of any subrequests can, in turn, add more requests to the group,
729and as long as any requests are active, the group request itself will not
730finish.
731
732=head4 CREATING COMPOSITE REQUESTS
733
734Imagine you wanted to create an C<eio_load> request that opens a file,
735reads it and closes it. This means it has to execute at least three eio
736requests, but for various reasons it might be nice if that request looked
737like any other eio request.
738
739This can be done with groups:
740
741=over 4
742
743=item 1) create the request object
744
745Create a group that contains all further requests. This is the request you
746can return as "the load request".
747
748=item 2) open the file, maybe
749
750Next, open the file with C<eio_open> and add the request to the group
751request and you are finished setting up the request.
752
753If, for some reason, you cannot C<eio_open> (path is a null ptr?) you
754can set C<< grp->result >> to C<-1> to signal an error and let the group
755request finish on its own.
756
757=item 3) open callback adds more requests
758
759In the open callback, if the open was not successful, copy C<<
760req->errorno >> to C<< grp->errorno >> and set C<< grp->result >> to
761C<-1> to signal an error.
762
763Otherwise, malloc some memory or so and issue a read request, adding the
764read request to the group.
765
766=item 4) continue issuing requests till finished
767
768In the read callback, check for errors and possibly continue with
769C<eio_close> or any other eio request in the same way.
770
771As soon as no new requests are added, the group request will finish. Make
772sure you I<always> set C<< grp->result >> to some sensible value.
773
774=back
775
776=head4 REQUEST LIMITING
777
778
544#TODO 779#TODO
545 780
546/*****************************************************************************/
547/* groups */
548
549eio_req *eio_grp (eio_cb cb, void *data);
550void eio_grp_feed (eio_req *grp, void (*feed)(eio_req *req), int limit);
551void eio_grp_limit (eio_req *grp, int limit); 781void eio_grp_limit (eio_req *grp, int limit);
552void eio_grp_add (eio_req *grp, eio_req *req);
553void eio_grp_cancel (eio_req *grp); /* cancels all sub requests but not the group */
554 782
555
556=back
557 783
558 784
559=head1 LOW LEVEL REQUEST API 785=head1 LOW LEVEL REQUEST API
560 786
561#TODO 787#TODO
564=head1 ANATOMY AND LIFETIME OF AN EIO REQUEST 790=head1 ANATOMY AND LIFETIME OF AN EIO REQUEST
565 791
566A request is represented by a structure of type C<eio_req>. To initialise 792A request is represented by a structure of type C<eio_req>. To initialise
567it, clear it to all zero bytes: 793it, clear it to all zero bytes:
568 794
569 eio_req req; 795 eio_req req;
570 796
571 memset (&req, 0, sizeof (req)); 797 memset (&req, 0, sizeof (req));
572 798
573A more common way to initialise a new C<eio_req> is to use C<calloc>: 799A more common way to initialise a new C<eio_req> is to use C<calloc>:
574 800
575 eio_req *req = calloc (1, sizeof (*req)); 801 eio_req *req = calloc (1, sizeof (*req));
576 802
577In either case, libeio neither allocates, initialises or frees the 803In either case, libeio neither allocates, initialises or frees the
578C<eio_req> structure for you - it merely uses it. 804C<eio_req> structure for you - it merely uses it.
579 805
580zero 806zero
598for example, in interactive programs, you might want to limit this time to 824for example, in interactive programs, you might want to limit this time to
599C<0.01> seconds or so. 825C<0.01> seconds or so.
600 826
601Note that: 827Note that:
602 828
829=over 4
830
603a) libeio doesn't know how long your request callbacks take, so the time 831=item a) libeio doesn't know how long your request callbacks take, so the
604spent in C<eio_poll> is up to one callback invocation longer then this 832time spent in C<eio_poll> is up to one callback invocation longer then
605interval. 833this interval.
606 834
607b) this is implemented by calling C<gettimeofday> after each request, 835=item b) this is implemented by calling C<gettimeofday> after each
608which can be costly. 836request, which can be costly.
609 837
610c) at least one request will be handled. 838=item c) at least one request will be handled.
839
840=back
611 841
612=item eio_set_max_poll_reqs (unsigned int nreqs) 842=item eio_set_max_poll_reqs (unsigned int nreqs)
613 843
614When C<nreqs> is non-zero, then C<eio_poll> will not handle more than 844When C<nreqs> is non-zero, then C<eio_poll> will not handle more than
615C<nreqs> requests per invocation. This is a less costly way to limit the 845C<nreqs> requests per invocation. This is a less costly way to limit the
685This symbol governs the stack size for each eio thread. Libeio itself 915This symbol governs the stack size for each eio thread. Libeio itself
686was written to use very little stackspace, but when using C<EIO_CUSTOM> 916was written to use very little stackspace, but when using C<EIO_CUSTOM>
687requests, you might want to increase this. 917requests, you might want to increase this.
688 918
689If this symbol is undefined (the default) then libeio will use its default 919If this symbol is undefined (the default) then libeio will use its default
690stack size (C<sizeof (long) * 4096> currently). If it is defined, but 920stack size (C<sizeof (void *) * 4096> currently). In all other cases, the
691C<0>, then the default operating system stack size will be used. In all
692other cases, the value must be an expression that evaluates to the desired 921value must be an expression that evaluates to the desired stack size.
693stack size.
694 922
695=back 923=back
696 924
697 925
698=head1 PORTABILITY REQUIREMENTS 926=head1 PORTABILITY REQUIREMENTS

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines