ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libeio/eio.pod
Revision: 1.16
Committed: Tue Jul 5 17:05:54 2011 UTC (12 years, 10 months ago) by root
Branch: MAIN
Changes since 1.15: +9 -6 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3     libeio - truly asynchronous POSIX I/O
4    
5     =head1 SYNOPSIS
6    
7     #include <eio.h>
8    
9     =head1 DESCRIPTION
10    
11     The newest version of this document is also available as an html-formatted
12     web page you might find easier to navigate when reading it for the first
13     time: L<http://pod.tst.eu/http://cvs.schmorp.de/libeio/eio.pod>.
14    
15     Note that this library is a by-product of the C<IO::AIO> perl
16 sf-exg 1.6 module, and many of the subtler points regarding requests lifetime
17 root 1.1 and so on are only documented in its documentation at the
18     moment: L<http://pod.tst.eu/http://cvs.schmorp.de/IO-AIO/AIO.pm>.
19    
20     =head2 FEATURES
21    
22     This library provides fully asynchronous versions of most POSIX functions
23 sf-exg 1.6 dealing with I/O. Unlike most asynchronous libraries, this not only
24 root 1.1 includes C<read> and C<write>, but also C<open>, C<stat>, C<unlink> and
25     similar functions, as well as less rarely ones such as C<mknod>, C<futime>
26     or C<readlink>.
27    
28     It also offers wrappers around C<sendfile> (Solaris, Linux, HP-UX and
29     FreeBSD, with emulation on other platforms) and C<readahead> (Linux, with
30     emulation elsewhere>).
31    
32 root 1.5 The goal is to enable you to write fully non-blocking programs. For
33 root 1.1 example, in a game server, you would not want to freeze for a few seconds
34     just because the server is running a backup and you happen to call
35     C<readdir>.
36    
37     =head2 TIME REPRESENTATION
38    
39     Libeio represents time as a single floating point number, representing the
40     (fractional) number of seconds since the (POSIX) epoch (somewhere near
41     the beginning of 1970, details are complicated, don't ask). This type is
42 sf-exg 1.6 called C<eio_tstamp>, but it is guaranteed to be of type C<double> (or
43 root 1.1 better), so you can freely use C<double> yourself.
44    
45     Unlike the name component C<stamp> might indicate, it is also used for
46     time differences throughout libeio.
47    
48     =head2 FORK SUPPORT
49    
50     Calling C<fork ()> is fully supported by this module. It is implemented in these steps:
51    
52     1. wait till all requests in "execute" state have been handled
53     (basically requests that are already handed over to the kernel).
54     2. fork
55     3. in the parent, continue business as usual, done
56     4. in the child, destroy all ready and pending requests and free the
57     memory used by the worker threads. This gives you a fully empty
58     libeio queue.
59    
60 root 1.7 Note, however, since libeio does use threads, thr above guarantee doesn't
61     cover your libc, for example, malloc and other libc functions are not
62     fork-safe, so there is very little you can do after a fork, and in fatc,
63     the above might crash, and thus change.
64    
65 root 1.1 =head1 INITIALISATION/INTEGRATION
66    
67     Before you can call any eio functions you first have to initialise the
68     library. The library integrates into any event loop, but can also be used
69     without one, including in polling mode.
70    
71     You have to provide the necessary glue yourself, however.
72    
73     =over 4
74    
75     =item int eio_init (void (*want_poll)(void), void (*done_poll)(void))
76    
77     This function initialises the library. On success it returns C<0>, on
78     failure it returns C<-1> and sets C<errno> appropriately.
79    
80     It accepts two function pointers specifying callbacks as argument, both of
81     which can be C<0>, in which case the callback isn't called.
82    
83     =item want_poll callback
84    
85     The C<want_poll> callback is invoked whenever libeio wants attention (i.e.
86     it wants to be polled by calling C<eio_poll>). It is "edge-triggered",
87     that is, it will only be called once when eio wants attention, until all
88     pending requests have been handled.
89    
90     This callback is called while locks are being held, so I<you must
91     not call any libeio functions inside this callback>. That includes
92     C<eio_poll>. What you should do is notify some other thread, or wake up
93     your event loop, and then call C<eio_poll>.
94    
95     =item done_poll callback
96    
97     This callback is invoked when libeio detects that all pending requests
98     have been handled. It is "edge-triggered", that is, it will only be
99     called once after C<want_poll>. To put it differently, C<want_poll> and
100     C<done_poll> are invoked in pairs: after C<want_poll> you have to call
101     C<eio_poll ()> until either C<eio_poll> indicates that everything has been
102     handled or C<done_poll> has been called, which signals the same.
103    
104     Note that C<eio_poll> might return after C<done_poll> and C<want_poll>
105     have been called again, so watch out for races in your code.
106    
107 sf-exg 1.6 As with C<want_poll>, this callback is called while locks are being held,
108 root 1.1 so you I<must not call any libeio functions form within this callback>.
109    
110     =item int eio_poll ()
111    
112     This function has to be called whenever there are pending requests that
113     need finishing. You usually call this after C<want_poll> has indicated
114     that you should do so, but you can also call this function regularly to
115     poll for new results.
116    
117     If any request invocation returns a non-zero value, then C<eio_poll ()>
118     immediately returns with that value as return value.
119    
120     Otherwise, if all requests could be handled, it returns C<0>. If for some
121     reason not all requests have been handled, i.e. some are still pending, it
122     returns C<-1>.
123    
124     =back
125    
126     For libev, you would typically use an C<ev_async> watcher: the
127     C<want_poll> callback would invoke C<ev_async_send> to wake up the event
128     loop. Inside the callback set for the watcher, one would call C<eio_poll
129 root 1.15 ()>.
130    
131     If C<eio_poll ()> is configured to not handle all results in one go
132     (i.e. it returns C<-1>) then you should start an idle watcher that calls
133     C<eio_poll> until it returns something C<!= -1>.
134    
135 root 1.16 A full-featured conenctor between libeio and libev would look as follows
136     (if C<eio_poll> is handling all requests, it can of course be simplified a
137     lot by removing the idle watcher logic):
138 root 1.15
139     static struct ev_loop *loop;
140     static ev_idle repeat_watcher;
141     static ev_async ready_watcher;
142    
143     /* idle watcher callback, only used when eio_poll */
144     /* didn't handle all results in one call */
145     static void
146     repeat (EV_P_ ev_idle *w, int revents)
147     {
148     if (eio_poll () != -1)
149     ev_idle_stop (EV_A_ w);
150     }
151    
152     /* eio has some results, process them */
153     static void
154     ready (EV_P_ ev_async *w, int revents)
155     {
156     if (eio_poll () == -1)
157     ev_idle_start (EV_A_ &repeat_watcher);
158     }
159    
160     /* wake up the event loop */
161     static void
162     want_poll (void)
163     {
164     ev_async_send (loop, &ready_watcher)
165     }
166    
167     void
168     my_init_eio ()
169     {
170     loop = EV_DEFAULT;
171    
172     ev_idle_init (&repeat_watcher, repeat);
173     ev_async_init (&ready_watcher, ready);
174     ev_async_start (loop &watcher);
175    
176     eio_init (want_poll, 0);
177     }
178 root 1.1
179     For most other event loops, you would typically use a pipe - the event
180 sf-exg 1.6 loop should be told to wait for read readiness on the read end. In
181 root 1.1 C<want_poll> you would write a single byte, in C<done_poll> you would try
182     to read that byte, and in the callback for the read end, you would call
183 root 1.16 C<eio_poll>.
184    
185     You don't have to take special care in the case C<eio_poll> doesn't handle
186     all requests, as the done callback will not be invoked, so the event loop
187     will still signal readyness for the pipe until I<all> results have been
188     processed.
189 root 1.1
190    
191 root 1.7 =head1 HIGH LEVEL REQUEST API
192    
193     Libeio has both a high-level API, which consists of calling a request
194     function with a callback to be called on completion, and a low-level API
195     where you fill out request structures and submit them.
196    
197     This section describes the high-level API.
198    
199     =head2 REQUEST SUBMISSION AND RESULT PROCESSING
200    
201     You submit a request by calling the relevant C<eio_TYPE> function with the
202     required parameters, a callback of type C<int (*eio_cb)(eio_req *req)>
203     (called C<eio_cb> below) and a freely usable C<void *data> argument.
204    
205 root 1.12 The return value will either be 0, in case something went really wrong
206     (which can basically only happen on very fatal errors, such as C<malloc>
207     returning 0, which is rather unlikely), or a pointer to the newly-created
208     and submitted C<eio_req *>.
209 root 1.7
210     The callback will be called with an C<eio_req *> which contains the
211     results of the request. The members you can access inside that structure
212     vary from request to request, except for:
213    
214     =over 4
215    
216     =item C<ssize_t result>
217    
218     This contains the result value from the call (usually the same as the
219     syscall of the same name).
220    
221     =item C<int errorno>
222    
223     This contains the value of C<errno> after the call.
224    
225     =item C<void *data>
226    
227     The C<void *data> member simply stores the value of the C<data> argument.
228    
229     =back
230    
231     The return value of the callback is normally C<0>, which tells libeio to
232     continue normally. If a callback returns a nonzero value, libeio will
233     stop processing results (in C<eio_poll>) and will return the value to its
234     caller.
235    
236     Memory areas passed to libeio must stay valid as long as a request
237     executes, with the exception of paths, which are being copied
238     internally. Any memory libeio itself allocates will be freed after the
239     finish callback has been called. If you want to manage all memory passed
240     to libeio yourself you can use the low-level API.
241    
242     For example, to open a file, you could do this:
243    
244     static int
245     file_open_done (eio_req *req)
246     {
247     if (req->result < 0)
248     {
249     /* open() returned -1 */
250     errno = req->errorno;
251     perror ("open");
252     }
253     else
254     {
255     int fd = req->result;
256     /* now we have the new fd in fd */
257     }
258    
259     return 0;
260     }
261    
262     /* the first three arguments are passed to open(2) */
263     /* the remaining are priority, callback and data */
264     if (!eio_open ("/etc/passwd", O_RDONLY, 0, 0, file_open_done, 0))
265     abort (); /* something ent wrong, we will all die!!! */
266    
267     Note that you additionally need to call C<eio_poll> when the C<want_cb>
268     indicates that requests are ready to be processed.
269    
270     =head2 AVAILABLE REQUESTS
271    
272     The following request functions are available. I<All> of them return the
273     C<eio_req *> on success and C<0> on failure, and I<all> of them have the
274     same three trailing arguments: C<pri>, C<cb> and C<data>. The C<cb> is
275     mandatory, but in most cases, you pass in C<0> as C<pri> and C<0> or some
276     custom data value as C<data>.
277    
278     =head3 POSIX API WRAPPERS
279    
280     These requests simply wrap the POSIX call of the same name, with the same
281 root 1.11 arguments. If a function is not implemented by the OS and cannot be emulated
282 root 1.10 in some way, then all of these return C<-1> and set C<errorno> to C<ENOSYS>.
283 root 1.7
284     =over 4
285    
286     =item eio_open (const char *path, int flags, mode_t mode, int pri, eio_cb cb, void *data)
287    
288     =item eio_truncate (const char *path, off_t offset, int pri, eio_cb cb, void *data)
289    
290     =item eio_chown (const char *path, uid_t uid, gid_t gid, int pri, eio_cb cb, void *data)
291    
292     =item eio_chmod (const char *path, mode_t mode, int pri, eio_cb cb, void *data)
293    
294     =item eio_mkdir (const char *path, mode_t mode, int pri, eio_cb cb, void *data)
295    
296     =item eio_rmdir (const char *path, int pri, eio_cb cb, void *data)
297    
298     =item eio_unlink (const char *path, int pri, eio_cb cb, void *data)
299    
300 root 1.10 =item eio_utime (const char *path, eio_tstamp atime, eio_tstamp mtime, int pri, eio_cb cb, void *data)
301 root 1.7
302     =item eio_mknod (const char *path, mode_t mode, dev_t dev, int pri, eio_cb cb, void *data)
303    
304     =item eio_link (const char *path, const char *new_path, int pri, eio_cb cb, void *data)
305    
306     =item eio_symlink (const char *path, const char *new_path, int pri, eio_cb cb, void *data)
307    
308     =item eio_rename (const char *path, const char *new_path, int pri, eio_cb cb, void *data)
309    
310     =item eio_mlock (void *addr, size_t length, int pri, eio_cb cb, void *data)
311    
312     =item eio_close (int fd, int pri, eio_cb cb, void *data)
313    
314     =item eio_sync (int pri, eio_cb cb, void *data)
315    
316     =item eio_fsync (int fd, int pri, eio_cb cb, void *data)
317    
318     =item eio_fdatasync (int fd, int pri, eio_cb cb, void *data)
319    
320     =item eio_futime (int fd, eio_tstamp atime, eio_tstamp mtime, int pri, eio_cb cb, void *data)
321    
322     =item eio_ftruncate (int fd, off_t offset, int pri, eio_cb cb, void *data)
323    
324     =item eio_fchmod (int fd, mode_t mode, int pri, eio_cb cb, void *data)
325    
326     =item eio_fchown (int fd, uid_t uid, gid_t gid, int pri, eio_cb cb, void *data)
327    
328     =item eio_dup2 (int fd, int fd2, int pri, eio_cb cb, void *data)
329    
330     These have the same semantics as the syscall of the same name, their
331     return value is available as C<< req->result >> later.
332    
333     =item eio_read (int fd, void *buf, size_t length, off_t offset, int pri, eio_cb cb, void *data)
334    
335     =item eio_write (int fd, void *buf, size_t length, off_t offset, int pri, eio_cb cb, void *data)
336    
337     These two requests are called C<read> and C<write>, but actually wrap
338     C<pread> and C<pwrite>. On systems that lack these calls (such as cygwin),
339     libeio uses lseek/read_or_write/lseek and a mutex to serialise the
340     requests, so all these requests run serially and do not disturb each
341     other. However, they still disturb the file offset while they run, so it's
342     not safe to call these functions concurrently with non-libeio functions on
343     the same fd on these systems.
344    
345     Not surprisingly, pread and pwrite are not thread-safe on Darwin (OS/X),
346     so it is advised not to submit multiple requests on the same fd on this
347     horrible pile of garbage.
348    
349 root 1.10 =item eio_mlockall (int flags, int pri, eio_cb cb, void *data)
350    
351     Like C<mlockall>, but the flag value constants are called
352     C<EIO_MCL_CURRENT> and C<EIO_MCL_FUTURE>.
353    
354     =item eio_msync (void *addr, size_t length, int flags, int pri, eio_cb cb, void *data)
355    
356     Just like msync, except that the flag values are called C<EIO_MS_ASYNC>,
357     C<EIO_MS_INVALIDATE> and C<EIO_MS_SYNC>.
358    
359     =item eio_readlink (const char *path, int pri, eio_cb cb, void *data)
360    
361     If successful, the path read by C<readlink(2)> can be accessed via C<<
362     req->ptr2 >> and is I<NOT> null-terminated, with the length specified as
363     C<< req->result >>.
364    
365     if (req->result >= 0)
366     {
367     char *target = strndup ((char *)req->ptr2, req->result);
368    
369     free (target);
370     }
371    
372 root 1.13 =item eio_realpath (const char *path, int pri, eio_cb cb, void *data)
373    
374     Similar to the realpath libc function, but unlike that one, result is
375 root 1.14 C<-1> on failure and the length of the returned path in C<ptr2> (which is
376     not 0-terminated) - this is similar to readlink.
377 root 1.13
378 root 1.10 =item eio_stat (const char *path, int pri, eio_cb cb, void *data)
379    
380     =item eio_lstat (const char *path, int pri, eio_cb cb, void *data)
381    
382 root 1.7 =item eio_fstat (int fd, int pri, eio_cb cb, void *data)
383    
384     Stats a file - if C<< req->result >> indicates success, then you can
385     access the C<struct stat>-like structure via C<< req->ptr2 >>:
386    
387     EIO_STRUCT_STAT *statdata = (EIO_STRUCT_STAT *)req->ptr2;
388    
389 root 1.10 =item eio_statvfs (const char *path, int pri, eio_cb cb, void *data)
390    
391     =item eio_fstatvfs (int fd, int pri, eio_cb cb, void *data)
392 root 1.7
393     Stats a filesystem - if C<< req->result >> indicates success, then you can
394     access the C<struct statvfs>-like structure via C<< req->ptr2 >>:
395    
396     EIO_STRUCT_STATVFS *statdata = (EIO_STRUCT_STATVFS *)req->ptr2;
397    
398     =back
399    
400     =head3 READING DIRECTORIES
401    
402     Reading directories sounds simple, but can be rather demanding, especially
403     if you want to do stuff such as traversing a diretcory hierarchy or
404     processing all files in a directory. Libeio can assist thess complex tasks
405     with it's C<eio_readdir> call.
406    
407     =over 4
408    
409     =item eio_readdir (const char *path, int flags, int pri, eio_cb cb, void *data)
410    
411     This is a very complex call. It basically reads through a whole directory
412     (via the C<opendir>, C<readdir> and C<closedir> calls) and returns either
413     the names or an array of C<struct eio_dirent>, depending on the C<flags>
414     argument.
415    
416     The C<< req->result >> indicates either the number of files found, or
417 root 1.10 C<-1> on error. On success, null-terminated names can be found as C<< req->ptr2 >>,
418 root 1.7 and C<struct eio_dirents>, if requested by C<flags>, can be found via C<<
419     req->ptr1 >>.
420    
421     Here is an example that prints all the names:
422    
423     int i;
424     char *names = (char *)req->ptr2;
425    
426     for (i = 0; i < req->result; ++i)
427     {
428     printf ("name #%d: %s\n", i, names);
429    
430     /* move to next name */
431     names += strlen (names) + 1;
432     }
433    
434     Pseudo-entries such as F<.> and F<..> are never returned by C<eio_readdir>.
435    
436     C<flags> can be any combination of:
437    
438     =over 4
439    
440     =item EIO_READDIR_DENTS
441    
442     If this flag is specified, then, in addition to the names in C<ptr2>,
443     also an array of C<struct eio_dirent> is returned, in C<ptr1>. A C<struct
444     eio_dirent> looks like this:
445    
446     struct eio_dirent
447     {
448     int nameofs; /* offset of null-terminated name string in (char *)req->ptr2 */
449     unsigned short namelen; /* size of filename without trailing 0 */
450     unsigned char type; /* one of EIO_DT_* */
451     signed char score; /* internal use */
452     ino_t inode; /* the inode number, if available, otherwise unspecified */
453     };
454    
455     The only members you normally would access are C<nameofs>, which is the
456     byte-offset from C<ptr2> to the start of the name, C<namelen> and C<type>.
457    
458     C<type> can be one of:
459    
460     C<EIO_DT_UNKNOWN> - if the type is not known (very common) and you have to C<stat>
461     the name yourself if you need to know,
462     one of the "standard" POSIX file types (C<EIO_DT_REG>, C<EIO_DT_DIR>, C<EIO_DT_LNK>,
463     C<EIO_DT_FIFO>, C<EIO_DT_SOCK>, C<EIO_DT_CHR>, C<EIO_DT_BLK>)
464     or some OS-specific type (currently
465     C<EIO_DT_MPC> - multiplexed char device (v7+coherent),
466     C<EIO_DT_NAM> - xenix special named file,
467     C<EIO_DT_MPB> - multiplexed block device (v7+coherent),
468     C<EIO_DT_NWK> - HP-UX network special,
469     C<EIO_DT_CMP> - VxFS compressed,
470     C<EIO_DT_DOOR> - solaris door, or
471     C<EIO_DT_WHT>).
472    
473     This example prints all names and their type:
474    
475     int i;
476     struct eio_dirent *ents = (struct eio_dirent *)req->ptr1;
477     char *names = (char *)req->ptr2;
478    
479     for (i = 0; i < req->result; ++i)
480     {
481     struct eio_dirent *ent = ents + i;
482     char *name = names + ent->nameofs;
483    
484     printf ("name #%d: %s (type %d)\n", i, name, ent->type);
485     }
486    
487     =item EIO_READDIR_DIRS_FIRST
488    
489     When this flag is specified, then the names will be returned in an order
490     where likely directories come first, in optimal C<stat> order. This is
491     useful when you need to quickly find directories, or you want to find all
492     directories while avoiding to stat() each entry.
493    
494     If the system returns type information in readdir, then this is used
495     to find directories directly. Otherwise, likely directories are names
496     beginning with ".", or otherwise names with no dots, of which names with
497     short names are tried first.
498    
499     =item EIO_READDIR_STAT_ORDER
500    
501     When this flag is specified, then the names will be returned in an order
502     suitable for stat()'ing each one. That is, when you plan to stat()
503     all files in the given directory, then the returned order will likely
504     be fastest.
505    
506     If both this flag and C<EIO_READDIR_DIRS_FIRST> are specified, then
507     the likely dirs come first, resulting in a less optimal stat order.
508    
509     =item EIO_READDIR_FOUND_UNKNOWN
510    
511     This flag should not be specified when calling C<eio_readdir>. Instead,
512     it is being set by C<eio_readdir> (you can access the C<flags> via C<<
513     req->int1 >>, when any of the C<type>'s found were C<EIO_DT_UNKNOWN>. The
514     absense of this flag therefore indicates that all C<type>'s are known,
515     which can be used to speed up some algorithms.
516    
517     A typical use case would be to identify all subdirectories within a
518     directory - you would ask C<eio_readdir> for C<EIO_READDIR_DIRS_FIRST>. If
519     then this flag is I<NOT> set, then all the entries at the beginning of the
520     returned array of type C<EIO_DT_DIR> are the directories. Otherwise, you
521     should start C<stat()>'ing the entries starting at the beginning of the
522     array, stopping as soon as you found all directories (the count can be
523     deduced by the link count of the directory).
524    
525     =back
526    
527     =back
528    
529     =head3 OS-SPECIFIC CALL WRAPPERS
530    
531     These wrap OS-specific calls (usually Linux ones), and might or might not
532     be emulated on other operating systems. Calls that are not emulated will
533     return C<-1> and set C<errno> to C<ENOSYS>.
534    
535     =over 4
536    
537     =item eio_sendfile (int out_fd, int in_fd, off_t in_offset, size_t length, int pri, eio_cb cb, void *data)
538    
539     Wraps the C<sendfile> syscall. The arguments follow the Linux version, but
540     libeio supports and will use similar calls on FreeBSD, HP/UX, Solaris and
541     Darwin.
542    
543     If the OS doesn't support some sendfile-like call, or the call fails,
544     indicating support for the given file descriptor type (for example,
545     Linux's sendfile might not support file to file copies), then libeio will
546     emulate the call in userspace, so there are almost no limitations on its
547     use.
548    
549     =item eio_readahead (int fd, off_t offset, size_t length, int pri, eio_cb cb, void *data)
550    
551     Calls C<readahead(2)>. If the syscall is missing, then the call is
552     emulated by simply reading the data (currently in 64kiB chunks).
553    
554     =item eio_sync_file_range (int fd, off_t offset, size_t nbytes, unsigned int flags, int pri, eio_cb cb, void *data)
555    
556     Calls C<sync_file_range>. If the syscall is missing, then this is the same
557     as calling C<fdatasync>.
558    
559 root 1.10 Flags can be any combination of C<EIO_SYNC_FILE_RANGE_WAIT_BEFORE>,
560     C<EIO_SYNC_FILE_RANGE_WRITE> and C<EIO_SYNC_FILE_RANGE_WAIT_AFTER>.
561    
562 root 1.7 =back
563    
564     =head3 LIBEIO-SPECIFIC REQUESTS
565    
566     These requests are specific to libeio and do not correspond to any OS call.
567    
568     =over 4
569    
570 root 1.9 =item eio_mtouch (void *addr, size_t length, int flags, int pri, eio_cb cb, void *data)
571 root 1.7
572 root 1.9 Reads (C<flags == 0>) or modifies (C<flags == EIO_MT_MODIFY) the given
573     memory area, page-wise, that is, it reads (or reads and writes back) the
574     first octet of every page that spans the memory area.
575    
576     This can be used to page in some mmapped file, or dirty some pages. Note
577     that dirtying is an unlocked read-write access, so races can ensue when
578     the some other thread modifies the data stored in that memory area.
579    
580     =item eio_custom (void (*)(eio_req *) execute, int pri, eio_cb cb, void *data)
581 root 1.7
582     Executes a custom request, i.e., a user-specified callback.
583    
584     The callback gets the C<eio_req *> as parameter and is expected to read
585     and modify any request-specific members. Specifically, it should set C<<
586     req->result >> to the result value, just like other requests.
587    
588     Here is an example that simply calls C<open>, like C<eio_open>, but it
589     uses the C<data> member as filename and uses a hardcoded C<O_RDONLY>. If
590     you want to pass more/other parameters, you either need to pass some
591     struct or so via C<data> or provide your own wrapper using the low-level
592     API.
593    
594     static int
595     my_open_done (eio_req *req)
596     {
597     int fd = req->result;
598    
599     return 0;
600     }
601    
602     static void
603     my_open (eio_req *req)
604     {
605     req->result = open (req->data, O_RDONLY);
606     }
607    
608     eio_custom (my_open, 0, my_open_done, "/etc/passwd");
609    
610 root 1.9 =item eio_busy (eio_tstamp delay, int pri, eio_cb cb, void *data)
611 root 1.7
612     This is a a request that takes C<delay> seconds to execute, but otherwise
613     does nothing - it simply puts one of the worker threads to sleep for this
614     long.
615    
616     This request can be used to artificially increase load, e.g. for debugging
617     or benchmarking reasons.
618    
619 root 1.9 =item eio_nop (int pri, eio_cb cb, void *data)
620 root 1.7
621     This request does nothing, except go through the whole request cycle. This
622     can be used to measure latency or in some cases to simplify code, but is
623     not really of much use.
624    
625     =back
626    
627     =head3 GROUPING AND LIMITING REQUESTS
628 root 1.1
629 root 1.12 There is one more rather special request, C<eio_grp>. It is a very special
630     aio request: Instead of doing something, it is a container for other eio
631     requests.
632    
633     There are two primary use cases for this: a) bundle many requests into a
634     single, composite, request with a definite callback and the ability to
635     cancel the whole request with its subrequests and b) limiting the number
636     of "active" requests.
637    
638     Further below you will find more dicussion of these topics - first follows
639     the reference section detailing the request generator and other methods.
640    
641     =over 4
642    
643     =item eio_grp (eio_cb cb, void *data)
644    
645     Creates and submits a group request.
646    
647     =back
648    
649    
650    
651 root 1.1 #TODO
652    
653 root 1.7 /*****************************************************************************/
654     /* groups */
655 root 1.1
656 root 1.7 eio_req *eio_grp (eio_cb cb, void *data);
657     void eio_grp_feed (eio_req *grp, void (*feed)(eio_req *req), int limit);
658     void eio_grp_limit (eio_req *grp, int limit);
659     void eio_grp_add (eio_req *grp, eio_req *req);
660     void eio_grp_cancel (eio_req *grp); /* cancels all sub requests but not the group */
661 root 1.1
662    
663     =back
664    
665    
666     =head1 LOW LEVEL REQUEST API
667    
668     #TODO
669    
670 root 1.7
671     =head1 ANATOMY AND LIFETIME OF AN EIO REQUEST
672    
673     A request is represented by a structure of type C<eio_req>. To initialise
674     it, clear it to all zero bytes:
675    
676     eio_req req;
677    
678     memset (&req, 0, sizeof (req));
679    
680     A more common way to initialise a new C<eio_req> is to use C<calloc>:
681    
682     eio_req *req = calloc (1, sizeof (*req));
683    
684     In either case, libeio neither allocates, initialises or frees the
685     C<eio_req> structure for you - it merely uses it.
686    
687     zero
688    
689     #TODO
690    
691 root 1.8 =head2 CONFIGURATION
692    
693     The functions in this section can sometimes be useful, but the default
694     configuration will do in most case, so you should skip this section on
695     first reading.
696    
697     =over 4
698    
699     =item eio_set_max_poll_time (eio_tstamp nseconds)
700    
701     This causes C<eio_poll ()> to return after it has detected that it was
702     running for C<nsecond> seconds or longer (this number can be fractional).
703    
704     This can be used to limit the amount of time spent handling eio requests,
705     for example, in interactive programs, you might want to limit this time to
706     C<0.01> seconds or so.
707    
708     Note that:
709    
710     a) libeio doesn't know how long your request callbacks take, so the time
711     spent in C<eio_poll> is up to one callback invocation longer then this
712     interval.
713    
714     b) this is implemented by calling C<gettimeofday> after each request,
715     which can be costly.
716    
717     c) at least one request will be handled.
718    
719     =item eio_set_max_poll_reqs (unsigned int nreqs)
720    
721     When C<nreqs> is non-zero, then C<eio_poll> will not handle more than
722     C<nreqs> requests per invocation. This is a less costly way to limit the
723     amount of work done by C<eio_poll> then setting a time limit.
724    
725     If you know your callbacks are generally fast, you could use this to
726     encourage interactiveness in your programs by setting it to C<10>, C<100>
727     or even C<1000>.
728    
729     =item eio_set_min_parallel (unsigned int nthreads)
730    
731     Make sure libeio can handle at least this many requests in parallel. It
732     might be able handle more.
733    
734     =item eio_set_max_parallel (unsigned int nthreads)
735    
736     Set the maximum number of threads that libeio will spawn.
737    
738     =item eio_set_max_idle (unsigned int nthreads)
739    
740     Libeio uses threads internally to handle most requests, and will start and stop threads on demand.
741    
742     This call can be used to limit the number of idle threads (threads without
743     work to do): libeio will keep some threads idle in preparation for more
744     requests, but never longer than C<nthreads> threads.
745    
746     In addition to this, libeio will also stop threads when they are idle for
747     a few seconds, regardless of this setting.
748    
749     =item unsigned int eio_nthreads ()
750    
751     Return the number of worker threads currently running.
752    
753     =item unsigned int eio_nreqs ()
754    
755     Return the number of requests currently handled by libeio. This is the
756     total number of requests that have been submitted to libeio, but not yet
757     destroyed.
758    
759     =item unsigned int eio_nready ()
760    
761     Returns the number of ready requests, i.e. requests that have been
762     submitted but have not yet entered the execution phase.
763    
764     =item unsigned int eio_npending ()
765    
766     Returns the number of pending requests, i.e. requests that have been
767     executed and have results, but have not been finished yet by a call to
768     C<eio_poll>).
769    
770     =back
771    
772 root 1.1 =head1 EMBEDDING
773    
774     Libeio can be embedded directly into programs. This functionality is not
775     documented and not (yet) officially supported.
776    
777 root 1.3 Note that, when including C<libeio.m4>, you are responsible for defining
778     the compilation environment (C<_LARGEFILE_SOURCE>, C<_GNU_SOURCE> etc.).
779    
780 root 1.2 If you need to know how, check the C<IO::AIO> perl module, which does
781 root 1.1 exactly that.
782    
783    
784 root 1.4 =head1 COMPILETIME CONFIGURATION
785    
786     These symbols, if used, must be defined when compiling F<eio.c>.
787    
788     =over 4
789    
790     =item EIO_STACKSIZE
791    
792     This symbol governs the stack size for each eio thread. Libeio itself
793     was written to use very little stackspace, but when using C<EIO_CUSTOM>
794     requests, you might want to increase this.
795    
796     If this symbol is undefined (the default) then libeio will use its default
797     stack size (C<sizeof (long) * 4096> currently). If it is defined, but
798     C<0>, then the default operating system stack size will be used. In all
799     other cases, the value must be an expression that evaluates to the desired
800     stack size.
801    
802     =back
803    
804    
805 root 1.1 =head1 PORTABILITY REQUIREMENTS
806    
807     In addition to a working ISO-C implementation, libeio relies on a few
808     additional extensions:
809    
810     =over 4
811    
812     =item POSIX threads
813    
814     To be portable, this module uses threads, specifically, the POSIX threads
815     library must be available (and working, which partially excludes many xBSD
816     systems, where C<fork ()> is buggy).
817    
818     =item POSIX-compatible filesystem API
819    
820     This is actually a harder portability requirement: The libeio API is quite
821     demanding regarding POSIX API calls (symlinks, user/group management
822     etc.).
823    
824     =item C<double> must hold a time value in seconds with enough accuracy
825    
826     The type C<double> is used to represent timestamps. It is required to
827     have at least 51 bits of mantissa (and 9 bits of exponent), which is good
828     enough for at least into the year 4000. This requirement is fulfilled by
829     implementations implementing IEEE 754 (basically all existing ones).
830    
831     =back
832    
833     If you know of other additional requirements drop me a note.
834    
835    
836     =head1 AUTHOR
837    
838     Marc Lehmann <libeio@schmorp.de>.
839