ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libeio/eio.pod
(Generate patch)

Comparing libeio/eio.pod (file contents):
Revision 1.19 by root, Tue Jul 5 20:38:47 2011 UTC vs.
Revision 1.36 by root, Sun Jan 24 16:36:20 2016 UTC

25similar functions, as well as less rarely ones such as C<mknod>, C<futime> 25similar functions, as well as less rarely ones such as C<mknod>, C<futime>
26or C<readlink>. 26or C<readlink>.
27 27
28It also offers wrappers around C<sendfile> (Solaris, Linux, HP-UX and 28It also offers wrappers around C<sendfile> (Solaris, Linux, HP-UX and
29FreeBSD, with emulation on other platforms) and C<readahead> (Linux, with 29FreeBSD, with emulation on other platforms) and C<readahead> (Linux, with
30emulation elsewhere>). 30emulation elsewhere).
31 31
32The goal is to enable you to write fully non-blocking programs. For 32The goal is to enable you to write fully non-blocking programs. For
33example, in a game server, you would not want to freeze for a few seconds 33example, in a game server, you would not want to freeze for a few seconds
34just because the server is running a backup and you happen to call 34just because the server is running a backup and you happen to call
35C<readdir>. 35C<readdir>.
45Unlike the name component C<stamp> might indicate, it is also used for 45Unlike the name component C<stamp> might indicate, it is also used for
46time differences throughout libeio. 46time differences throughout libeio.
47 47
48=head2 FORK SUPPORT 48=head2 FORK SUPPORT
49 49
50Calling C<fork ()> is fully supported by this module - but you must not 50Usage of pthreads in a program changes the semantics of fork
51rely on this. It is currently implemented in these steps: 51considerably. Specifically, only async-safe functions can be called after
52fork. Libeio uses pthreads, so this applies, and makes using fork hard for
53anything but relatively fork + exec uses.
52 54
53 1. wait till all requests in "execute" state have been handled 55This library only works in the process that initialised it: Forking is
54 (basically requests that are already handed over to the kernel). 56fully supported, but using libeio in any other process than the one that
55 2. fork 57called C<eio_init> is not.
56 3. in the parent, continue business as usual, done
57 4. in the child, destroy all ready and pending requests and free the
58 memory used by the worker threads. This gives you a fully empty
59 libeio queue.
60 58
61Note, however, since libeio does use threads, the above guarantee doesn't 59You might get around by not I<using> libeio before (or after) forking in
62cover your libc, for example, malloc and other libc functions are not 60the parent, and using it in the child afterwards. You could also try to
63fork-safe, so there is very little you can do after a fork, and in fact, 61call the L<eio_init> function again in the child, which will brutally
64the above might crash, and thus change. 62reinitialise all data structures, which isn't POSIX conformant, but
63typically works.
64
65Otherwise, the only recommendation you should follow is: treat fork code
66the same way you treat signal handlers, and only ever call C<eio_init> in
67the process that uses it, and only once ever.
65 68
66=head1 INITIALISATION/INTEGRATION 69=head1 INITIALISATION/INTEGRATION
67 70
68Before you can call any eio functions you first have to initialise the 71Before you can call any eio functions you first have to initialise the
69library. The library integrates into any event loop, but can also be used 72library. The library integrates into any event loop, but can also be used
78This function initialises the library. On success it returns C<0>, on 81This function initialises the library. On success it returns C<0>, on
79failure it returns C<-1> and sets C<errno> appropriately. 82failure it returns C<-1> and sets C<errno> appropriately.
80 83
81It accepts two function pointers specifying callbacks as argument, both of 84It accepts two function pointers specifying callbacks as argument, both of
82which can be C<0>, in which case the callback isn't called. 85which can be C<0>, in which case the callback isn't called.
86
87There is currently no way to change these callbacks later, or to
88"uninitialise" the library again.
83 89
84=item want_poll callback 90=item want_poll callback
85 91
86The C<want_poll> callback is invoked whenever libeio wants attention (i.e. 92The C<want_poll> callback is invoked whenever libeio wants attention (i.e.
87it wants to be polled by calling C<eio_poll>). It is "edge-triggered", 93it wants to be polled by calling C<eio_poll>). It is "edge-triggered",
98This callback is invoked when libeio detects that all pending requests 104This callback is invoked when libeio detects that all pending requests
99have been handled. It is "edge-triggered", that is, it will only be 105have been handled. It is "edge-triggered", that is, it will only be
100called once after C<want_poll>. To put it differently, C<want_poll> and 106called once after C<want_poll>. To put it differently, C<want_poll> and
101C<done_poll> are invoked in pairs: after C<want_poll> you have to call 107C<done_poll> are invoked in pairs: after C<want_poll> you have to call
102C<eio_poll ()> until either C<eio_poll> indicates that everything has been 108C<eio_poll ()> until either C<eio_poll> indicates that everything has been
103handled or C<done_poll> has been called, which signals the same. 109handled or C<done_poll> has been called, which signals the same - only one
110method is needed.
104 111
105Note that C<eio_poll> might return after C<done_poll> and C<want_poll> 112Note that C<eio_poll> might return after C<done_poll> and C<want_poll>
106have been called again, so watch out for races in your code. 113have been called again, so watch out for races in your code.
107 114
115It is quite common to have an empty C<done_call> callback and only use
116the return value from C<eio_poll>, or, when C<eio_poll> is configured to
117handle all outstanding replies, it's enough to call C<eio_poll> once.
118
108As with C<want_poll>, this callback is called while locks are being held, 119As with C<want_poll>, this callback is called while locks are being held,
109so you I<must not call any libeio functions form within this callback>. 120so you I<must not call any libeio functions from within this callback>.
110 121
111=item int eio_poll () 122=item int eio_poll ()
112 123
113This function has to be called whenever there are pending requests that 124This function has to be called whenever there are pending requests that
114need finishing. You usually call this after C<want_poll> has indicated 125need finishing. You usually call this after C<want_poll> has indicated
131 142
132If C<eio_poll ()> is configured to not handle all results in one go 143If C<eio_poll ()> is configured to not handle all results in one go
133(i.e. it returns C<-1>) then you should start an idle watcher that calls 144(i.e. it returns C<-1>) then you should start an idle watcher that calls
134C<eio_poll> until it returns something C<!= -1>. 145C<eio_poll> until it returns something C<!= -1>.
135 146
136A full-featured conenctor between libeio and libev would look as follows 147A full-featured connector between libeio and libev would look as follows
137(if C<eio_poll> is handling all requests, it can of course be simplified a 148(if C<eio_poll> is handling all requests, it can of course be simplified a
138lot by removing the idle watcher logic): 149lot by removing the idle watcher logic):
139 150
140 static struct ev_loop *loop; 151 static struct ev_loop *loop;
141 static ev_idle repeat_watcher; 152 static ev_idle repeat_watcher;
170 { 181 {
171 loop = EV_DEFAULT; 182 loop = EV_DEFAULT;
172 183
173 ev_idle_init (&repeat_watcher, repeat); 184 ev_idle_init (&repeat_watcher, repeat);
174 ev_async_init (&ready_watcher, ready); 185 ev_async_init (&ready_watcher, ready);
175 ev_async_start (loop &watcher); 186 ev_async_start (loop, &watcher);
176 187
177 eio_init (want_poll, 0); 188 eio_init (want_poll, 0);
178 } 189 }
179 190
180For most other event loops, you would typically use a pipe - the event 191For most other event loops, you would typically use a pipe - the event
227 238
228The C<void *data> member simply stores the value of the C<data> argument. 239The C<void *data> member simply stores the value of the C<data> argument.
229 240
230=back 241=back
231 242
243Members not explicitly described as accessible must not be
244accessed. Specifically, there is no guarantee that any members will still
245have the value they had when the request was submitted.
246
232The return value of the callback is normally C<0>, which tells libeio to 247The return value of the callback is normally C<0>, which tells libeio to
233continue normally. If a callback returns a nonzero value, libeio will 248continue normally. If a callback returns a nonzero value, libeio will
234stop processing results (in C<eio_poll>) and will return the value to its 249stop processing results (in C<eio_poll>) and will return the value to its
235caller. 250caller.
236 251
237Memory areas passed to libeio must stay valid as long as a request 252Memory areas passed to libeio wrappers must stay valid as long as a
238executes, with the exception of paths, which are being copied 253request executes, with the exception of paths, which are being copied
239internally. Any memory libeio itself allocates will be freed after the 254internally. Any memory libeio itself allocates will be freed after the
240finish callback has been called. If you want to manage all memory passed 255finish callback has been called. If you want to manage all memory passed
241to libeio yourself you can use the low-level API. 256to libeio yourself you can use the low-level API.
242 257
243For example, to open a file, you could do this: 258For example, to open a file, you could do this:
280 295
281Cancel the request (and all its subrequests). If the request is currently 296Cancel the request (and all its subrequests). If the request is currently
282executing it might still continue to execute, and in other cases it might 297executing it might still continue to execute, and in other cases it might
283still take a while till the request is cancelled. 298still take a while till the request is cancelled.
284 299
285Even if cancelled, the finish callback will still be invoked - the 300When cancelled, the finish callback will not be invoked.
286callbacks of all cancellable requests need to check whether the request
287has been cancelled by calling C<EIO_CANCELLED (req)>:
288
289 static int
290 my_eio_cb (eio_req *req)
291 {
292 if (EIO_CANCELLED (req))
293 return 0;
294 }
295
296In addition, cancelled requests will I<either> have C<< req->result >>
297set to C<-1> and C<errno> to C<ECANCELED>, or I<otherwise> they were
298successfully executed, despite being cancelled (e.g. when they have
299already been executed at the time they were cancelled).
300 301
301C<EIO_CANCELLED> is still true for requests that have successfully 302C<EIO_CANCELLED> is still true for requests that have successfully
302executed, as long as C<eio_cancel> was called on them at some point. 303executed, as long as C<eio_cancel> was called on them at some point.
303 304
304=back 305=back
405 free (target); 406 free (target);
406 } 407 }
407 408
408=item eio_realpath (const char *path, int pri, eio_cb cb, void *data) 409=item eio_realpath (const char *path, int pri, eio_cb cb, void *data)
409 410
410Similar to the realpath libc function, but unlike that one, result is 411Similar to the realpath libc function, but unlike that one, C<<
411C<-1> on failure and the length of the returned path in C<ptr2> (which is 412req->result >> is C<-1> on failure. On success, the result is the length
412not 0-terminated) - this is similar to readlink. 413of the returned path in C<ptr2> (which is I<NOT> 0-terminated) - this is
414similar to readlink.
413 415
414=item eio_stat (const char *path, int pri, eio_cb cb, void *data) 416=item eio_stat (const char *path, int pri, eio_cb cb, void *data)
415 417
416=item eio_lstat (const char *path, int pri, eio_cb cb, void *data) 418=item eio_lstat (const char *path, int pri, eio_cb cb, void *data)
417 419
585=item eio_readahead (int fd, off_t offset, size_t length, int pri, eio_cb cb, void *data) 587=item eio_readahead (int fd, off_t offset, size_t length, int pri, eio_cb cb, void *data)
586 588
587Calls C<readahead(2)>. If the syscall is missing, then the call is 589Calls C<readahead(2)>. If the syscall is missing, then the call is
588emulated by simply reading the data (currently in 64kiB chunks). 590emulated by simply reading the data (currently in 64kiB chunks).
589 591
592=item eio_syncfs (int fd, int pri, eio_cb cb, void *data)
593
594Calls Linux' C<syncfs> syscall, if available. Returns C<-1> and sets
595C<errno> to C<ENOSYS> if the call is missing I<but still calls sync()>,
596if the C<fd> is C<< >= 0 >>, so you can probe for the availability of the
597syscall with a negative C<fd> argument and checking for C<-1/ENOSYS>.
598
590=item eio_sync_file_range (int fd, off_t offset, size_t nbytes, unsigned int flags, int pri, eio_cb cb, void *data) 599=item eio_sync_file_range (int fd, off_t offset, size_t nbytes, unsigned int flags, int pri, eio_cb cb, void *data)
591 600
592Calls C<sync_file_range>. If the syscall is missing, then this is the same 601Calls C<sync_file_range>. If the syscall is missing, then this is the same
593as calling C<fdatasync>. 602as calling C<fdatasync>.
594 603
595Flags can be any combination of C<EIO_SYNC_FILE_RANGE_WAIT_BEFORE>, 604Flags can be any combination of C<EIO_SYNC_FILE_RANGE_WAIT_BEFORE>,
596C<EIO_SYNC_FILE_RANGE_WRITE> and C<EIO_SYNC_FILE_RANGE_WAIT_AFTER>. 605C<EIO_SYNC_FILE_RANGE_WRITE> and C<EIO_SYNC_FILE_RANGE_WAIT_AFTER>.
597 606
607=item eio_fallocate (int fd, int mode, off_t offset, off_t len, int pri, eio_cb cb, void *data)
608
609Calls C<fallocate> (note: I<NOT> C<posix_fallocate>!). If the syscall is
610missing, then it returns failure and sets C<errno> to C<ENOSYS>.
611
612The C<mode> argument can be C<0> (for behaviour similar to
613C<posix_fallocate>), or C<EIO_FALLOC_FL_KEEP_SIZE>, which keeps the size
614of the file unchanged (but still preallocates space beyond end of file).
615
598=back 616=back
599 617
600=head3 LIBEIO-SPECIFIC REQUESTS 618=head3 LIBEIO-SPECIFIC REQUESTS
601 619
602These requests are specific to libeio and do not correspond to any OS call. 620These requests are specific to libeio and do not correspond to any OS call.
603 621
604=over 4 622=over 4
605 623
606=item eio_mtouch (void *addr, size_t length, int flags, int pri, eio_cb cb, void *data) 624=item eio_mtouch (void *addr, size_t length, int flags, int pri, eio_cb cb, void *data)
607 625
608Reads (C<flags == 0>) or modifies (C<flags == EIO_MT_MODIFY) the given 626Reads (C<flags == 0>) or modifies (C<flags == EIO_MT_MODIFY>) the given
609memory area, page-wise, that is, it reads (or reads and writes back) the 627memory area, page-wise, that is, it reads (or reads and writes back) the
610first octet of every page that spans the memory area. 628first octet of every page that spans the memory area.
611 629
612This can be used to page in some mmapped file, or dirty some pages. Note 630This can be used to page in some mmapped file, or dirty some pages. Note
613that dirtying is an unlocked read-write access, so races can ensue when 631that dirtying is an unlocked read-write access, so races can ensue when
677 695
678=over 4 696=over 4
679 697
680=item eio_req *grp = eio_grp (eio_cb cb, void *data) 698=item eio_req *grp = eio_grp (eio_cb cb, void *data)
681 699
682Creates, submits and returns a group request. 700Creates, submits and returns a group request. Note that it doesn't have a
701priority, unlike all other requests.
683 702
684=item eio_grp_add (eio_req *grp, eio_req *req) 703=item eio_grp_add (eio_req *grp, eio_req *req)
685 704
686Adds a request to the request group. 705Adds a request to the request group.
687 706
688=item eio_grp_cancel (eio_req *grp) 707=item eio_grp_cancel (eio_req *grp)
689 708
690Cancels all requests I<in> the group, but I<not> the group request 709Cancels all requests I<in> the group, but I<not> the group request
691itself. You can cancel the group request via a normal C<eio_cancel> call. 710itself. You can cancel the group request I<and> all subrequests via a
711normal C<eio_cancel> call.
692 712
693
694
695=back 713=back
696 714
715=head4 GROUP REQUEST LIFETIME
716
717Left alone, a group request will instantly move to the pending state and
718will be finished at the next call of C<eio_poll>.
719
720The usefulness stems from the fact that, if a subrequest is added to a
721group I<before> a call to C<eio_poll>, via C<eio_grp_add>, then the group
722will not finish until all the subrequests have finished.
723
724So the usage cycle of a group request is like this: after it is created,
725you normally instantly add a subrequest. If none is added, the group
726request will finish on it's own. As long as subrequests are added before
727the group request is finished it will be kept from finishing, that is the
728callbacks of any subrequests can, in turn, add more requests to the group,
729and as long as any requests are active, the group request itself will not
730finish.
731
732=head4 CREATING COMPOSITE REQUESTS
733
734Imagine you wanted to create an C<eio_load> request that opens a file,
735reads it and closes it. This means it has to execute at least three eio
736requests, but for various reasons it might be nice if that request looked
737like any other eio request.
738
739This can be done with groups:
740
741=over 4
742
743=item 1) create the request object
744
745Create a group that contains all further requests. This is the request you
746can return as "the load request".
747
748=item 2) open the file, maybe
749
750Next, open the file with C<eio_open> and add the request to the group
751request and you are finished setting up the request.
752
753If, for some reason, you cannot C<eio_open> (path is a null ptr?) you
754can set C<< grp->result >> to C<-1> to signal an error and let the group
755request finish on its own.
756
757=item 3) open callback adds more requests
758
759In the open callback, if the open was not successful, copy C<<
760req->errorno >> to C<< grp->errorno >> and set C<< grp->result >> to
761C<-1> to signal an error.
762
763Otherwise, malloc some memory or so and issue a read request, adding the
764read request to the group.
765
766=item 4) continue issuing requests till finished
767
768In the read callback, check for errors and possibly continue with
769C<eio_close> or any other eio request in the same way.
770
771As soon as no new requests are added, the group request will finish. Make
772sure you I<always> set C<< grp->result >> to some sensible value.
773
774=back
775
776=head4 REQUEST LIMITING
697 777
698 778
699#TODO 779#TODO
700 780
701/*****************************************************************************/
702/* groups */
703
704eio_req *eio_grp (eio_cb cb, void *data);
705void eio_grp_feed (eio_req *grp, void (*feed)(eio_req *req), int limit);
706void eio_grp_limit (eio_req *grp, int limit); 781void eio_grp_limit (eio_req *grp, int limit);
707void eio_grp_cancel (eio_req *grp); /* cancels all sub requests but not the group */
708 782
709
710=back
711 783
712 784
713=head1 LOW LEVEL REQUEST API 785=head1 LOW LEVEL REQUEST API
714 786
715#TODO 787#TODO
843This symbol governs the stack size for each eio thread. Libeio itself 915This symbol governs the stack size for each eio thread. Libeio itself
844was written to use very little stackspace, but when using C<EIO_CUSTOM> 916was written to use very little stackspace, but when using C<EIO_CUSTOM>
845requests, you might want to increase this. 917requests, you might want to increase this.
846 918
847If this symbol is undefined (the default) then libeio will use its default 919If this symbol is undefined (the default) then libeio will use its default
848stack size (C<sizeof (long) * 4096> currently). If it is defined, but 920stack size (C<sizeof (void *) * 4096> currently). In all other cases, the
849C<0>, then the default operating system stack size will be used. In all
850other cases, the value must be an expression that evaluates to the desired 921value must be an expression that evaluates to the desired stack size.
851stack size.
852 922
853=back 923=back
854 924
855 925
856=head1 PORTABILITY REQUIREMENTS 926=head1 PORTABILITY REQUIREMENTS

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines