ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libeio/eio.pod
(Generate patch)

Comparing libeio/eio.pod (file contents):
Revision 1.17 by root, Tue Jul 5 18:59:28 2011 UTC vs.
Revision 1.35 by root, Mon Aug 18 08:11:54 2014 UTC

25similar functions, as well as less rarely ones such as C<mknod>, C<futime> 25similar functions, as well as less rarely ones such as C<mknod>, C<futime>
26or C<readlink>. 26or C<readlink>.
27 27
28It also offers wrappers around C<sendfile> (Solaris, Linux, HP-UX and 28It also offers wrappers around C<sendfile> (Solaris, Linux, HP-UX and
29FreeBSD, with emulation on other platforms) and C<readahead> (Linux, with 29FreeBSD, with emulation on other platforms) and C<readahead> (Linux, with
30emulation elsewhere>). 30emulation elsewhere).
31 31
32The goal is to enable you to write fully non-blocking programs. For 32The goal is to enable you to write fully non-blocking programs. For
33example, in a game server, you would not want to freeze for a few seconds 33example, in a game server, you would not want to freeze for a few seconds
34just because the server is running a backup and you happen to call 34just because the server is running a backup and you happen to call
35C<readdir>. 35C<readdir>.
45Unlike the name component C<stamp> might indicate, it is also used for 45Unlike the name component C<stamp> might indicate, it is also used for
46time differences throughout libeio. 46time differences throughout libeio.
47 47
48=head2 FORK SUPPORT 48=head2 FORK SUPPORT
49 49
50Calling C<fork ()> is fully supported by this module - but you must not 50Usage of pthreads in a program changes the semantics of fork
51rely on this. It is currently implemented in these steps: 51considerably. Specifically, only async-safe functions can be called after
52fork. Libeio uses pthreads, so this applies, and makes using fork hard for
53anything but relatively fork + exec uses.
52 54
53 1. wait till all requests in "execute" state have been handled 55This library only works in the process that initialised it: Forking is
54 (basically requests that are already handed over to the kernel). 56fully supported, but using libeio in any other process than the one that
55 2. fork 57called C<eio_init> is not.
56 3. in the parent, continue business as usual, done
57 4. in the child, destroy all ready and pending requests and free the
58 memory used by the worker threads. This gives you a fully empty
59 libeio queue.
60 58
61Note, however, since libeio does use threads, the above guarantee doesn't 59You might get around by not I<using> libeio before (or after) forking in
62cover your libc, for example, malloc and other libc functions are not 60the parent, and using it in the child afterwards. You could also try to
63fork-safe, so there is very little you can do after a fork, and in fact, 61call the L<eio_init> function again in the child, which will brutally
64the above might crash, and thus change. 62reinitialise all data structures, which isn't POSIX conformant, but
63typically works.
64
65Otherwise, the only recommendation you should follow is: treat fork code
66the same way you treat signal handlers, and only ever call C<eio_init> in
67the process that uses it, and only once ever.
65 68
66=head1 INITIALISATION/INTEGRATION 69=head1 INITIALISATION/INTEGRATION
67 70
68Before you can call any eio functions you first have to initialise the 71Before you can call any eio functions you first have to initialise the
69library. The library integrates into any event loop, but can also be used 72library. The library integrates into any event loop, but can also be used
78This function initialises the library. On success it returns C<0>, on 81This function initialises the library. On success it returns C<0>, on
79failure it returns C<-1> and sets C<errno> appropriately. 82failure it returns C<-1> and sets C<errno> appropriately.
80 83
81It accepts two function pointers specifying callbacks as argument, both of 84It accepts two function pointers specifying callbacks as argument, both of
82which can be C<0>, in which case the callback isn't called. 85which can be C<0>, in which case the callback isn't called.
86
87There is currently no way to change these callbacks later, or to
88"uninitialise" the library again.
83 89
84=item want_poll callback 90=item want_poll callback
85 91
86The C<want_poll> callback is invoked whenever libeio wants attention (i.e. 92The C<want_poll> callback is invoked whenever libeio wants attention (i.e.
87it wants to be polled by calling C<eio_poll>). It is "edge-triggered", 93it wants to be polled by calling C<eio_poll>). It is "edge-triggered",
131 137
132If C<eio_poll ()> is configured to not handle all results in one go 138If C<eio_poll ()> is configured to not handle all results in one go
133(i.e. it returns C<-1>) then you should start an idle watcher that calls 139(i.e. it returns C<-1>) then you should start an idle watcher that calls
134C<eio_poll> until it returns something C<!= -1>. 140C<eio_poll> until it returns something C<!= -1>.
135 141
136A full-featured conenctor between libeio and libev would look as follows 142A full-featured connector between libeio and libev would look as follows
137(if C<eio_poll> is handling all requests, it can of course be simplified a 143(if C<eio_poll> is handling all requests, it can of course be simplified a
138lot by removing the idle watcher logic): 144lot by removing the idle watcher logic):
139 145
140 static struct ev_loop *loop; 146 static struct ev_loop *loop;
141 static ev_idle repeat_watcher; 147 static ev_idle repeat_watcher;
170 { 176 {
171 loop = EV_DEFAULT; 177 loop = EV_DEFAULT;
172 178
173 ev_idle_init (&repeat_watcher, repeat); 179 ev_idle_init (&repeat_watcher, repeat);
174 ev_async_init (&ready_watcher, ready); 180 ev_async_init (&ready_watcher, ready);
175 ev_async_start (loop &watcher); 181 ev_async_start (loop, &watcher);
176 182
177 eio_init (want_poll, 0); 183 eio_init (want_poll, 0);
178 } 184 }
179 185
180For most other event loops, you would typically use a pipe - the event 186For most other event loops, you would typically use a pipe - the event
183to read that byte, and in the callback for the read end, you would call 189to read that byte, and in the callback for the read end, you would call
184C<eio_poll>. 190C<eio_poll>.
185 191
186You don't have to take special care in the case C<eio_poll> doesn't handle 192You don't have to take special care in the case C<eio_poll> doesn't handle
187all requests, as the done callback will not be invoked, so the event loop 193all requests, as the done callback will not be invoked, so the event loop
188will still signal readyness for the pipe until I<all> results have been 194will still signal readiness for the pipe until I<all> results have been
189processed. 195processed.
190 196
191 197
192=head1 HIGH LEVEL REQUEST API 198=head1 HIGH LEVEL REQUEST API
193 199
227 233
228The C<void *data> member simply stores the value of the C<data> argument. 234The C<void *data> member simply stores the value of the C<data> argument.
229 235
230=back 236=back
231 237
238Members not explicitly described as accessible must not be
239accessed. Specifically, there is no guarantee that any members will still
240have the value they had when the request was submitted.
241
232The return value of the callback is normally C<0>, which tells libeio to 242The return value of the callback is normally C<0>, which tells libeio to
233continue normally. If a callback returns a nonzero value, libeio will 243continue normally. If a callback returns a nonzero value, libeio will
234stop processing results (in C<eio_poll>) and will return the value to its 244stop processing results (in C<eio_poll>) and will return the value to its
235caller. 245caller.
236 246
237Memory areas passed to libeio must stay valid as long as a request 247Memory areas passed to libeio wrappers must stay valid as long as a
238executes, with the exception of paths, which are being copied 248request executes, with the exception of paths, which are being copied
239internally. Any memory libeio itself allocates will be freed after the 249internally. Any memory libeio itself allocates will be freed after the
240finish callback has been called. If you want to manage all memory passed 250finish callback has been called. If you want to manage all memory passed
241to libeio yourself you can use the low-level API. 251to libeio yourself you can use the low-level API.
242 252
243For example, to open a file, you could do this: 253For example, to open a file, you could do this:
261 } 271 }
262 272
263 /* the first three arguments are passed to open(2) */ 273 /* the first three arguments are passed to open(2) */
264 /* the remaining are priority, callback and data */ 274 /* the remaining are priority, callback and data */
265 if (!eio_open ("/etc/passwd", O_RDONLY, 0, 0, file_open_done, 0)) 275 if (!eio_open ("/etc/passwd", O_RDONLY, 0, 0, file_open_done, 0))
266 abort (); /* something ent wrong, we will all die!!! */ 276 abort (); /* something went wrong, we will all die!!! */
267 277
268Note that you additionally need to call C<eio_poll> when the C<want_cb> 278Note that you additionally need to call C<eio_poll> when the C<want_cb>
269indicates that requests are ready to be processed. 279indicates that requests are ready to be processed.
270 280
271=head2 CANCELLING REQUESTS 281=head2 CANCELLING REQUESTS
272 282
273Sometimes the need for a request goes away before the request is 283Sometimes the need for a request goes away before the request is
274finished. In that case, one can cancel the reqiest by a call to 284finished. In that case, one can cancel the request by a call to
275C<eio_cancel>: 285C<eio_cancel>:
276 286
277=over 4 287=over 4
278 288
279=item eio_cancel (eio_req *req) 289=item eio_cancel (eio_req *req)
280 290
281Cancel the request. If the request is currently executing it might still 291Cancel the request (and all its subrequests). If the request is currently
282continue to execute, and in other cases it might still take a while till 292executing it might still continue to execute, and in other cases it might
283the request is cancelled. 293still take a while till the request is cancelled.
284 294
285Even if cancelled, the finish callback will still be invoked - the 295When cancelled, the finish callback will not be invoked.
286callbacks of all cancellable requests need to check whether the request
287has been cancelled by calling C<EIO_CANCELLED (req)>:
288 296
289 static int 297C<EIO_CANCELLED> is still true for requests that have successfully
290 my_eio_cb (eio_req *req) 298executed, as long as C<eio_cancel> was called on them at some point.
291 {
292 if (EIO_CANCELLED (req))
293 return 0;
294 }
295
296In addition, cancelled requests will either have C<< req->result >> set to
297C<-1> and C<errno> to C<ECANCELED>, or otherwise they were successfully
298executed despite being cancelled (e.g. when they have already been
299executed at the time they were cancelled).
300 299
301=back 300=back
302 301
303=head2 AVAILABLE REQUESTS 302=head2 AVAILABLE REQUESTS
304 303
402 free (target); 401 free (target);
403 } 402 }
404 403
405=item eio_realpath (const char *path, int pri, eio_cb cb, void *data) 404=item eio_realpath (const char *path, int pri, eio_cb cb, void *data)
406 405
407Similar to the realpath libc function, but unlike that one, result is 406Similar to the realpath libc function, but unlike that one, C<<
408C<-1> on failure and the length of the returned path in C<ptr2> (which is 407req->result >> is C<-1> on failure. On success, the result is the length
409not 0-terminated) - this is similar to readlink. 408of the returned path in C<ptr2> (which is I<NOT> 0-terminated) - this is
409similar to readlink.
410 410
411=item eio_stat (const char *path, int pri, eio_cb cb, void *data) 411=item eio_stat (const char *path, int pri, eio_cb cb, void *data)
412 412
413=item eio_lstat (const char *path, int pri, eio_cb cb, void *data) 413=item eio_lstat (const char *path, int pri, eio_cb cb, void *data)
414 414
431=back 431=back
432 432
433=head3 READING DIRECTORIES 433=head3 READING DIRECTORIES
434 434
435Reading directories sounds simple, but can be rather demanding, especially 435Reading directories sounds simple, but can be rather demanding, especially
436if you want to do stuff such as traversing a diretcory hierarchy or 436if you want to do stuff such as traversing a directory hierarchy or
437processing all files in a directory. Libeio can assist thess complex tasks 437processing all files in a directory. Libeio can assist these complex tasks
438with it's C<eio_readdir> call. 438with it's C<eio_readdir> call.
439 439
440=over 4 440=over 4
441 441
442=item eio_readdir (const char *path, int flags, int pri, eio_cb cb, void *data) 442=item eio_readdir (const char *path, int flags, int pri, eio_cb cb, void *data)
534When this flag is specified, then the names will be returned in an order 534When this flag is specified, then the names will be returned in an order
535suitable for stat()'ing each one. That is, when you plan to stat() 535suitable for stat()'ing each one. That is, when you plan to stat()
536all files in the given directory, then the returned order will likely 536all files in the given directory, then the returned order will likely
537be fastest. 537be fastest.
538 538
539If both this flag and C<EIO_READDIR_DIRS_FIRST> are specified, then 539If both this flag and C<EIO_READDIR_DIRS_FIRST> are specified, then the
540the likely dirs come first, resulting in a less optimal stat order. 540likely directories come first, resulting in a less optimal stat order.
541 541
542=item EIO_READDIR_FOUND_UNKNOWN 542=item EIO_READDIR_FOUND_UNKNOWN
543 543
544This flag should not be specified when calling C<eio_readdir>. Instead, 544This flag should not be specified when calling C<eio_readdir>. Instead,
545it is being set by C<eio_readdir> (you can access the C<flags> via C<< 545it is being set by C<eio_readdir> (you can access the C<flags> via C<<
546req->int1 >>, when any of the C<type>'s found were C<EIO_DT_UNKNOWN>. The 546req->int1 >>, when any of the C<type>'s found were C<EIO_DT_UNKNOWN>. The
547absense of this flag therefore indicates that all C<type>'s are known, 547absence of this flag therefore indicates that all C<type>'s are known,
548which can be used to speed up some algorithms. 548which can be used to speed up some algorithms.
549 549
550A typical use case would be to identify all subdirectories within a 550A typical use case would be to identify all subdirectories within a
551directory - you would ask C<eio_readdir> for C<EIO_READDIR_DIRS_FIRST>. If 551directory - you would ask C<eio_readdir> for C<EIO_READDIR_DIRS_FIRST>. If
552then this flag is I<NOT> set, then all the entries at the beginning of the 552then this flag is I<NOT> set, then all the entries at the beginning of the
582=item eio_readahead (int fd, off_t offset, size_t length, int pri, eio_cb cb, void *data) 582=item eio_readahead (int fd, off_t offset, size_t length, int pri, eio_cb cb, void *data)
583 583
584Calls C<readahead(2)>. If the syscall is missing, then the call is 584Calls C<readahead(2)>. If the syscall is missing, then the call is
585emulated by simply reading the data (currently in 64kiB chunks). 585emulated by simply reading the data (currently in 64kiB chunks).
586 586
587=item eio_syncfs (int fd, int pri, eio_cb cb, void *data)
588
589Calls Linux' C<syncfs> syscall, if available. Returns C<-1> and sets
590C<errno> to C<ENOSYS> if the call is missing I<but still calls sync()>,
591if the C<fd> is C<< >= 0 >>, so you can probe for the availability of the
592syscall with a negative C<fd> argument and checking for C<-1/ENOSYS>.
593
587=item eio_sync_file_range (int fd, off_t offset, size_t nbytes, unsigned int flags, int pri, eio_cb cb, void *data) 594=item eio_sync_file_range (int fd, off_t offset, size_t nbytes, unsigned int flags, int pri, eio_cb cb, void *data)
588 595
589Calls C<sync_file_range>. If the syscall is missing, then this is the same 596Calls C<sync_file_range>. If the syscall is missing, then this is the same
590as calling C<fdatasync>. 597as calling C<fdatasync>.
591 598
592Flags can be any combination of C<EIO_SYNC_FILE_RANGE_WAIT_BEFORE>, 599Flags can be any combination of C<EIO_SYNC_FILE_RANGE_WAIT_BEFORE>,
593C<EIO_SYNC_FILE_RANGE_WRITE> and C<EIO_SYNC_FILE_RANGE_WAIT_AFTER>. 600C<EIO_SYNC_FILE_RANGE_WRITE> and C<EIO_SYNC_FILE_RANGE_WAIT_AFTER>.
594 601
602=item eio_fallocate (int fd, int mode, off_t offset, off_t len, int pri, eio_cb cb, void *data)
603
604Calls C<fallocate> (note: I<NOT> C<posix_fallocate>!). If the syscall is
605missing, then it returns failure and sets C<errno> to C<ENOSYS>.
606
607The C<mode> argument can be C<0> (for behaviour similar to
608C<posix_fallocate>), or C<EIO_FALLOC_FL_KEEP_SIZE>, which keeps the size
609of the file unchanged (but still preallocates space beyond end of file).
610
595=back 611=back
596 612
597=head3 LIBEIO-SPECIFIC REQUESTS 613=head3 LIBEIO-SPECIFIC REQUESTS
598 614
599These requests are specific to libeio and do not correspond to any OS call. 615These requests are specific to libeio and do not correspond to any OS call.
600 616
601=over 4 617=over 4
602 618
603=item eio_mtouch (void *addr, size_t length, int flags, int pri, eio_cb cb, void *data) 619=item eio_mtouch (void *addr, size_t length, int flags, int pri, eio_cb cb, void *data)
604 620
605Reads (C<flags == 0>) or modifies (C<flags == EIO_MT_MODIFY) the given 621Reads (C<flags == 0>) or modifies (C<flags == EIO_MT_MODIFY>) the given
606memory area, page-wise, that is, it reads (or reads and writes back) the 622memory area, page-wise, that is, it reads (or reads and writes back) the
607first octet of every page that spans the memory area. 623first octet of every page that spans the memory area.
608 624
609This can be used to page in some mmapped file, or dirty some pages. Note 625This can be used to page in some mmapped file, or dirty some pages. Note
610that dirtying is an unlocked read-write access, so races can ensue when 626that dirtying is an unlocked read-write access, so races can ensue when
640 656
641 eio_custom (my_open, 0, my_open_done, "/etc/passwd"); 657 eio_custom (my_open, 0, my_open_done, "/etc/passwd");
642 658
643=item eio_busy (eio_tstamp delay, int pri, eio_cb cb, void *data) 659=item eio_busy (eio_tstamp delay, int pri, eio_cb cb, void *data)
644 660
645This is a a request that takes C<delay> seconds to execute, but otherwise 661This is a request that takes C<delay> seconds to execute, but otherwise
646does nothing - it simply puts one of the worker threads to sleep for this 662does nothing - it simply puts one of the worker threads to sleep for this
647long. 663long.
648 664
649This request can be used to artificially increase load, e.g. for debugging 665This request can be used to artificially increase load, e.g. for debugging
650or benchmarking reasons. 666or benchmarking reasons.
666There are two primary use cases for this: a) bundle many requests into a 682There are two primary use cases for this: a) bundle many requests into a
667single, composite, request with a definite callback and the ability to 683single, composite, request with a definite callback and the ability to
668cancel the whole request with its subrequests and b) limiting the number 684cancel the whole request with its subrequests and b) limiting the number
669of "active" requests. 685of "active" requests.
670 686
671Further below you will find more dicussion of these topics - first follows 687Further below you will find more discussion of these topics - first
672the reference section detailing the request generator and other methods. 688follows the reference section detailing the request generator and other
689methods.
673 690
674=over 4 691=over 4
675 692
676=item eio_req *grp = eio_grp (eio_cb cb, void *data) 693=item eio_req *grp = eio_grp (eio_cb cb, void *data)
677 694
678Creates, submits and returns a group request. 695Creates, submits and returns a group request. Note that it doesn't have a
696priority, unlike all other requests.
679 697
680=item eio_grp_add (eio_req *grp, eio_req *req) 698=item eio_grp_add (eio_req *grp, eio_req *req)
681 699
682Adds a request to the request group. 700Adds a request to the request group.
683 701
684=item eio_grp_cancel (eio_req *grp) 702=item eio_grp_cancel (eio_req *grp)
685 703
686Cancels all requests I<in> the group, but I<not> the group request 704Cancels all requests I<in> the group, but I<not> the group request
687itself. You can cancel the group request via a normal C<eio_cancel> call. 705itself. You can cancel the group request I<and> all subrequests via a
706normal C<eio_cancel> call.
688 707
689
690
691=back 708=back
692 709
710=head4 GROUP REQUEST LIFETIME
711
712Left alone, a group request will instantly move to the pending state and
713will be finished at the next call of C<eio_poll>.
714
715The usefulness stems from the fact that, if a subrequest is added to a
716group I<before> a call to C<eio_poll>, via C<eio_grp_add>, then the group
717will not finish until all the subrequests have finished.
718
719So the usage cycle of a group request is like this: after it is created,
720you normally instantly add a subrequest. If none is added, the group
721request will finish on it's own. As long as subrequests are added before
722the group request is finished it will be kept from finishing, that is the
723callbacks of any subrequests can, in turn, add more requests to the group,
724and as long as any requests are active, the group request itself will not
725finish.
726
727=head4 CREATING COMPOSITE REQUESTS
728
729Imagine you wanted to create an C<eio_load> request that opens a file,
730reads it and closes it. This means it has to execute at least three eio
731requests, but for various reasons it might be nice if that request looked
732like any other eio request.
733
734This can be done with groups:
735
736=over 4
737
738=item 1) create the request object
739
740Create a group that contains all further requests. This is the request you
741can return as "the load request".
742
743=item 2) open the file, maybe
744
745Next, open the file with C<eio_open> and add the request to the group
746request and you are finished setting up the request.
747
748If, for some reason, you cannot C<eio_open> (path is a null ptr?) you
749can set C<< grp->result >> to C<-1> to signal an error and let the group
750request finish on its own.
751
752=item 3) open callback adds more requests
753
754In the open callback, if the open was not successful, copy C<<
755req->errorno >> to C<< grp->errorno >> and set C<< grp->result >> to
756C<-1> to signal an error.
757
758Otherwise, malloc some memory or so and issue a read request, adding the
759read request to the group.
760
761=item 4) continue issuing requests till finished
762
763In the read callback, check for errors and possibly continue with
764C<eio_close> or any other eio request in the same way.
765
766As soon as no new requests are added, the group request will finish. Make
767sure you I<always> set C<< grp->result >> to some sensible value.
768
769=back
770
771=head4 REQUEST LIMITING
693 772
694 773
695#TODO 774#TODO
696 775
697/*****************************************************************************/
698/* groups */
699
700eio_req *eio_grp (eio_cb cb, void *data);
701void eio_grp_feed (eio_req *grp, void (*feed)(eio_req *req), int limit);
702void eio_grp_limit (eio_req *grp, int limit); 776void eio_grp_limit (eio_req *grp, int limit);
703void eio_grp_cancel (eio_req *grp); /* cancels all sub requests but not the group */
704 777
705
706=back
707 778
708 779
709=head1 LOW LEVEL REQUEST API 780=head1 LOW LEVEL REQUEST API
710 781
711#TODO 782#TODO
748for example, in interactive programs, you might want to limit this time to 819for example, in interactive programs, you might want to limit this time to
749C<0.01> seconds or so. 820C<0.01> seconds or so.
750 821
751Note that: 822Note that:
752 823
824=over 4
825
753a) libeio doesn't know how long your request callbacks take, so the time 826=item a) libeio doesn't know how long your request callbacks take, so the
754spent in C<eio_poll> is up to one callback invocation longer then this 827time spent in C<eio_poll> is up to one callback invocation longer then
755interval. 828this interval.
756 829
757b) this is implemented by calling C<gettimeofday> after each request, 830=item b) this is implemented by calling C<gettimeofday> after each
758which can be costly. 831request, which can be costly.
759 832
760c) at least one request will be handled. 833=item c) at least one request will be handled.
834
835=back
761 836
762=item eio_set_max_poll_reqs (unsigned int nreqs) 837=item eio_set_max_poll_reqs (unsigned int nreqs)
763 838
764When C<nreqs> is non-zero, then C<eio_poll> will not handle more than 839When C<nreqs> is non-zero, then C<eio_poll> will not handle more than
765C<nreqs> requests per invocation. This is a less costly way to limit the 840C<nreqs> requests per invocation. This is a less costly way to limit the
835This symbol governs the stack size for each eio thread. Libeio itself 910This symbol governs the stack size for each eio thread. Libeio itself
836was written to use very little stackspace, but when using C<EIO_CUSTOM> 911was written to use very little stackspace, but when using C<EIO_CUSTOM>
837requests, you might want to increase this. 912requests, you might want to increase this.
838 913
839If this symbol is undefined (the default) then libeio will use its default 914If this symbol is undefined (the default) then libeio will use its default
840stack size (C<sizeof (long) * 4096> currently). If it is defined, but 915stack size (C<sizeof (void *) * 4096> currently). In all other cases, the
841C<0>, then the default operating system stack size will be used. In all
842other cases, the value must be an expression that evaluates to the desired 916value must be an expression that evaluates to the desired stack size.
843stack size.
844 917
845=back 918=back
846 919
847 920
848=head1 PORTABILITY REQUIREMENTS 921=head1 PORTABILITY REQUIREMENTS

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines