ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libeio/eio.pod
(Generate patch)

Comparing libeio/eio.pod (file contents):
Revision 1.15 by root, Tue Jul 5 16:57:41 2011 UTC vs.
Revision 1.24 by sf-exg, Thu Jul 14 08:51:29 2011 UTC

45Unlike the name component C<stamp> might indicate, it is also used for 45Unlike the name component C<stamp> might indicate, it is also used for
46time differences throughout libeio. 46time differences throughout libeio.
47 47
48=head2 FORK SUPPORT 48=head2 FORK SUPPORT
49 49
50Calling C<fork ()> is fully supported by this module. It is implemented in these steps: 50Calling C<fork ()> is fully supported by this module - but you must not
51rely on this. It is currently implemented in these steps:
51 52
52 1. wait till all requests in "execute" state have been handled 53 1. wait till all requests in "execute" state have been handled
53 (basically requests that are already handed over to the kernel). 54 (basically requests that are already handed over to the kernel).
54 2. fork 55 2. fork
55 3. in the parent, continue business as usual, done 56 3. in the parent, continue business as usual, done
56 4. in the child, destroy all ready and pending requests and free the 57 4. in the child, destroy all ready and pending requests and free the
57 memory used by the worker threads. This gives you a fully empty 58 memory used by the worker threads. This gives you a fully empty
58 libeio queue. 59 libeio queue.
59 60
60Note, however, since libeio does use threads, thr above guarantee doesn't 61Note, however, since libeio does use threads, the above guarantee doesn't
61cover your libc, for example, malloc and other libc functions are not 62cover your libc, for example, malloc and other libc functions are not
62fork-safe, so there is very little you can do after a fork, and in fatc, 63fork-safe, so there is very little you can do after a fork, and in fact,
63the above might crash, and thus change. 64the above might crash, and thus change.
64 65
65=head1 INITIALISATION/INTEGRATION 66=head1 INITIALISATION/INTEGRATION
66 67
67Before you can call any eio functions you first have to initialise the 68Before you can call any eio functions you first have to initialise the
130 131
131If C<eio_poll ()> is configured to not handle all results in one go 132If C<eio_poll ()> is configured to not handle all results in one go
132(i.e. it returns C<-1>) then you should start an idle watcher that calls 133(i.e. it returns C<-1>) then you should start an idle watcher that calls
133C<eio_poll> until it returns something C<!= -1>. 134C<eio_poll> until it returns something C<!= -1>.
134 135
135A full-featured wrapper would look as follows (if C<eio_poll> is handling 136A full-featured connector between libeio and libev would look as follows
136all requests, it can of course be simplified a lot by removing the idle 137(if C<eio_poll> is handling all requests, it can of course be simplified a
137watcher logic): 138lot by removing the idle watcher logic):
138 139
139 static struct ev_loop *loop; 140 static struct ev_loop *loop;
140 static ev_idle repeat_watcher; 141 static ev_idle repeat_watcher;
141 static ev_async ready_watcher; 142 static ev_async ready_watcher;
142 143
143 /* idle watcher callback, only used when eio_poll */ 144 /* idle watcher callback, only used when eio_poll */
144 /* didn't handle all results in one call */ 145 /* didn't handle all results in one call */
145 static void 146 static void
146 repeat (EV_P_ ev_idle *w, int revents) 147 repeat (EV_P_ ev_idle *w, int revents)
147 { 148 {
148 if (eio_poll () != -1) 149 if (eio_poll () != -1)
149 ev_idle_stop (EV_A_ w); 150 ev_idle_stop (EV_A_ w);
150 } 151 }
151 152
152 /* eio has some results, process them */ 153 /* eio has some results, process them */
153 static void 154 static void
154 ready (EV_P_ ev_async *w, int revents) 155 ready (EV_P_ ev_async *w, int revents)
155 { 156 {
156 if (eio_poll () == -1) 157 if (eio_poll () == -1)
157 ev_idle_start (EV_A_ &repeat_watcher); 158 ev_idle_start (EV_A_ &repeat_watcher);
158 } 159 }
159 160
160 /* wake up the event loop */ 161 /* wake up the event loop */
161 static void 162 static void
162 want_poll (void) 163 want_poll (void)
163 { 164 {
164 ev_async_send (loop, &ready_watcher) 165 ev_async_send (loop, &ready_watcher)
165 } 166 }
166 167
167 void 168 void
168 my_init_eio () 169 my_init_eio ()
169 { 170 {
170 loop = EV_DEFAULT; 171 loop = EV_DEFAULT;
171 172
172 ev_idle_init (&repeat_watcher, repeat); 173 ev_idle_init (&repeat_watcher, repeat);
173 ev_async_init (&ready_watcher, ready); 174 ev_async_init (&ready_watcher, ready);
174 ev_async_start (loop &watcher); 175 ev_async_start (loop &watcher);
175 176
176 eio_init (want_poll, 0); 177 eio_init (want_poll, 0);
177 } 178 }
178 179
179For most other event loops, you would typically use a pipe - the event 180For most other event loops, you would typically use a pipe - the event
180loop should be told to wait for read readiness on the read end. In 181loop should be told to wait for read readiness on the read end. In
181C<want_poll> you would write a single byte, in C<done_poll> you would try 182C<want_poll> you would write a single byte, in C<done_poll> you would try
182to read that byte, and in the callback for the read end, you would call 183to read that byte, and in the callback for the read end, you would call
183C<eio_poll>. The race is avoided here because the event loop should invoke 184C<eio_poll>.
184your callback again and again until the byte has been read (as the pipe 185
185read callback does not read it, only C<done_poll>). 186You don't have to take special care in the case C<eio_poll> doesn't handle
187all requests, as the done callback will not be invoked, so the event loop
188will still signal readiness for the pipe until I<all> results have been
189processed.
186 190
187 191
188=head1 HIGH LEVEL REQUEST API 192=head1 HIGH LEVEL REQUEST API
189 193
190Libeio has both a high-level API, which consists of calling a request 194Libeio has both a high-level API, which consists of calling a request
257 } 261 }
258 262
259 /* the first three arguments are passed to open(2) */ 263 /* the first three arguments are passed to open(2) */
260 /* the remaining are priority, callback and data */ 264 /* the remaining are priority, callback and data */
261 if (!eio_open ("/etc/passwd", O_RDONLY, 0, 0, file_open_done, 0)) 265 if (!eio_open ("/etc/passwd", O_RDONLY, 0, 0, file_open_done, 0))
262 abort (); /* something ent wrong, we will all die!!! */ 266 abort (); /* something went wrong, we will all die!!! */
263 267
264Note that you additionally need to call C<eio_poll> when the C<want_cb> 268Note that you additionally need to call C<eio_poll> when the C<want_cb>
265indicates that requests are ready to be processed. 269indicates that requests are ready to be processed.
270
271=head2 CANCELLING REQUESTS
272
273Sometimes the need for a request goes away before the request is
274finished. In that case, one can cancel the request by a call to
275C<eio_cancel>:
276
277=over 4
278
279=item eio_cancel (eio_req *req)
280
281Cancel the request (and all its subrequests). If the request is currently
282executing it might still continue to execute, and in other cases it might
283still take a while till the request is cancelled.
284
285Even if cancelled, the finish callback will still be invoked - the
286callbacks of all cancellable requests need to check whether the request
287has been cancelled by calling C<EIO_CANCELLED (req)>:
288
289 static int
290 my_eio_cb (eio_req *req)
291 {
292 if (EIO_CANCELLED (req))
293 return 0;
294 }
295
296In addition, cancelled requests will I<either> have C<< req->result >>
297set to C<-1> and C<errno> to C<ECANCELED>, or I<otherwise> they were
298successfully executed, despite being cancelled (e.g. when they have
299already been executed at the time they were cancelled).
300
301C<EIO_CANCELLED> is still true for requests that have successfully
302executed, as long as C<eio_cancel> was called on them at some point.
303
304=back
266 305
267=head2 AVAILABLE REQUESTS 306=head2 AVAILABLE REQUESTS
268 307
269The following request functions are available. I<All> of them return the 308The following request functions are available. I<All> of them return the
270C<eio_req *> on success and C<0> on failure, and I<all> of them have the 309C<eio_req *> on success and C<0> on failure, and I<all> of them have the
366 free (target); 405 free (target);
367 } 406 }
368 407
369=item eio_realpath (const char *path, int pri, eio_cb cb, void *data) 408=item eio_realpath (const char *path, int pri, eio_cb cb, void *data)
370 409
371Similar to the realpath libc function, but unlike that one, result is 410Similar to the realpath libc function, but unlike that one, C<<
372C<-1> on failure and the length of the returned path in C<ptr2> (which is 411req->result >> is C<-1> on failure. On success, the result is the length
373not 0-terminated) - this is similar to readlink. 412of the returned path in C<ptr2> (which is I<NOT> 0-terminated) - this is
413similar to readlink.
374 414
375=item eio_stat (const char *path, int pri, eio_cb cb, void *data) 415=item eio_stat (const char *path, int pri, eio_cb cb, void *data)
376 416
377=item eio_lstat (const char *path, int pri, eio_cb cb, void *data) 417=item eio_lstat (const char *path, int pri, eio_cb cb, void *data)
378 418
379=item eio_fstat (int fd, int pri, eio_cb cb, void *data) 419=item eio_fstat (int fd, int pri, eio_cb cb, void *data)
380 420
381Stats a file - if C<< req->result >> indicates success, then you can 421Stats a file - if C<< req->result >> indicates success, then you can
382access the C<struct stat>-like structure via C<< req->ptr2 >>: 422access the C<struct stat>-like structure via C<< req->ptr2 >>:
383 423
384 EIO_STRUCT_STAT *statdata = (EIO_STRUCT_STAT *)req->ptr2; 424 EIO_STRUCT_STAT *statdata = (EIO_STRUCT_STAT *)req->ptr2;
385 425
386=item eio_statvfs (const char *path, int pri, eio_cb cb, void *data) 426=item eio_statvfs (const char *path, int pri, eio_cb cb, void *data)
387 427
388=item eio_fstatvfs (int fd, int pri, eio_cb cb, void *data) 428=item eio_fstatvfs (int fd, int pri, eio_cb cb, void *data)
389 429
390Stats a filesystem - if C<< req->result >> indicates success, then you can 430Stats a filesystem - if C<< req->result >> indicates success, then you can
391access the C<struct statvfs>-like structure via C<< req->ptr2 >>: 431access the C<struct statvfs>-like structure via C<< req->ptr2 >>:
392 432
393 EIO_STRUCT_STATVFS *statdata = (EIO_STRUCT_STATVFS *)req->ptr2; 433 EIO_STRUCT_STATVFS *statdata = (EIO_STRUCT_STATVFS *)req->ptr2;
394 434
395=back 435=back
396 436
397=head3 READING DIRECTORIES 437=head3 READING DIRECTORIES
398 438
399Reading directories sounds simple, but can be rather demanding, especially 439Reading directories sounds simple, but can be rather demanding, especially
400if you want to do stuff such as traversing a diretcory hierarchy or 440if you want to do stuff such as traversing a directory hierarchy or
401processing all files in a directory. Libeio can assist thess complex tasks 441processing all files in a directory. Libeio can assist these complex tasks
402with it's C<eio_readdir> call. 442with it's C<eio_readdir> call.
403 443
404=over 4 444=over 4
405 445
406=item eio_readdir (const char *path, int flags, int pri, eio_cb cb, void *data) 446=item eio_readdir (const char *path, int flags, int pri, eio_cb cb, void *data)
438 478
439If this flag is specified, then, in addition to the names in C<ptr2>, 479If this flag is specified, then, in addition to the names in C<ptr2>,
440also an array of C<struct eio_dirent> is returned, in C<ptr1>. A C<struct 480also an array of C<struct eio_dirent> is returned, in C<ptr1>. A C<struct
441eio_dirent> looks like this: 481eio_dirent> looks like this:
442 482
443 struct eio_dirent 483 struct eio_dirent
444 { 484 {
445 int nameofs; /* offset of null-terminated name string in (char *)req->ptr2 */ 485 int nameofs; /* offset of null-terminated name string in (char *)req->ptr2 */
446 unsigned short namelen; /* size of filename without trailing 0 */ 486 unsigned short namelen; /* size of filename without trailing 0 */
447 unsigned char type; /* one of EIO_DT_* */ 487 unsigned char type; /* one of EIO_DT_* */
448 signed char score; /* internal use */ 488 signed char score; /* internal use */
449 ino_t inode; /* the inode number, if available, otherwise unspecified */ 489 ino_t inode; /* the inode number, if available, otherwise unspecified */
450 }; 490 };
451 491
452The only members you normally would access are C<nameofs>, which is the 492The only members you normally would access are C<nameofs>, which is the
453byte-offset from C<ptr2> to the start of the name, C<namelen> and C<type>. 493byte-offset from C<ptr2> to the start of the name, C<namelen> and C<type>.
454 494
455C<type> can be one of: 495C<type> can be one of:
498When this flag is specified, then the names will be returned in an order 538When this flag is specified, then the names will be returned in an order
499suitable for stat()'ing each one. That is, when you plan to stat() 539suitable for stat()'ing each one. That is, when you plan to stat()
500all files in the given directory, then the returned order will likely 540all files in the given directory, then the returned order will likely
501be fastest. 541be fastest.
502 542
503If both this flag and C<EIO_READDIR_DIRS_FIRST> are specified, then 543If both this flag and C<EIO_READDIR_DIRS_FIRST> are specified, then the
504the likely dirs come first, resulting in a less optimal stat order. 544likely directories come first, resulting in a less optimal stat order.
505 545
506=item EIO_READDIR_FOUND_UNKNOWN 546=item EIO_READDIR_FOUND_UNKNOWN
507 547
508This flag should not be specified when calling C<eio_readdir>. Instead, 548This flag should not be specified when calling C<eio_readdir>. Instead,
509it is being set by C<eio_readdir> (you can access the C<flags> via C<< 549it is being set by C<eio_readdir> (you can access the C<flags> via C<<
510req->int1 >>, when any of the C<type>'s found were C<EIO_DT_UNKNOWN>. The 550req->int1 >>, when any of the C<type>'s found were C<EIO_DT_UNKNOWN>. The
511absense of this flag therefore indicates that all C<type>'s are known, 551absence of this flag therefore indicates that all C<type>'s are known,
512which can be used to speed up some algorithms. 552which can be used to speed up some algorithms.
513 553
514A typical use case would be to identify all subdirectories within a 554A typical use case would be to identify all subdirectories within a
515directory - you would ask C<eio_readdir> for C<EIO_READDIR_DIRS_FIRST>. If 555directory - you would ask C<eio_readdir> for C<EIO_READDIR_DIRS_FIRST>. If
516then this flag is I<NOT> set, then all the entries at the beginning of the 556then this flag is I<NOT> set, then all the entries at the beginning of the
554as calling C<fdatasync>. 594as calling C<fdatasync>.
555 595
556Flags can be any combination of C<EIO_SYNC_FILE_RANGE_WAIT_BEFORE>, 596Flags can be any combination of C<EIO_SYNC_FILE_RANGE_WAIT_BEFORE>,
557C<EIO_SYNC_FILE_RANGE_WRITE> and C<EIO_SYNC_FILE_RANGE_WAIT_AFTER>. 597C<EIO_SYNC_FILE_RANGE_WRITE> and C<EIO_SYNC_FILE_RANGE_WAIT_AFTER>.
558 598
599=item eio_fallocate (int fd, int mode, off_t offset, off_t len, int pri, eio_cb cb, void *data)
600
601Calls C<fallocate> (note: I<NOT> C<posix_fallocate>!). If the syscall is
602missing, then it returns failure and sets C<errno> to C<ENOSYS>.
603
604The C<mode> argument can be C<0> (for behaviour similar to
605C<posix_fallocate>), or C<EIO_FALLOC_FL_KEEP_SIZE>, which keeps the size
606of the file unchanged (but still preallocates space beyond end of file).
607
559=back 608=back
560 609
561=head3 LIBEIO-SPECIFIC REQUESTS 610=head3 LIBEIO-SPECIFIC REQUESTS
562 611
563These requests are specific to libeio and do not correspond to any OS call. 612These requests are specific to libeio and do not correspond to any OS call.
604 653
605 eio_custom (my_open, 0, my_open_done, "/etc/passwd"); 654 eio_custom (my_open, 0, my_open_done, "/etc/passwd");
606 655
607=item eio_busy (eio_tstamp delay, int pri, eio_cb cb, void *data) 656=item eio_busy (eio_tstamp delay, int pri, eio_cb cb, void *data)
608 657
609This is a a request that takes C<delay> seconds to execute, but otherwise 658This is a request that takes C<delay> seconds to execute, but otherwise
610does nothing - it simply puts one of the worker threads to sleep for this 659does nothing - it simply puts one of the worker threads to sleep for this
611long. 660long.
612 661
613This request can be used to artificially increase load, e.g. for debugging 662This request can be used to artificially increase load, e.g. for debugging
614or benchmarking reasons. 663or benchmarking reasons.
630There are two primary use cases for this: a) bundle many requests into a 679There are two primary use cases for this: a) bundle many requests into a
631single, composite, request with a definite callback and the ability to 680single, composite, request with a definite callback and the ability to
632cancel the whole request with its subrequests and b) limiting the number 681cancel the whole request with its subrequests and b) limiting the number
633of "active" requests. 682of "active" requests.
634 683
635Further below you will find more dicussion of these topics - first follows 684Further below you will find more discussion of these topics - first
636the reference section detailing the request generator and other methods. 685follows the reference section detailing the request generator and other
686methods.
637 687
638=over 4 688=over 4
639 689
640=item eio_grp (eio_cb cb, void *data) 690=item eio_req *grp = eio_grp (eio_cb cb, void *data)
641 691
642Creates and submits a group request. 692Creates, submits and returns a group request. Note that it doesn't have a
693priority, unlike all other requests.
643 694
644=back 695=item eio_grp_add (eio_req *grp, eio_req *req)
645 696
697Adds a request to the request group.
698
699=item eio_grp_cancel (eio_req *grp)
700
701Cancels all requests I<in> the group, but I<not> the group request
702itself. You can cancel the group request I<and> all subrequests via a
703normal C<eio_cancel> call.
704
705=back
706
707=head4 GROUP REQUEST LIFETIME
708
709Left alone, a group request will instantly move to the pending state and
710will be finished at the next call of C<eio_poll>.
711
712The usefulness stems from the fact that, if a subrequest is added to a
713group I<before> a call to C<eio_poll>, via C<eio_grp_add>, then the group
714will not finish until all the subrequests have finished.
715
716So the usage cycle of a group request is like this: after it is created,
717you normally instantly add a subrequest. If none is added, the group
718request will finish on it's own. As long as subrequests are added before
719the group request is finished it will be kept from finishing, that is the
720callbacks of any subrequests can, in turn, add more requests to the group,
721and as long as any requests are active, the group request itself will not
722finish.
723
724=head4 CREATING COMPOSITE REQUESTS
725
726Imagine you wanted to create an C<eio_load> request that opens a file,
727reads it and closes it. This means it has to execute at least three eio
728requests, but for various reasons it might be nice if that request looked
729like any other eio request.
730
731This can be done with groups:
732
733=over 4
734
735=item 1) create the request object
736
737Create a group that contains all further requests. This is the request you
738can return as "the load request".
739
740=item 2) open the file, maybe
741
742Next, open the file with C<eio_open> and add the request to the group
743request and you are finished setting up the request.
744
745If, for some reason, you cannot C<eio_open> (path is a null ptr?) you
746can set C<< grp->result >> to C<-1> to signal an error and let the group
747request finish on its own.
748
749=item 3) open callback adds more requests
750
751In the open callback, if the open was not successful, copy C<<
752req->errorno >> to C<< grp->errorno >> and set C<< grp->errorno >> to
753C<-1> to signal an error.
754
755Otherwise, malloc some memory or so and issue a read request, adding the
756read request to the group.
757
758=item 4) continue issuing requests till finished
759
760In the real callback, check for errors and possibly continue with
761C<eio_close> or any other eio request in the same way.
762
763As soon as no new requests are added the group request will finish. Make
764sure you I<always> set C<< grp->result >> to some sensible value.
765
766=back
767
768=head4 REQUEST LIMITING
646 769
647 770
648#TODO 771#TODO
649 772
650/*****************************************************************************/
651/* groups */
652
653eio_req *eio_grp (eio_cb cb, void *data);
654void eio_grp_feed (eio_req *grp, void (*feed)(eio_req *req), int limit);
655void eio_grp_limit (eio_req *grp, int limit); 773void eio_grp_limit (eio_req *grp, int limit);
656void eio_grp_add (eio_req *grp, eio_req *req);
657void eio_grp_cancel (eio_req *grp); /* cancels all sub requests but not the group */
658 774
659 775
660=back 776=back
661 777
662 778
668=head1 ANATOMY AND LIFETIME OF AN EIO REQUEST 784=head1 ANATOMY AND LIFETIME OF AN EIO REQUEST
669 785
670A request is represented by a structure of type C<eio_req>. To initialise 786A request is represented by a structure of type C<eio_req>. To initialise
671it, clear it to all zero bytes: 787it, clear it to all zero bytes:
672 788
673 eio_req req; 789 eio_req req;
674 790
675 memset (&req, 0, sizeof (req)); 791 memset (&req, 0, sizeof (req));
676 792
677A more common way to initialise a new C<eio_req> is to use C<calloc>: 793A more common way to initialise a new C<eio_req> is to use C<calloc>:
678 794
679 eio_req *req = calloc (1, sizeof (*req)); 795 eio_req *req = calloc (1, sizeof (*req));
680 796
681In either case, libeio neither allocates, initialises or frees the 797In either case, libeio neither allocates, initialises or frees the
682C<eio_req> structure for you - it merely uses it. 798C<eio_req> structure for you - it merely uses it.
683 799
684zero 800zero
702for example, in interactive programs, you might want to limit this time to 818for example, in interactive programs, you might want to limit this time to
703C<0.01> seconds or so. 819C<0.01> seconds or so.
704 820
705Note that: 821Note that:
706 822
823=over 4
824
707a) libeio doesn't know how long your request callbacks take, so the time 825=item a) libeio doesn't know how long your request callbacks take, so the
708spent in C<eio_poll> is up to one callback invocation longer then this 826time spent in C<eio_poll> is up to one callback invocation longer then
709interval. 827this interval.
710 828
711b) this is implemented by calling C<gettimeofday> after each request, 829=item b) this is implemented by calling C<gettimeofday> after each
712which can be costly. 830request, which can be costly.
713 831
714c) at least one request will be handled. 832=item c) at least one request will be handled.
833
834=back
715 835
716=item eio_set_max_poll_reqs (unsigned int nreqs) 836=item eio_set_max_poll_reqs (unsigned int nreqs)
717 837
718When C<nreqs> is non-zero, then C<eio_poll> will not handle more than 838When C<nreqs> is non-zero, then C<eio_poll> will not handle more than
719C<nreqs> requests per invocation. This is a less costly way to limit the 839C<nreqs> requests per invocation. This is a less costly way to limit the

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines