ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/libev/ev.pod
(Generate patch)

Comparing libev/ev.pod (file contents):
Revision 1.202 by root, Fri Oct 24 08:30:01 2008 UTC vs.
Revision 1.207 by root, Tue Oct 28 14:13:52 2008 UTC

384=item C<EVBACKEND_EPOLL> (value 4, Linux) 384=item C<EVBACKEND_EPOLL> (value 4, Linux)
385 385
386For few fds, this backend is a bit little slower than poll and select, 386For few fds, this backend is a bit little slower than poll and select,
387but it scales phenomenally better. While poll and select usually scale 387but it scales phenomenally better. While poll and select usually scale
388like O(total_fds) where n is the total number of fds (or the highest fd), 388like O(total_fds) where n is the total number of fds (or the highest fd),
389epoll scales either O(1) or O(active_fds). The epoll design has a number 389epoll scales either O(1) or O(active_fds).
390of shortcomings, such as silently dropping events in some hard-to-detect 390
391cases and requiring a system call per fd change, no fork support and bad 391The epoll syscalls are the most misdesigned of the more advanced event
392support for dup. 392mechanisms: problems include silently dropping fds, requiring a system
393call per change per fd (and unnecessary guessing of parameters), problems
394with dup and so on. The biggest issue is fork races, however - if a
395program forks then I<both> parent and child process have to recreate the
396epoll set, which can take considerable time (one syscall per fd) and is of
397course hard to detect.
398
399Epoll is also notoriously buggy - embedding epoll fds should work, but
400of course doesn't, and epoll just loves to report events for totally
401I<different> file descriptors (even already closed ones, so one cannot
402even remove them from the set) than registered in the set (especially
403on SMP systems). Libev tries to counter these spurious notifications by
404employing an additional generation counter and comparing that against the
405events to filter out spurious ones.
393 406
394While stopping, setting and starting an I/O watcher in the same iteration 407While stopping, setting and starting an I/O watcher in the same iteration
395will result in some caching, there is still a system call per such incident 408will result in some caching, there is still a system call per such incident
396(because the fd could point to a different file description now), so its 409(because the fd could point to a different file description now), so its
397best to avoid that. Also, C<dup ()>'ed file descriptors might not work 410best to avoid that. Also, C<dup ()>'ed file descriptors might not work
398very well if you register events for both fds. 411very well if you register events for both fds.
399 412
400Please note that epoll sometimes generates spurious notifications, so you
401need to use non-blocking I/O or other means to avoid blocking when no data
402(or space) is available.
403
404Best performance from this backend is achieved by not unregistering all 413Best performance from this backend is achieved by not unregistering all
405watchers for a file descriptor until it has been closed, if possible, 414watchers for a file descriptor until it has been closed, if possible,
406i.e. keep at least one watcher active per fd at all times. Stopping and 415i.e. keep at least one watcher active per fd at all times. Stopping and
407starting a watcher (without re-setting it) also usually doesn't cause 416starting a watcher (without re-setting it) also usually doesn't cause
408extra overhead. 417extra overhead. A fork can both result in spurious notifications as well
418as in libev having to destroy and recreate the epoll object, which can
419take considerable time and thus should be avoided.
409 420
410While nominally embeddable in other event loops, this feature is broken in 421While nominally embeddable in other event loops, this feature is broken in
411all kernel versions tested so far. 422all kernel versions tested so far.
412 423
413This backend maps C<EV_READ> and C<EV_WRITE> in the same way as 424This backend maps C<EV_READ> and C<EV_WRITE> in the same way as
428 439
429It scales in the same way as the epoll backend, but the interface to the 440It scales in the same way as the epoll backend, but the interface to the
430kernel is more efficient (which says nothing about its actual speed, of 441kernel is more efficient (which says nothing about its actual speed, of
431course). While stopping, setting and starting an I/O watcher does never 442course). While stopping, setting and starting an I/O watcher does never
432cause an extra system call as with C<EVBACKEND_EPOLL>, it still adds up to 443cause an extra system call as with C<EVBACKEND_EPOLL>, it still adds up to
433two event changes per incident. Support for C<fork ()> is very bad and it 444two event changes per incident. Support for C<fork ()> is very bad (but
434drops fds silently in similarly hard-to-detect cases. 445sane, unlike epoll) and it drops fds silently in similarly hard-to-detect
446cases
435 447
436This backend usually performs well under most conditions. 448This backend usually performs well under most conditions.
437 449
438While nominally embeddable in other event loops, this doesn't work 450While nominally embeddable in other event loops, this doesn't work
439everywhere, so you might need to test for this. And since it is broken 451everywhere, so you might need to test for this. And since it is broken
468might perform better. 480might perform better.
469 481
470On the positive side, with the exception of the spurious readiness 482On the positive side, with the exception of the spurious readiness
471notifications, this backend actually performed fully to specification 483notifications, this backend actually performed fully to specification
472in all tests and is fully embeddable, which is a rare feat among the 484in all tests and is fully embeddable, which is a rare feat among the
473OS-specific backends. 485OS-specific backends (I vastly prefer correctness over speed hacks).
474 486
475This backend maps C<EV_READ> and C<EV_WRITE> in the same way as 487This backend maps C<EV_READ> and C<EV_WRITE> in the same way as
476C<EVBACKEND_POLL>. 488C<EVBACKEND_POLL>.
477 489
478=item C<EVBACKEND_ALL> 490=item C<EVBACKEND_ALL>
531responsibility to either stop all watchers cleanly yourself I<before> 543responsibility to either stop all watchers cleanly yourself I<before>
532calling this function, or cope with the fact afterwards (which is usually 544calling this function, or cope with the fact afterwards (which is usually
533the easiest thing, you can just ignore the watchers and/or C<free ()> them 545the easiest thing, you can just ignore the watchers and/or C<free ()> them
534for example). 546for example).
535 547
536Note that certain global state, such as signal state, will not be freed by 548Note that certain global state, such as signal state (and installed signal
537this function, and related watchers (such as signal and child watchers) 549handlers), will not be freed by this function, and related watchers (such
538would need to be stopped manually. 550as signal and child watchers) would need to be stopped manually.
539 551
540In general it is not advisable to call this function except in the 552In general it is not advisable to call this function except in the
541rare occasion where you really need to free e.g. the signal handling 553rare occasion where you really need to free e.g. the signal handling
542pipe fds. If you need dynamically allocated loops it is better to use 554pipe fds. If you need dynamically allocated loops it is better to use
543C<ev_loop_new> and C<ev_loop_destroy>). 555C<ev_loop_new> and C<ev_loop_destroy>).
1910 1922
1911 1923
1912=head2 C<ev_stat> - did the file attributes just change? 1924=head2 C<ev_stat> - did the file attributes just change?
1913 1925
1914This watches a file system path for attribute changes. That is, it calls 1926This watches a file system path for attribute changes. That is, it calls
1915C<stat> regularly (or when the OS says it changed) and sees if it changed 1927C<stat> on that path in regular intervals (or when the OS says it changed)
1916compared to the last time, invoking the callback if it did. 1928and sees if it changed compared to the last time, invoking the callback if
1929it did.
1917 1930
1918The path does not need to exist: changing from "path exists" to "path does 1931The path does not need to exist: changing from "path exists" to "path does
1919not exist" is a status change like any other. The condition "path does 1932not exist" is a status change like any other. The condition "path does
1920not exist" is signified by the C<st_nlink> field being zero (which is 1933not exist" is signified by the C<st_nlink> field being zero (which is
1921otherwise always forced to be at least one) and all the other fields of 1934otherwise always forced to be at least one) and all the other fields of
1922the stat buffer having unspecified contents. 1935the stat buffer having unspecified contents.
1923 1936
1924The path I<should> be absolute and I<must not> end in a slash. If it is 1937The path I<must not> end in a slash or contain special components such as
1938C<.> or C<..>. The path I<should> be absolute: If it is relative and
1925relative and your working directory changes, the behaviour is undefined. 1939your working directory changes, then the behaviour is undefined.
1926 1940
1927Since there is no standard kernel interface to do this, the portable 1941Since there is no portable change notification interface available, the
1928implementation simply calls C<stat (2)> regularly on the path to see if 1942portable implementation simply calls C<stat(2)> regularly on the path
1929it changed somehow. You can specify a recommended polling interval for 1943to see if it changed somehow. You can specify a recommended polling
1930this case. If you specify a polling interval of C<0> (highly recommended!) 1944interval for this case. If you specify a polling interval of C<0> (highly
1931then a I<suitable, unspecified default> value will be used (which 1945recommended!) then a I<suitable, unspecified default> value will be used
1932you can expect to be around five seconds, although this might change 1946(which you can expect to be around five seconds, although this might
1933dynamically). Libev will also impose a minimum interval which is currently 1947change dynamically). Libev will also impose a minimum interval which is
1934around C<0.1>, but thats usually overkill. 1948currently around C<0.1>, but thats usually overkill.
1935 1949
1936This watcher type is not meant for massive numbers of stat watchers, 1950This watcher type is not meant for massive numbers of stat watchers,
1937as even with OS-supported change notifications, this can be 1951as even with OS-supported change notifications, this can be
1938resource-intensive. 1952resource-intensive.
1939 1953
1949support disabled by default, you get the 32 bit version of the stat 1963support disabled by default, you get the 32 bit version of the stat
1950structure. When using the library from programs that change the ABI to 1964structure. When using the library from programs that change the ABI to
1951use 64 bit file offsets the programs will fail. In that case you have to 1965use 64 bit file offsets the programs will fail. In that case you have to
1952compile libev with the same flags to get binary compatibility. This is 1966compile libev with the same flags to get binary compatibility. This is
1953obviously the case with any flags that change the ABI, but the problem is 1967obviously the case with any flags that change the ABI, but the problem is
1954most noticeably disabled with ev_stat and large file support. 1968most noticeably displayed with ev_stat and large file support.
1955 1969
1956The solution for this is to lobby your distribution maker to make large 1970The solution for this is to lobby your distribution maker to make large
1957file interfaces available by default (as e.g. FreeBSD does) and not 1971file interfaces available by default (as e.g. FreeBSD does) and not
1958optional. Libev cannot simply switch on large file support because it has 1972optional. Libev cannot simply switch on large file support because it has
1959to exchange stat structures with application programs compiled using the 1973to exchange stat structures with application programs compiled using the
1978descriptor open on the object at all times, and detecting renames, unlinks 1992descriptor open on the object at all times, and detecting renames, unlinks
1979etc. is difficult. 1993etc. is difficult.
1980 1994
1981=head3 The special problem of stat time resolution 1995=head3 The special problem of stat time resolution
1982 1996
1983The C<stat ()> system call only supports full-second resolution portably, and 1997The C<stat ()> system call only supports full-second resolution portably,
1984even on systems where the resolution is higher, most file systems still 1998and even on systems where the resolution is higher, most file systems
1985only support whole seconds. 1999still only support whole seconds.
1986 2000
1987That means that, if the time is the only thing that changes, you can 2001That means that, if the time is the only thing that changes, you can
1988easily miss updates: on the first update, C<ev_stat> detects a change and 2002easily miss updates: on the first update, C<ev_stat> detects a change and
1989calls your callback, which does something. When there is another update 2003calls your callback, which does something. When there is another update
1990within the same second, C<ev_stat> will be unable to detect unless the 2004within the same second, C<ev_stat> will be unable to detect unless the

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines