--- AnyEvent/README 2008/05/10 22:30:28 1.20 +++ AnyEvent/README 2009/06/29 21:00:32 1.42 @@ -1,23 +1,44 @@ NAME AnyEvent - provide framework for multiple event loops - EV, Event, Glib, Tk, Perl, Event::Lib, Qt, POE - various supported event - loops + EV, Event, Glib, Tk, Perl, Event::Lib, Qt and POE are various supported + event loops. SYNOPSIS use AnyEvent; - my $w = AnyEvent->io (fh => $fh, poll => "r|w", cb => sub { - ... - }); + # file descriptor readable + my $w = AnyEvent->io (fh => $fh, poll => "r", cb => sub { ... }); - my $w = AnyEvent->timer (after => $seconds, cb => sub { + # one-shot or repeating timers + my $w = AnyEvent->timer (after => $seconds, cb => sub { ... }); + my $w = AnyEvent->timer (after => $seconds, interval => $seconds, cb => ... + + print AnyEvent->now; # prints current event loop time + print AnyEvent->time; # think Time::HiRes::time or simply CORE::time. + + # POSIX signal + my $w = AnyEvent->signal (signal => "TERM", cb => sub { ... }); + + # child process exit + my $w = AnyEvent->child (pid => $pid, cb => sub { + my ($pid, $status) = @_; ... }); + # called when event loop idle (if applicable) + my $w = AnyEvent->idle (cb => sub { ... }); + my $w = AnyEvent->condvar; # stores whether a condition was flagged $w->send; # wake up current and all future recv's $w->recv; # enters "main loop" till $condvar gets ->send + # use a condvar in callback mode: + $w->cb (sub { $_[0]->recv }); + +INTRODUCTION/TUTORIAL + This manpage is mainly a reference manual. If you are interested in a + tutorial or some gentle introduction, have a look at the AnyEvent::Intro + manpage. WHY YOU SHOULD USE THIS MODULE (OR NOT) Glib, POE, IO::Async, Event... CPAN offers event models by the dozen @@ -27,11 +48,12 @@ policy* and AnyEvent is *small and efficient*. First and foremost, *AnyEvent is not an event model* itself, it only - interfaces to whatever event model the main program happens to use in a + interfaces to whatever event model the main program happens to use, in a pragmatic way. For event models and certain classes of immortals alike, the statement "there can only be one" is a bitter reality: In general, only one event loop can be active at the same time in a process. - AnyEvent helps hiding the differences between those event loops. + AnyEvent cannot change this, but it can hide the differences between + those event loops. The goal of AnyEvent is to offer module authors the ability to do event programming (waiting for I/O or timer events) without subscribing to a @@ -42,27 +64,33 @@ For modules like POE or IO::Async (which is a total misnomer as it is actually doing all I/O *synchronously*...), using them in your module is like joining a cult: After you joined, you are dependent on them and you - cannot use anything else, as it is simply incompatible to everything - that isn't itself. What's worse, all the potential users of your module + cannot use anything else, as they are simply incompatible to everything + that isn't them. What's worse, all the potential users of your module are *also* forced to use the same event loop you use. AnyEvent is different: AnyEvent + POE works fine. AnyEvent + Glib works fine. AnyEvent + Tk works fine etc. etc. but none of these work together - with the rest: POE + IO::Async? no go. Tk + Event? no go. Again: if your + with the rest: POE + IO::Async? No go. Tk + Event? No go. Again: if your module uses one of those, every user of your module has to use it, too. But if your module uses AnyEvent, it works transparently with all event - models it supports (including stuff like POE and IO::Async, as long as - those use one of the supported event loops. It is trivial to add new - event loops to AnyEvent, too, so it is future-proof). + models it supports (including stuff like IO::Async, as long as those use + one of the supported event loops. It is trivial to add new event loops + to AnyEvent, too, so it is future-proof). In addition to being free of having to use *the one and only true event model*, AnyEvent also is free of bloat and policy: with POE or similar - modules, you get an enourmous amount of code and strict rules you have - to follow. AnyEvent, on the other hand, is lean and up to the point, by + modules, you get an enormous amount of code and strict rules you have to + follow. AnyEvent, on the other hand, is lean and up to the point, by only offering the functionality that is necessary, in as thin as a wrapper as technically possible. - Of course, if you want lots of policy (this can arguably be somewhat + Of course, AnyEvent comes with a big (and fully optional!) toolbox of + useful functionality, such as an asynchronous DNS resolver, 100% + non-blocking connects (even with TLS/SSL, IPv6 and on broken platforms + such as Windows) and lots of real-world knowledge and workarounds for + platform bugs and differences. + + Now, if you *do want* lots of policy (this can arguably be somewhat useful) and you want to force your users to use the one and only event model, you should *not* use this module. @@ -101,18 +129,24 @@ The pure-perl implementation of AnyEvent is called "AnyEvent::Impl::Perl". Like other event modules you can load it - explicitly. + explicitly and enjoy the high availability of that event loop :) WATCHERS AnyEvent has the central concept of a *watcher*, which is an object that stores relevant data for each kind of event you are waiting for, such as - the callback to call, the filehandle to watch, etc. + the callback to call, the file handle to watch, etc. These watchers are normal Perl objects with normal Perl lifetime. After creating a watcher it will immediately "watch" for events and invoke the callback when the event occurs (of course, only when the event model is in control). + Note that callbacks must not permanently change global variables + potentially in use by the event loop (such as $_ or $[) and that + callbacks must not "die". The former is good programming practise in + Perl and the latter stems from the fact that exception handling differs + widely between event loops. + To disable the watcher you have to destroy it (e.g. by setting the variable you store it in to "undef" or otherwise deleting all references to it). @@ -124,10 +158,10 @@ An any way to achieve that is this pattern: - my $w; $w = AnyEvent->type (arg => value ..., cb => sub { - # you can use $w here, for example to undef it - undef $w; - }); + my $w; $w = AnyEvent->type (arg => value ..., cb => sub { + # you can use $w here, for example to undef it + undef $w; + }); Note that "my $w; $w =" combination. This is necessary because in Perl, my variables are only visible after the statement in which they are @@ -137,10 +171,17 @@ You can create an I/O watcher by calling the "AnyEvent->io" method with the following mandatory key-value pairs as arguments: - "fh" the Perl *file handle* (*not* file descriptor) to watch for events. + "fh" is the Perl *file handle* (*not* file descriptor) to watch for + events (AnyEvent might or might not keep a reference to this file + handle). Note that only file handles pointing to things for which + non-blocking operation makes sense are allowed. This includes sockets, + most character devices, pipes, fifos and so on, but not for example + files or block devices. + "poll" must be a string that is either "r" or "w", which creates a - watcher waiting for "r"eadable or "w"ritable events, respectively. "cb" - is the callback to invoke each time the file handle becomes ready. + watcher waiting for "r"eadable or "w"ritable events, respectively. + + "cb" is the callback to invoke each time the file handle becomes ready. Although the callback might get passed parameters, their value and presence is undefined and you cannot rely on them. Portable AnyEvent @@ -154,9 +195,9 @@ always use non-blocking calls when reading/writing from/to your file handles. - Example: + Example: wait for readability of STDIN, then read a line and disable the + watcher. - # wait for readability of STDIN, then read a line and disable the watcher my $w; $w = AnyEvent->io (fh => \*STDIN, poll => 'r', cb => sub { chomp (my $input = ); warn "read: $input\n"; @@ -175,13 +216,18 @@ presence is undefined and you cannot rely on them. Portable AnyEvent callbacks cannot use arguments passed to time watcher callbacks. - The timer callback will be invoked at most once: if you want a repeating - timer you have to create a new watcher (this is a limitation by both Tk - and Glib). + The callback will normally be invoked once only. If you specify another + parameter, "interval", as a strictly positive number (> 0), then the + callback will be invoked regularly at that interval (in fractional + seconds) after the first invocation. If "interval" is specified with a + false value, then it is treated as if it were missing. + + The callback will be rescheduled before invoking the callback, but no + attempt is done to avoid timer drift in most backends, so the interval + is only approximate. - Example: + Example: fire an event after 7.7 seconds. - # fire an event after 7.7 seconds my $w = AnyEvent->timer (after => 7.7, cb => sub { warn "timeout\n"; }); @@ -189,19 +235,12 @@ # to cancel the timer: undef $w; - Example 2: - - # fire an event after 0.5 seconds, then roughly every second - my $w; + Example 2: fire an event after 0.5 seconds, then roughly every second. - my $cb = sub { - # cancel the old timer while creating a new one - $w = AnyEvent->timer (after => 1, cb => $cb); + my $w = AnyEvent->timer (after => 0.5, interval => 1, cb => sub { + warn "timeout\n"; }; - # start the "loop" by creating the first watcher - $w = AnyEvent->timer (after => 0.5, cb => $cb); - TIMING ISSUES There are two ways to handle timers: based on real time (relative, "fire in 10 seconds") and based on wallclock time (absolute, "fire at 12 @@ -222,19 +261,95 @@ AnyEvent always prefers relative timers, if available, matching the AnyEvent API. + AnyEvent has two additional methods that return the "current time": + + AnyEvent->time + This returns the "current wallclock time" as a fractional number of + seconds since the Epoch (the same thing as "time" or + "Time::HiRes::time" return, and the result is guaranteed to be + compatible with those). + + It progresses independently of any event loop processing, i.e. each + call will check the system clock, which usually gets updated + frequently. + + AnyEvent->now + This also returns the "current wallclock time", but unlike "time", + above, this value might change only once per event loop iteration, + depending on the event loop (most return the same time as "time", + above). This is the time that AnyEvent's timers get scheduled + against. + + *In almost all cases (in all cases if you don't care), this is the + function to call when you want to know the current time.* + + This function is also often faster then "AnyEvent->time", and thus + the preferred method if you want some timestamp (for example, + AnyEvent::Handle uses this to update it's activity timeouts). + + The rest of this section is only of relevance if you try to be very + exact with your timing, you can skip it without bad conscience. + + For a practical example of when these times differ, consider + Event::Lib and EV and the following set-up: + + The event loop is running and has just invoked one of your callback + at time=500 (assume no other callbacks delay processing). In your + callback, you wait a second by executing "sleep 1" (blocking the + process for a second) and then (at time=501) you create a relative + timer that fires after three seconds. + + With Event::Lib, "AnyEvent->time" and "AnyEvent->now" will both + return 501, because that is the current time, and the timer will be + scheduled to fire at time=504 (501 + 3). + + With EV, "AnyEvent->time" returns 501 (as that is the current time), + but "AnyEvent->now" returns 500, as that is the time the last event + processing phase started. With EV, your timer gets scheduled to run + at time=503 (500 + 3). + + In one sense, Event::Lib is more exact, as it uses the current time + regardless of any delays introduced by event processing. However, + most callbacks do not expect large delays in processing, so this + causes a higher drift (and a lot more system calls to get the + current time). + + In another sense, EV is more exact, as your timer will be scheduled + at the same time, regardless of how long event processing actually + took. + + In either case, if you care (and in most cases, you don't), then you + can get whatever behaviour you want with any event loop, by taking + the difference between "AnyEvent->time" and "AnyEvent->now" into + account. + + AnyEvent->now_update + Some event loops (such as EV or AnyEvent::Impl::Perl) cache the + current time for each loop iteration (see the discussion of + AnyEvent->now, above). + + When a callback runs for a long time (or when the process sleeps), + then this "current" time will differ substantially from the real + time, which might affect timers and time-outs. + + When this is the case, you can call this method, which will update + the event loop's idea of "current time". + + Note that updating the time *might* cause some events to be handled. + SIGNAL WATCHERS You can watch for signals using a signal watcher, "signal" is the signal - *name* without any "SIG" prefix, "cb" is the Perl callback to be invoked - whenever a signal occurs. + *name* in uppercase and without any "SIG" prefix, "cb" is the Perl + callback to be invoked whenever a signal occurs. Although the callback might get passed parameters, their value and presence is undefined and you cannot rely on them. Portable AnyEvent callbacks cannot use arguments passed to signal watcher callbacks. - Multiple signal occurances can be clumped together into one callback - invocation, and callback invocation will be synchronous. synchronous + Multiple signal occurrences can be clumped together into one callback + invocation, and callback invocation will be synchronous. Synchronous means that it might take a while until the signal gets handled by the - process, but it is guarenteed not to interrupt any other callbacks. + process, but it is guaranteed not to interrupt any other callbacks. The main advantage of using these watchers is that you can share a signal between multiple watchers. @@ -250,20 +365,29 @@ You can also watch on a child process exit and catch its exit status. The child process is specified by the "pid" argument (if set to 0, it - watches for any child process exit). The watcher will trigger as often - as status change for the child are received. This works by installing a - signal handler for "SIGCHLD". The callback will be called with the pid - and exit status (as returned by waitpid), so unlike other watcher types, - you *can* rely on child watcher callback arguments. + watches for any child process exit). The watcher will triggered only + when the child process has finished and an exit status is available, not + on any trace events (stopped/continued). + + The callback will be called with the pid and exit status (as returned by + waitpid), so unlike other watcher types, you *can* rely on child watcher + callback arguments. + + This watcher type works by installing a signal handler for "SIGCHLD", + and since it cannot be shared, nothing else should use SIGCHLD or reap + random child processes (waiting for specific child processes, e.g. + inside "system", is just fine). There is a slight catch to child watchers, however: you usually start them *after* the child process was created, and this means the process could have exited already (and no SIGCHLD will be sent anymore). - Not all event models handle this correctly (POE doesn't), but even for - event models that *do* handle this correctly, they usually need to be - loaded before the process exits (i.e. before you fork in the first - place). + Not all event models handle this correctly (neither POE nor IO::Async + do, see their AnyEvent::Impl manpages for details), but even for event + models that *do* handle this correctly, they usually need to be loaded + before the process exits (i.e. before you fork in the first place). + AnyEvent's pure perl event loop handles all cases correctly regardless + of when you start the watcher. This means you cannot create a child watcher as the very first thing in an AnyEvent program, you *have* to create at least one watcher before @@ -271,21 +395,55 @@ Example: fork a process and wait for it - my $done = AnyEvent->condvar; - - my $pid = fork or exit 5; - - my $w = AnyEvent->child ( - pid => $pid, - cb => sub { - my ($pid, $status) = @_; - warn "pid $pid exited with status $status"; - $done->send; - }, - ); - - # do something else, then wait for process exit - $done->recv; + my $done = AnyEvent->condvar; + + my $pid = fork or exit 5; + + my $w = AnyEvent->child ( + pid => $pid, + cb => sub { + my ($pid, $status) = @_; + warn "pid $pid exited with status $status"; + $done->send; + }, + ); + + # do something else, then wait for process exit + $done->recv; + + IDLE WATCHERS + Sometimes there is a need to do something, but it is not so important to + do it instantly, but only when there is nothing better to do. This + "nothing better to do" is usually defined to be "no other events need + attention by the event loop". + + Idle watchers ideally get invoked when the event loop has nothing better + to do, just before it would block the process to wait for new events. + Instead of blocking, the idle watcher is invoked. + + Most event loops unfortunately do not really support idle watchers (only + EV, Event and Glib do it in a usable fashion) - for the rest, AnyEvent + will simply call the callback "from time to time". + + Example: read lines from STDIN, but only process them when the program + is otherwise idle: + + my @lines; # read data + my $idle_w; + my $io_w = AnyEvent->io (fh => \*STDIN, poll => 'r', cb => sub { + push @lines, scalar ; + + # start an idle watcher, if not already done + $idle_w ||= AnyEvent->idle (cb => sub { + # handle only one line, when there are lines left + if (my $line = shift @lines) { + print "handled when idle: $line"; + } else { + # otherwise disable the idle watcher again + undef $idle_w; + } + }); + }); CONDITION VARIABLES If you are familiar with some event loops you will know that all of them @@ -300,17 +458,21 @@ Condition variables can be created by calling the "AnyEvent->condvar" method, usually without arguments. The only argument pair allowed is + "cb", which specifies a callback to be called when the condition - variable becomes true. + variable becomes true, with the condition variable as the first argument + (but not the results). - After creation, the conditon variable is "false" until it becomes "true" - by calling the "send" method. + After creation, the condition variable is "false" until it becomes + "true" by calling the "send" method (or calling the condition variable + as if it were a callback, read about the caveats in the description for + the "->send" method). Condition variables are similar to callbacks, except that you can optionally wait for them. They can also be called merge points - points - in time where multiple outstandign events have been processed. And yet - another way to call them is transations - each condition variable can be - used to represent a transaction, which finishes at some point and + in time where multiple outstanding events have been processed. And yet + another way to call them is transactions - each condition variable can + be used to represent a transaction, which finishes at some point and delivers a result. Condition variables are very useful to signal that something has @@ -325,7 +487,7 @@ button of your app, which would "->send" the "quit" event. Note that condition variables recurse into the event loop - if you have - two pieces of code that call "->recv" in a round-robbin fashion, you + two pieces of code that call "->recv" in a round-robin fashion, you lose. Therefore, condition variables are good to export to your caller, but you should avoid making a blocking wait yourself, at least in callbacks, as this asks for trouble. @@ -340,7 +502,7 @@ which eventually calls "-> send", and the "consumer side", which waits for the send to occur. - Example: + Example: wait for a timer. # wait till the result is ready my $result_ready = AnyEvent->condvar; @@ -358,6 +520,30 @@ # calls send $result_ready->recv; + Example: wait for a timer, but take advantage of the fact that condition + variables are also code references. + + my $done = AnyEvent->condvar; + my $delay = AnyEvent->timer (after => 5, cb => $done); + $done->recv; + + Example: Imagine an API that returns a condvar and doesn't support + callbacks. This is how you make a synchronous call, for example from the + main program: + + use AnyEvent::CouchDB; + + ... + + my @info = $couchdb->info->recv; + + And this is how you would just ste a callback to be called whenever the + results are available: + + $couchdb->info->cb (sub { + my @info = $_[0]->recv; + }); + METHODS FOR PRODUCERS These methods should only be used by the producing side, i.e. the code/module that eventually sends the signal. Note that it is also the @@ -375,6 +561,15 @@ Any arguments passed to the "send" call will be returned by all future "->recv" calls. + Condition variables are overloaded so one can call them directly (as + a code reference). Calling them directly is the same as calling + "send". Note, however, that many C-based event loops do not handle + overloading, so as tempting as it may be, passing a condition + variable instead of a callback does not work. Both the pure perl and + EV loops support overloading, however, as well as all functions that + use perl to invoke a callback (as in AnyEvent::Socket and + AnyEvent::DNS for example). + $cv->croak ($error) Similar to send, but causes all call's to "->recv" to invoke "Carp::croak" with the given error message/object/scalar. @@ -384,8 +579,6 @@ $cv->begin ([group callback]) $cv->end - These two methods are EXPERIMENTAL and MIGHT CHANGE. - These two methods can be used to combine many transactions/events into one. For example, a function that pings many hosts in parallel might want to use a condition variable for the whole process. @@ -396,7 +589,38 @@ *supposed* to call "->send", but that is not required. If no callback was set, "send" will be called without any arguments. - Let's clarify this with the ping example: + You can think of "$cv->send" giving you an OR condition (one call + sends), while "$cv->begin" and "$cv->end" giving you an AND + condition (all "begin" calls must be "end"'ed before the condvar + sends). + + Let's start with a simple example: you have two I/O watchers (for + example, STDOUT and STDERR for a program), and you want to wait for + both streams to close before activating a condvar: + + my $cv = AnyEvent->condvar; + + $cv->begin; # first watcher + my $w1 = AnyEvent->io (fh => $fh1, cb => sub { + defined sysread $fh1, my $buf, 4096 + or $cv->end; + }); + + $cv->begin; # second watcher + my $w2 = AnyEvent->io (fh => $fh2, cb => sub { + defined sysread $fh2, my $buf, 4096 + or $cv->end; + }); + + $cv->recv; + + This works because for every event source (EOF on file handle), + there is one call to "begin", so the condvar waits for all calls to + "end" before sending. + + The ping example mentioned above is slightly more complicated, as + the there are results to be passwd back, and the number of tasks + that are begung can potentially be zero: my $cv = AnyEvent->condvar; @@ -426,11 +650,11 @@ ensures that "send" is called even when "no" hosts are being pinged (the loop doesn't execute once). - This is the general pattern when you "fan out" into multiple - subrequests: use an outer "begin"/"end" pair to set the callback and - ensure "end" is called at least once, and then, for each subrequest - you start, call "begin" and for eahc subrequest you finish, call - "end". + This is the general pattern when you "fan out" into multiple (but + potentially none) subrequests: use an outer "begin"/"end" pair to + set the callback and ensure "end" is called at least once, and then, + for each subrequest you start, call "begin" and for each subrequest + you finish, call "end". METHODS FOR CONSUMERS These methods should only be used by the consuming side, i.e. the code @@ -455,7 +679,7 @@ the caller decide whether the call will block or not (for example, by coupling condition variables with some kind of request results and supporting callbacks so the caller knows that getting the result - will not block, while still suppporting blocking waits if the caller + will not block, while still supporting blocking waits if the caller so desires). Another reason *never* to "->recv" in a module is that you cannot @@ -478,13 +702,14 @@ Returns true when the condition is "true", i.e. whether "send" or "croak" have been called. - $cb = $cv->cb ([new callback]) + $cb = $cv->cb ($cb->($cv)) This is a mutator function that returns the callback set and optionally replaces it before doing so. The callback will be called when the condition becomes "true", i.e. - when "send" or "croak" are called. Calling "recv" inside the - callback or at any later time is guaranteed not to block. + when "send" or "croak" are called, with the only argument being the + condition variable itself. Calling "recv" inside the callback or at + any later time is guaranteed not to block. GLOBAL VARIABLES AND FUNCTIONS $AnyEvent::MODEL @@ -505,6 +730,10 @@ AnyEvent::Impl::EventLib based on Event::Lib, leaks memory and worse. AnyEvent::Impl::POE based on POE, not generic enough for full support. + # warning, support for IO::Async is only partial, as it is too broken + # and limited toe ven support the AnyEvent API. See AnyEvent::Impl::Async. + AnyEvent::Impl::IOAsync based on IO::Async, cannot be autoprobed (see its docs). + There is no support for WxWidgets, as WxWidgets has no support for watching file handles. However, you can use WxWidgets through the POE Adaptor, as POE has a Wx backend that simply polls 20 times per @@ -568,8 +797,8 @@ AnyEvent decide which implementation to chose if some module relies on it. - If the main program relies on a specific event model. For example, in - Gtk2 programs you have to rely on the Glib module. You should load the + If the main program relies on a specific event model - for example, in + Gtk2 programs you have to rely on the Glib module - you should load the event module before loading AnyEvent or any module that uses it: generally speaking, you should load it as early as possible. The reason is that modules might create watchers when they are loaded, and AnyEvent @@ -577,9 +806,25 @@ and it might chose the wrong one unless you load the correct one yourself. - You can chose to use a rather inefficient pure-perl implementation by - loading the "AnyEvent::Impl::Perl" module, which gives you similar - behaviour everywhere, but letting AnyEvent chose is generally better. + You can chose to use a pure-perl implementation by loading the + "AnyEvent::Impl::Perl" module, which gives you similar behaviour + everywhere, but letting AnyEvent chose the model is generally better. + + MAINLOOP EMULATION + Sometimes (often for short test scripts, or even standalone programs who + only want to use AnyEvent), you do not want to run a specific event + loop. + + In that case, you can use a condition variable like this: + + AnyEvent->condvar->recv; + + This has the effect of entering the event loop and looping forever. + + Note that usually your program has some exit condition, in which case it + is better to use the "traditional" approach of storing a condition + variable somewhere, waiting for it, and sending it when the program + should exit cleanly. OTHER MODULES The following is a non-exhaustive list of additional modules that use @@ -592,22 +837,53 @@ blocking functions such as "inet_aton" by event-/callback-based versions. + AnyEvent::Socket + Provides various utility functions for (internet protocol) sockets, + addresses and name resolution. Also functions to create non-blocking + tcp connections or tcp servers, with IPv6 and SRV record support and + more. + AnyEvent::Handle - Provide read and write buffers and manages watchers for reads and - writes. + Provide read and write buffers, manages watchers for reads and + writes, supports raw and formatted I/O, I/O queued and fully + transparent and non-blocking SSL/TLS. + + AnyEvent::DNS + Provides rich asynchronous DNS resolver capabilities. + + AnyEvent::HTTP + A simple-to-use HTTP library that is capable of making a lot of + concurrent HTTP requests. AnyEvent::HTTPD Provides a simple web application server framework. - AnyEvent::DNS - Provides asynchronous DNS resolver capabilities, beyond what - AnyEvent::Util offers. - AnyEvent::FastPing The fastest ping in the west. - Net::IRC3 - AnyEvent based IRC client module family. + AnyEvent::DBI + Executes DBI requests asynchronously in a proxy process. + + AnyEvent::AIO + Truly asynchronous I/O, should be in the toolbox of every event + programmer. AnyEvent::AIO transparently fuses IO::AIO and AnyEvent + together. + + AnyEvent::BDB + Truly asynchronous Berkeley DB access. AnyEvent::BDB transparently + fuses BDB and AnyEvent together. + + AnyEvent::GPSD + A non-blocking interface to gpsd, a daemon delivering GPS + information. + + AnyEvent::IGS + A non-blocking interface to the Internet Go Server protocol (used by + App::IGS). + + AnyEvent::IRC + AnyEvent based IRC client module family (replacing the older + Net::IRC3). Net::XMPP2 AnyEvent based XMPP (Jabber protocol) module family. @@ -622,19 +898,109 @@ Coro Has special support for AnyEvent via Coro::AnyEvent. - AnyEvent::AIO, IO::AIO - Truly asynchronous I/O, should be in the toolbox of every event - programmer. AnyEvent::AIO transparently fuses IO::AIO and AnyEvent - together. - - AnyEvent::BDB, BDB - Truly asynchronous Berkeley DB access. AnyEvent::AIO transparently - fuses IO::AIO and AnyEvent together. - IO::Lambda The lambda approach to I/O - don't ask, look there. Can use AnyEvent. +ERROR AND EXCEPTION HANDLING + In general, AnyEvent does not do any error handling - it relies on the + caller to do that if required. The AnyEvent::Strict module (see also the + "PERL_ANYEVENT_STRICT" environment variable, below) provides strict + checking of all AnyEvent methods, however, which is highly useful during + development. + + As for exception handling (i.e. runtime errors and exceptions thrown + while executing a callback), this is not only highly event-loop + specific, but also not in any way wrapped by this module, as this is the + job of the main program. + + The pure perl event loop simply re-throws the exception (usually within + "condvar->recv"), the Event and EV modules call "$Event/EV::DIED->()", + Glib uses "install_exception_handler" and so on. + +ENVIRONMENT VARIABLES + The following environment variables are used by this module or its + submodules. + + Note that AnyEvent will remove *all* environment variables starting with + "PERL_ANYEVENT_" from %ENV when it is loaded while taint mode is + enabled. + + "PERL_ANYEVENT_VERBOSE" + By default, AnyEvent will be completely silent except in fatal + conditions. You can set this environment variable to make AnyEvent + more talkative. + + When set to 1 or higher, causes AnyEvent to warn about unexpected + conditions, such as not being able to load the event model specified + by "PERL_ANYEVENT_MODEL". + + When set to 2 or higher, cause AnyEvent to report to STDERR which + event model it chooses. + + "PERL_ANYEVENT_STRICT" + AnyEvent does not do much argument checking by default, as thorough + argument checking is very costly. Setting this variable to a true + value will cause AnyEvent to load "AnyEvent::Strict" and then to + thoroughly check the arguments passed to most method calls. If it + finds any problems, it will croak. + + In other words, enables "strict" mode. + + Unlike "use strict", it is definitely recommended to keep it off in + production. Keeping "PERL_ANYEVENT_STRICT=1" in your environment + while developing programs can be very useful, however. + + "PERL_ANYEVENT_MODEL" + This can be used to specify the event model to be used by AnyEvent, + before auto detection and -probing kicks in. It must be a string + consisting entirely of ASCII letters. The string "AnyEvent::Impl::" + gets prepended and the resulting module name is loaded and if the + load was successful, used as event model. If it fails to load + AnyEvent will proceed with auto detection and -probing. + + This functionality might change in future versions. + + For example, to force the pure perl model (AnyEvent::Impl::Perl) you + could start your program like this: + + PERL_ANYEVENT_MODEL=Perl perl ... + + "PERL_ANYEVENT_PROTOCOLS" + Used by both AnyEvent::DNS and AnyEvent::Socket to determine + preferences for IPv4 or IPv6. The default is unspecified (and might + change, or be the result of auto probing). + + Must be set to a comma-separated list of protocols or address + families, current supported: "ipv4" and "ipv6". Only protocols + mentioned will be used, and preference will be given to protocols + mentioned earlier in the list. + + This variable can effectively be used for denial-of-service attacks + against local programs (e.g. when setuid), although the impact is + likely small, as the program has to handle conenction and other + failures anyways. + + Examples: "PERL_ANYEVENT_PROTOCOLS=ipv4,ipv6" - prefer IPv4 over + IPv6, but support both and try to use both. + "PERL_ANYEVENT_PROTOCOLS=ipv4" - only support IPv4, never try to + resolve or contact IPv6 addresses. + "PERL_ANYEVENT_PROTOCOLS=ipv6,ipv4" support either IPv4 or IPv6, but + prefer IPv6 over IPv4. + + "PERL_ANYEVENT_EDNS0" + Used by AnyEvent::DNS to decide whether to use the EDNS0 extension + for DNS. This extension is generally useful to reduce DNS traffic, + but some (broken) firewalls drop such DNS packets, which is why it + is off by default. + + Setting this variable to 1 will cause AnyEvent::DNS to announce + EDNS0 in its DNS requests. + + "PERL_ANYEVENT_MAX_FORKS" + The maximum number of child processes that + "AnyEvent::Util::fork_call" will create in parallel. + SUPPLYING YOUR OWN EVENT MODEL INTERFACE This is an advanced topic that you do not normally need to use AnyEvent in a module. This section is only of use to event loop authors who want @@ -678,36 +1044,6 @@ "die". This still works with most modules/usages, and blocking calls must not be done in an interactive application, so it makes sense. -ENVIRONMENT VARIABLES - The following environment variables are used by this module: - - "PERL_ANYEVENT_VERBOSE" - By default, AnyEvent will be completely silent except in fatal - conditions. You can set this environment variable to make AnyEvent - more talkative. - - When set to 1 or higher, causes AnyEvent to warn about unexpected - conditions, such as not being able to load the event model specified - by "PERL_ANYEVENT_MODEL". - - When set to 2 or higher, cause AnyEvent to report to STDERR which - event model it chooses. - - "PERL_ANYEVENT_MODEL" - This can be used to specify the event model to be used by AnyEvent, - before autodetection and -probing kicks in. It must be a string - consisting entirely of ASCII letters. The string "AnyEvent::Impl::" - gets prepended and the resulting module name is loaded and if the - load was successful, used as event model. If it fails to load - AnyEvent will proceed with autodetection and -probing. - - This functionality might change in future versions. - - For example, to force the pure perl model (AnyEvent::Impl::Perl) you - could start your program like this: - - PERL_ANYEVENT_MODEL=Perl perl ... - EXAMPLE PROGRAM The following program uses an I/O watcher to read data from STDIN, a timer to display a message once per second, and a condition variable to @@ -724,7 +1060,7 @@ warn "io event <$_[0]>\n"; # will always output chomp (my $input = ); # read a line warn "read: $input\n"; # output what has been read - $cv->broadcast if $input =~ /^q/i; # quit program if /^q/i + $cv->send if $input =~ /^q/i; # quit program if /^q/i }, ); @@ -739,7 +1075,7 @@ new_timer; # create first timer - $cv->wait; # wait until user enters /^q/i + $cv->recv; # wait until user enters /^q/i REAL-WORLD EXAMPLE Consider the Net::FCP module. It features (among others) the following @@ -798,13 +1134,13 @@ $txn->{w} = AnyEvent->io (fh => $txn->{fh}, poll => 'r', cb => sub { $txn->fh_ready_r }); Again, "fh_ready_r" waits till all data has arrived, and then stores the - result and signals any possible waiters that the request ahs finished: + result and signals any possible waiters that the request has finished: sysread $txn->{fh}, $txn->{buf}, length $txn->{$buf}; if (end-of-file or data complete) { $txn->{result} = $txn->{buf}; - $txn->{finished}->broadcast; + $txn->{finished}->send; $txb->{cb}->($txn) of $txn->{cb}; # also call callback } @@ -812,11 +1148,11 @@ request was already finished, it doesn't wait, of course, and returns the data: - $txn->{finished}->wait; + $txn->{finished}->recv; return $txn->{result}; The actual code goes further and collects all errors ("die"s, - exceptions) that occured during request processing. The "result" method + exceptions) that occurred during request processing. The "result" method detects whether an exception as thrown (it is stored inside the $txn object) and just throws the exception, which means connection errors and other problems get reported tot he code that tries to use the result, @@ -857,10 +1193,10 @@ $fcp->txn_client_get ($url)->cb (sub { ... - $quit->broadcast; + $quit->send; }); - $quit->wait; + $quit->recv; BENCHMARKS To give you an idea of the performance and overheads that AnyEvent adds @@ -869,7 +1205,7 @@ BENCHMARKING ANYEVENT OVERHEAD Here is a benchmark of various supported event models used natively and - through anyevent. The benchmark creates a lot of timers (with a zero + through AnyEvent. The benchmark creates a lot of timers (with a zero timeout) and I/O watchers (watching STDOUT, a pty, to become writable, which it is), lets them fire exactly once and destroys them again. @@ -895,24 +1231,26 @@ *invoke* is the time, in microseconds, used to invoke a simple callback. The callback simply counts down a Perl variable and after it was invoked - "watcher" times, it would "->broadcast" a condvar once to signal the end - of this phase. + "watcher" times, it would "->send" a condvar once to signal the end of + this phase. *destroy* is the time, in microseconds, that it takes to destroy a single watcher. Results name watchers bytes create invoke destroy comment - EV/EV 400000 244 0.56 0.46 0.31 EV native interface - EV/Any 100000 244 2.50 0.46 0.29 EV + AnyEvent watchers - CoroEV/Any 100000 244 2.49 0.44 0.29 coroutines + Coro::Signal - Perl/Any 100000 513 4.92 0.87 1.12 pure perl implementation - Event/Event 16000 516 31.88 31.30 0.85 Event native interface - Event/Any 16000 590 35.75 31.42 1.08 Event + AnyEvent watchers - Glib/Any 16000 1357 98.22 12.41 54.00 quadratic behaviour - Tk/Any 2000 1860 26.97 67.98 14.00 SEGV with >> 2000 watchers - POE/Event 2000 6644 108.64 736.02 14.73 via POE::Loop::Event - POE/Select 2000 6343 94.13 809.12 565.96 via POE::Loop::Select + EV/EV 400000 224 0.47 0.35 0.27 EV native interface + EV/Any 100000 224 2.88 0.34 0.27 EV + AnyEvent watchers + CoroEV/Any 100000 224 2.85 0.35 0.28 coroutines + Coro::Signal + Perl/Any 100000 452 4.13 0.73 0.95 pure perl implementation + Event/Event 16000 517 32.20 31.80 0.81 Event native interface + Event/Any 16000 590 35.85 31.55 1.06 Event + AnyEvent watchers + IOAsync/Any 16000 989 38.10 32.77 11.13 via IO::Async::Loop::IO_Poll + IOAsync/Any 16000 990 37.59 29.50 10.61 via IO::Async::Loop::Epoll + Glib/Any 16000 1357 102.33 12.31 51.00 quadratic behaviour + Tk/Any 2000 1860 27.20 66.31 14.00 SEGV with >> 2000 watchers + POE/Event 2000 6328 109.99 751.67 14.02 via POE::Loop::Event + POE/Select 2000 6027 94.54 809.13 579.80 via POE::Loop::Select Discussion The benchmark does *not* measure scalability of the event loop very @@ -948,6 +1286,9 @@ The "Event" module has a relatively high setup and callback invocation cost, but overall scores in on the third place. + "IO::Async" performs admirably well, about on par with "Event", even + when using its pure perl backend. + "Glib"'s memory usage is quite a bit higher, but it features a faster callback invocation and overall ends up in the same class as "Event". However, Glib scales extremely badly, doubling the number of watchers @@ -994,8 +1335,8 @@ reasonable memory usage. BENCHMARKING THE LARGE SERVER CASE - This benchmark atcually benchmarks the event loop itself. It works by - creating a number of "servers": each server consists of a socketpair, a + This benchmark actually benchmarks the event loop itself. It works by + creating a number of "servers": each server consists of a socket pair, a timeout watcher that gets reset on activity (but never fires), and an I/O watcher waiting for input on one side of the socket. Each time the socket watcher reads a byte it will write that byte to a random other @@ -1003,11 +1344,11 @@ The effect is that there will be a lot of I/O watchers, only part of which are active at any one point (so there is a constant number of - active fds for each loop iterstaion, but which fds these are is random). + active fds for each loop iteration, but which fds these are is random). The timeout is reset each time something is read because that reflects how most timeouts work (and puts extra pressure on the event loops). - In this benchmark, we use 10000 socketpairs (20000 sockets), of which + In this benchmark, we use 10000 socket pairs (20000 sockets), of which 100 (1%) are active. This mirrors the activity of large servers with many connections, most of which are idle at any one point in time. @@ -1018,7 +1359,7 @@ *sockets* is the number of sockets, and twice the number of "servers" (as each server has a read and write socket end). - *create* is the time it takes to create a socketpair (which is + *create* is the time it takes to create a socket pair (which is nontrivial) and two watchers: an I/O watcher and a timeout watcher. *request*, the most important value, is the time it takes to handle a @@ -1027,12 +1368,14 @@ and creating a new one that moves the timeout into the future. Results - name sockets create request - EV 20000 69.01 11.16 - Perl 20000 73.32 35.87 - Event 20000 212.62 257.32 - Glib 20000 651.16 1896.30 - POE 20000 349.67 12317.24 uses POE::Loop::Event + name sockets create request + EV 20000 69.01 11.16 + Perl 20000 73.32 35.87 + IOAsync 20000 157.00 98.14 epoll + IOAsync 20000 159.31 616.06 poll + Event 20000 212.62 257.32 + Glib 20000 651.16 1896.30 + POE 20000 349.67 12317.24 uses POE::Loop::Event Discussion This benchmark *does* measure scalability and overall performance of the @@ -1044,6 +1387,9 @@ Perl surprisingly comes second. It is much faster than the C-based event loops Event and Glib. + IO::Async performs very well when using its epoll backend, and still + quite good compared to Glib when using its pure perl backend. + Event suffers from high setup time as well (look at its code and you will understand why). Callback invocation also has a high overhead compared to the "$_->() for .."-style loop that the Perl event loop @@ -1092,7 +1438,7 @@ EV is again fastest. - Perl again comes second. It is noticably faster than the C-based event + Perl again comes second. It is noticeably faster than the C-based event loops Event and Glib, although the difference is too small to really matter. @@ -1103,6 +1449,90 @@ * C-based event loops perform very well with small number of watchers, as the management overhead dominates. + THE IO::Lambda BENCHMARK + Recently I was told about the benchmark in the IO::Lambda manpage, which + could be misinterpreted to make AnyEvent look bad. In fact, the + benchmark simply compares IO::Lambda with POE, and IO::Lambda looks + better (which shouldn't come as a surprise to anybody). As such, the + benchmark is fine, and mostly shows that the AnyEvent backend from + IO::Lambda isn't very optimal. But how would AnyEvent compare when used + without the extra baggage? To explore this, I wrote the equivalent + benchmark for AnyEvent. + + The benchmark itself creates an echo-server, and then, for 500 times, + connects to the echo server, sends a line, waits for the reply, and then + creates the next connection. This is a rather bad benchmark, as it + doesn't test the efficiency of the framework or much non-blocking I/O, + but it is a benchmark nevertheless. + + name runtime + Lambda/select 0.330 sec + + optimized 0.122 sec + Lambda/AnyEvent 0.327 sec + + optimized 0.138 sec + Raw sockets/select 0.077 sec + POE/select, components 0.662 sec + POE/select, raw sockets 0.226 sec + POE/select, optimized 0.404 sec + + AnyEvent/select/nb 0.085 sec + AnyEvent/EV/nb 0.068 sec + +state machine 0.134 sec + + The benchmark is also a bit unfair (my fault): the IO::Lambda/POE + benchmarks actually make blocking connects and use 100% blocking I/O, + defeating the purpose of an event-based solution. All of the newly + written AnyEvent benchmarks use 100% non-blocking connects (using + AnyEvent::Socket::tcp_connect and the asynchronous pure perl DNS + resolver), so AnyEvent is at a disadvantage here, as non-blocking + connects generally require a lot more bookkeeping and event handling + than blocking connects (which involve a single syscall only). + + The last AnyEvent benchmark additionally uses AnyEvent::Handle, which + offers similar expressive power as POE and IO::Lambda, using + conventional Perl syntax. This means that both the echo server and the + client are 100% non-blocking, further placing it at a disadvantage. + + As you can see, the AnyEvent + EV combination even beats the + hand-optimised "raw sockets benchmark", while AnyEvent + its pure perl + backend easily beats IO::Lambda and POE. + + And even the 100% non-blocking version written using the high-level (and + slow :) AnyEvent::Handle abstraction beats both POE and IO::Lambda by a + large margin, even though it does all of DNS, tcp-connect and socket I/O + in a non-blocking way. + + The two AnyEvent benchmarks programs can be found as eg/ae0.pl and + eg/ae2.pl in the AnyEvent distribution, the remaining benchmarks are + part of the IO::lambda distribution and were used without any changes. + +SIGNALS + AnyEvent currently installs handlers for these signals: + + SIGCHLD + A handler for "SIGCHLD" is installed by AnyEvent's child watcher + emulation for event loops that do not support them natively. Also, + some event loops install a similar handler. + + If, when AnyEvent is loaded, SIGCHLD is set to IGNORE, then AnyEvent + will reset it to default, to avoid losing child exit statuses. + + SIGPIPE + A no-op handler is installed for "SIGPIPE" when $SIG{PIPE} is + "undef" when AnyEvent gets loaded. + + The rationale for this is that AnyEvent users usually do not really + depend on SIGPIPE delivery (which is purely an optimisation for + shell use, or badly-written programs), but "SIGPIPE" can cause + spurious and rare program exits as a lot of people do not expect + "SIGPIPE" when writing to some random socket. + + The rationale for installing a no-op handler as opposed to ignoring + it is that this way, the handler will be restored to defaults on + exec. + + Feel free to install your own handler, or reset it to defaults. + FORK Most event libraries are not fork-safe. The ones who are usually are because they rely on inefficient but fork-safe "select" or "poll" calls. @@ -1122,15 +1552,29 @@ You can make AnyEvent completely ignore this variable by deleting it before the first watcher gets created, e.g. with a "BEGIN" block: - BEGIN { delete $ENV{PERL_ANYEVENT_MODEL} } - - use AnyEvent; + BEGIN { delete $ENV{PERL_ANYEVENT_MODEL} } + + use AnyEvent; Similar considerations apply to $ENV{PERL_ANYEVENT_VERBOSE}, as that can be used to probe what backend is used and gain other information (which - is probably even less useful to an attacker than PERL_ANYEVENT_MODEL). + is probably even less useful to an attacker than PERL_ANYEVENT_MODEL), + and $ENV{PERL_ANYEVENT_STRICT}. + + Note that AnyEvent will remove *all* environment variables starting with + "PERL_ANYEVENT_" from %ENV when it is loaded while taint mode is + enabled. + +BUGS + Perl 5.8 has numerous memleaks that sometimes hit this module and are + hard to work around. If you suffer from memleaks, first upgrade to Perl + 5.10 and check wether the leaks still show up. (Perl 5.10.0 has other + annoying memleaks, such as leaking on "map" and "grep" but it is usually + not as pronounced). SEE ALSO + Utility functions: AnyEvent::Util. + Event modules: EV, EV::Glib, Glib::EV, Event, Glib::Event, Glib, Tk, Event::Lib, Qt, POE. @@ -1138,11 +1582,16 @@ AnyEvent::Impl::Glib, AnyEvent::Impl::Tk, AnyEvent::Impl::Perl, AnyEvent::Impl::EventLib, AnyEvent::Impl::Qt, AnyEvent::Impl::POE. + Non-blocking file handles, sockets, TCP clients and servers: + AnyEvent::Handle, AnyEvent::Socket. + + Asynchronous DNS: AnyEvent::DNS. + Coroutine support: Coro, Coro::AnyEvent, Coro::EV, Coro::Event, - Nontrivial usage examples: Net::FCP, Net::XMPP2. + Nontrivial usage examples: Net::FCP, Net::XMPP2, AnyEvent::DNS. AUTHOR - Marc Lehmann - http://home.schmorp.de/ + Marc Lehmann + http://home.schmorp.de/