ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/Coro/README
Revision: 1.18
Committed: Thu Nov 20 09:37:21 2008 UTC (15 years, 6 months ago) by root
Branch: MAIN
CVS Tags: rel-5_0
Changes since 1.17: +134 -28 lines
Log Message:
5.0

File Contents

# User Rev Content
1 root 1.1 NAME
2     Coro - coroutine process abstraction
3    
4     SYNOPSIS
5 root 1.14 use Coro;
6    
7     async {
8     # some asynchronous thread of execution
9     print "2\n";
10     cede; # yield back to main
11     print "4\n";
12     };
13     print "1\n";
14     cede; # yield to coroutine
15     print "3\n";
16     cede; # and again
17    
18     # use locking
19 root 1.15 use Coro::Semaphore;
20 root 1.14 my $lock = new Coro::Semaphore;
21     my $locked;
22    
23     $lock->down;
24     $locked = 1;
25     $lock->up;
26 root 1.1
27     DESCRIPTION
28     This module collection manages coroutines. Coroutines are similar to
29 root 1.14 threads but don't (in general) run in parallel at the same time even on
30     SMP machines. The specific flavor of coroutine used in this module also
31     guarantees you that it will not switch between coroutines unless
32     necessary, at easily-identified points in your program, so locking and
33     parallel access are rarely an issue, making coroutine programming much
34     safer and easier than threads programming.
35    
36     Unlike a normal perl program, however, coroutines allow you to have
37     multiple running interpreters that share data, which is especially
38     useful to code pseudo-parallel processes and for event-based
39     programming, such as multiple HTTP-GET requests running concurrently.
40     See Coro::AnyEvent to learn more.
41    
42     Coroutines are also useful because Perl has no support for threads (the
43     so called "threads" that perl offers are nothing more than the (bad)
44     process emulation coming from the Windows platform: On standard
45     operating systems they serve no purpose whatsoever, except by making
46     your programs slow and making them use a lot of memory. Best disable
47     them when building perl, or aks your software vendor/distributor to do
48     it for you).
49 root 1.1
50     In this module, coroutines are defined as "callchain + lexical variables
51 root 1.5 + @_ + $_ + $@ + $/ + C stack), that is, a coroutine has its own
52     callchain, its own set of lexicals and its own set of perls most
53 root 1.12 important global variables (see Coro::State for more configuration).
54 root 1.1
55 root 1.14 $Coro::main
56     This variable stores the coroutine object that represents the main
57     program. While you cna "ready" it and do most other things you can
58     do to coroutines, it is mainly useful to compare again
59 root 1.15 $Coro::current, to see whether you are running in the main program
60     or not.
61 root 1.14
62     $Coro::current
63     The coroutine object representing the current coroutine (the last
64     coroutine that the Coro scheduler switched to). The initial value is
65 root 1.18 $Coro::main (of course).
66 root 1.14
67     This variable is strictly *read-only*. You can take copies of the
68     value stored in it and use it as any other coroutine object, but you
69     must not otherwise modify the variable itself.
70    
71     $Coro::idle
72     This variable is mainly useful to integrate Coro into event loops.
73     It is usually better to rely on Coro::AnyEvent or L"Coro::EV", as
74     this is pretty low-level functionality.
75    
76     This variable stores a callback that is called whenever the
77     scheduler finds no ready coroutines to run. The default
78     implementation prints "FATAL: deadlock detected" and exits, because
79     the program has no other way to continue.
80    
81     This hook is overwritten by modules such as "Coro::Timer" and
82     "Coro::AnyEvent" to wait on an external event that hopefully wake up
83     a coroutine so the scheduler can run it.
84 root 1.1
85 root 1.14 Note that the callback *must not*, under any circumstances, block
86     the current coroutine. Normally, this is achieved by having an "idle
87     coroutine" that calls the event loop and then blocks again, and then
88     readying that coroutine in the idle handler.
89 root 1.4
90 root 1.14 See Coro::Event or Coro::AnyEvent for examples of using this
91     technique.
92 root 1.4
93     Please note that if your callback recursively invokes perl (e.g. for
94 root 1.12 event handlers), then it must be prepared to be called recursively
95     itself.
96 root 1.1
97 root 1.14 SIMPLE COROUTINE CREATION
98     async { ... } [@args...]
99     Create a new coroutine and return it's coroutine object (usually
100     unused). The coroutine will be put into the ready queue, so it will
101     start running automatically on the next scheduler run.
102 root 1.1
103 root 1.14 The first argument is a codeblock/closure that should be executed in
104     the coroutine. When it returns argument returns the coroutine is
105 root 1.1 automatically terminated.
106    
107 root 1.14 The remaining arguments are passed as arguments to the closure.
108    
109 root 1.10 See the "Coro::State::new" constructor for info about the coroutine
110 root 1.14 environment in which coroutines are executed.
111 root 1.10
112 root 1.7 Calling "exit" in a coroutine will do the same as calling exit
113     outside the coroutine. Likewise, when the coroutine dies, the
114     program will exit, just as it would in the main program.
115 root 1.3
116 root 1.14 If you do not want that, you can provide a default "die" handler, or
117     simply avoid dieing (by use of "eval").
118    
119     Example: Create a new coroutine that just prints its arguments.
120    
121 root 1.1 async {
122     print "@_\n";
123     } 1,2,3,4;
124    
125 root 1.6 async_pool { ... } [@args...]
126     Similar to "async", but uses a coroutine pool, so you should not
127 root 1.14 call terminate or join on it (although you are allowed to), and you
128     get a coroutine that might have executed other code already (which
129     can be good or bad :).
130    
131 root 1.18 On the plus side, this function is about twice as fast as creating
132     (and destroying) a completely new coroutine, so if you need a lot of
133 root 1.14 generic coroutines in quick successsion, use "async_pool", not
134     "async".
135 root 1.6
136 root 1.14 The code block is executed in an "eval" context and a warning will
137 root 1.6 be issued in case of an exception instead of terminating the
138     program, as "async" does. As the coroutine is being reused, stuff
139     like "on_destroy" will not work in the expected way, unless you call
140 root 1.14 terminate or cancel, which somehow defeats the purpose of pooling
141     (but is fine in the exceptional case).
142 root 1.6
143 root 1.14 The priority will be reset to 0 after each run, tracing will be
144 root 1.10 disabled, the description will be reset and the default output
145 root 1.14 filehandle gets restored, so you can change all these. Otherwise the
146     coroutine will be re-used "as-is": most notably if you change other
147 root 1.16 per-coroutine global stuff such as $/ you *must needs* revert that
148     change, which is most simply done by using local as in: "local $/".
149    
150     The idle pool size is limited to 8 idle coroutines (this can be
151     adjusted by changing $Coro::POOL_SIZE), but there can be as many
152     non-idle coros as required.
153 root 1.6
154     If you are concerned about pooled coroutines growing a lot because a
155     single "async_pool" used a lot of stackspace you can e.g.
156     "async_pool { terminate }" once per second or so to slowly replenish
157 root 1.9 the pool. In addition to that, when the stacks used by a handler
158 root 1.14 grows larger than 16kb (adjustable via $Coro::POOL_RSS) it will also
159     be destroyed.
160    
161     STATIC METHODS
162     Static methods are actually functions that operate on the current
163     coroutine.
164 root 1.6
165 root 1.1 schedule
166 root 1.14 Calls the scheduler. The scheduler will find the next coroutine that
167     is to be run from the ready queue and switches to it. The next
168     coroutine to be run is simply the one with the highest priority that
169     is longest in its ready queue. If there is no coroutine ready, it
170     will clal the $Coro::idle hook.
171    
172     Please note that the current coroutine will *not* be put into the
173     ready queue, so calling this function usually means you will never
174     be called again unless something else (e.g. an event handler) calls
175     "->ready", thus waking you up.
176    
177     This makes "schedule" *the* generic method to use to block the
178     current coroutine and wait for events: first you remember the
179     current coroutine in a variable, then arrange for some callback of
180     yours to call "->ready" on that once some event happens, and last
181     you call "schedule" to put yourself to sleep. Note that a lot of
182 root 1.15 things can wake your coroutine up, so you need to check whether the
183 root 1.14 event indeed happened, e.g. by storing the status in a variable.
184 root 1.4
185 root 1.18 See HOW TO WAIT FOR A CALLBACK, below, for some ways to wait for
186     callbacks.
187 root 1.1
188     cede
189 root 1.4 "Cede" to other coroutines. This function puts the current coroutine
190 root 1.1 into the ready queue and calls "schedule", which has the effect of
191     giving up the current "timeslice" to other coroutines of the same or
192 root 1.14 higher priority. Once your coroutine gets its turn again it will
193     automatically be resumed.
194    
195     This function is often called "yield" in other languages.
196 root 1.1
197 root 1.6 Coro::cede_notself
198 root 1.14 Works like cede, but is not exported by default and will cede to
199     *any* coroutine, regardless of priority. This is useful sometimes to
200     ensure progress is made.
201 root 1.6
202 root 1.1 terminate [arg...]
203 root 1.4 Terminates the current coroutine with the given status values (see
204 root 1.1 cancel).
205    
206 root 1.10 killall
207     Kills/terminates/cancels all coroutines except the currently running
208     one. This is useful after a fork, either in the child or the parent,
209     as usually only one of them should inherit the running coroutines.
210    
211 root 1.14 Note that while this will try to free some of the main programs
212 root 1.15 resources, you cannot free all of them, so if a coroutine that is
213 root 1.14 not the main program calls this function, there will be some
214     one-time resource leak.
215 root 1.1
216 root 1.4 COROUTINE METHODS
217 root 1.14 These are the methods you can call on coroutine objects (or to create
218     them).
219 root 1.1
220     new Coro \&sub [, @args...]
221 root 1.14 Create a new coroutine and return it. When the sub returns, the
222 root 1.4 coroutine automatically terminates as if "terminate" with the
223     returned values were called. To make the coroutine run you must
224     first put it into the ready queue by calling the ready method.
225    
226 root 1.10 See "async" and "Coro::State::new" for additional info about the
227     coroutine environment.
228 root 1.4
229     $success = $coroutine->ready
230 root 1.14 Put the given coroutine into the end of its ready queue (there is
231     one queue for each priority) and return true. If the coroutine is
232     already in the ready queue, do nothing and return false.
233    
234     This ensures that the scheduler will resume this coroutine
235     automatically once all the coroutines of higher priority and all
236     coroutines of the same priority that were put into the ready queue
237     earlier have been resumed.
238 root 1.4
239     $is_ready = $coroutine->is_ready
240 root 1.15 Return whether the coroutine is currently the ready queue or not,
241 root 1.4
242     $coroutine->cancel (arg...)
243     Terminates the given coroutine and makes it return the given
244 root 1.6 arguments as status (default: the empty list). Never returns if the
245     coroutine is the current coroutine.
246 root 1.1
247 root 1.18 $coroutine->schedule_to
248     Puts the current coroutine to sleep (like "Coro::schedule"), but
249     instead of continuing with the next coro from the ready queue,
250     always switch to the given coroutine object (regardless of priority
251     etc.). The readyness state of that coroutine isn't changed.
252    
253     This is an advanced method for special cases - I'd love to hear
254     about any uses for this one.
255    
256     $coroutine->cede_to
257     Like "schedule_to", but puts the current coroutine into the ready
258     queue. This has the effect of temporarily switching to the given
259     coroutine, and continuing some time later.
260    
261     This is an advanced method for special cases - I'd love to hear
262     about any uses for this one.
263    
264 root 1.17 $coroutine->throw ([$scalar])
265     If $throw is specified and defined, it will be thrown as an
266 root 1.18 exception inside the coroutine at the next convenient point in time.
267 root 1.17 Otherwise clears the exception object.
268    
269 root 1.18 Coro will check for the exception each time a schedule-like-function
270     returns, i.e. after each "schedule", "cede",
271     "Coro::Semaphore->down", "Coro::Handle->readable" and so on. Most of
272     these functions detect this case and return early in case an
273     exception is pending.
274    
275 root 1.17 The exception object will be thrown "as is" with the specified
276     scalar in $@, i.e. if it is a string, no line number or newline will
277     be appended (unlike with "die").
278    
279     This can be used as a softer means than "cancel" to ask a coroutine
280     to end itself, although there is no guarantee that the exception
281     will lead to termination, and if the exception isn't caught it might
282     well end the whole program.
283    
284     You might also think of "throw" as being the moral equivalent of
285     "kill"ing a coroutine with a signal (in this case, a scalar).
286    
287 root 1.4 $coroutine->join
288 root 1.1 Wait until the coroutine terminates and return any values given to
289 root 1.10 the "terminate" or "cancel" functions. "join" can be called
290 root 1.14 concurrently from multiple coroutines, and all will be resumed and
291     given the status return once the $coroutine terminates.
292 root 1.1
293 root 1.6 $coroutine->on_destroy (\&cb)
294     Registers a callback that is called when this coroutine gets
295     destroyed, but before it is joined. The callback gets passed the
296 root 1.14 terminate arguments, if any, and *must not* die, under any
297     circumstances.
298 root 1.6
299 root 1.4 $oldprio = $coroutine->prio ($newprio)
300 root 1.1 Sets (or gets, if the argument is missing) the priority of the
301 root 1.4 coroutine. Higher priority coroutines get run before lower priority
302     coroutines. Priorities are small signed integers (currently -4 ..
303 root 1.1 +3), that you can refer to using PRIO_xxx constants (use the import
304     tag :prio to get then):
305    
306     PRIO_MAX > PRIO_HIGH > PRIO_NORMAL > PRIO_LOW > PRIO_IDLE > PRIO_MIN
307     3 > 1 > 0 > -1 > -3 > -4
308    
309     # set priority to HIGH
310     current->prio(PRIO_HIGH);
311    
312     The idle coroutine ($Coro::idle) always has a lower priority than
313     any existing coroutine.
314    
315 root 1.4 Changing the priority of the current coroutine will take effect
316     immediately, but changing the priority of coroutines in the ready
317 root 1.1 queue (but not running) will only take effect after the next
318 root 1.4 schedule (of that coroutine). This is a bug that will be fixed in
319     some future version.
320 root 1.1
321 root 1.4 $newprio = $coroutine->nice ($change)
322 root 1.1 Similar to "prio", but subtract the given value from the priority
323     (i.e. higher values mean lower priority, just as in unix).
324    
325 root 1.4 $olddesc = $coroutine->desc ($newdesc)
326 root 1.1 Sets (or gets in case the argument is missing) the description for
327 root 1.4 this coroutine. This is just a free-form string you can associate
328     with a coroutine.
329    
330 root 1.10 This method simply sets the "$coroutine->{desc}" member to the given
331     string. You can modify this member directly if you wish.
332    
333 root 1.5 GLOBAL FUNCTIONS
334     Coro::nready
335     Returns the number of coroutines that are currently in the ready
336 root 1.14 state, i.e. that can be switched to by calling "schedule" directory
337     or indirectly. The value 0 means that the only runnable coroutine is
338     the currently running one, so "cede" would have no effect, and
339     "schedule" would cause a deadlock unless there is an idle handler
340     that wakes up some coroutines.
341 root 1.5
342 root 1.6 my $guard = Coro::guard { ... }
343     This creates and returns a guard object. Nothing happens until the
344 root 1.7 object gets destroyed, in which case the codeblock given as argument
345 root 1.6 will be executed. This is useful to free locks or other resources in
346     case of a runtime error or when the coroutine gets canceled, as in
347     both cases the guard block will be executed. The guard object
348     supports only one method, "->cancel", which will keep the codeblock
349     from being executed.
350    
351     Example: set some flag and clear it again when the coroutine gets
352     canceled or the function returns:
353    
354     sub do_something {
355     my $guard = Coro::guard { $busy = 0 };
356     $busy = 1;
357    
358     # do something that requires $busy to be true
359     }
360    
361 root 1.4 unblock_sub { ... }
362     This utility function takes a BLOCK or code reference and "unblocks"
363 root 1.14 it, returning a new coderef. Unblocking means that calling the new
364     coderef will return immediately without blocking, returning nothing,
365     while the original code ref will be called (with parameters) from
366     within another coroutine.
367 root 1.4
368 root 1.8 The reason this function exists is that many event libraries (such
369 root 1.4 as the venerable Event module) are not coroutine-safe (a weaker form
370     of thread-safety). This means you must not block within event
371 root 1.14 callbacks, otherwise you might suffer from crashes or worse. The
372     only event library currently known that is safe to use without
373     "unblock_sub" is EV.
374 root 1.4
375     This function allows your callbacks to block by executing them in
376     another coroutine where it is safe to block. One example where
377     blocking is handy is when you use the Coro::AIO functions to save
378 root 1.14 results to disk, for example.
379 root 1.4
380     In short: simply use "unblock_sub { ... }" instead of "sub { ... }"
381     when creating event callbacks that want to block.
382 root 1.1
383 root 1.14 If your handler does not plan to block (e.g. simply sends a message
384     to another coroutine, or puts some other coroutine into the ready
385     queue), there is no reason to use "unblock_sub".
386    
387     Note that you also need to use "unblock_sub" for any other callbacks
388     that are indirectly executed by any C-based event loop. For example,
389     when you use a module that uses AnyEvent (and you use
390     Coro::AnyEvent) and it provides callbacks that are the result of
391     some event callback, then you must not block either, or use
392     "unblock_sub".
393    
394 root 1.18 $cb = Coro::rouse_cb
395     Create and return a "rouse callback". That's a code reference that,
396     when called, will save its arguments and notify the owner coroutine
397     of the callback.
398    
399     See the next function.
400    
401     @args = Coro::rouse_wait [$cb]
402     Wait for the specified rouse callback (or the last one tht was
403     created in this coroutine).
404    
405     As soon as the callback is invoked (or when the calback was invoked
406     before "rouse_wait"), it will return a copy of the arguments
407     originally passed to the rouse callback.
408    
409     See the section HOW TO WAIT FOR A CALLBACK for an actual usage
410     example.
411    
412     HOW TO WAIT FOR A CALLBACK
413     It is very common for a coroutine to wait for some callback to be
414     called. This occurs naturally when you use coroutines in an otherwise
415     event-based program, or when you use event-based libraries.
416    
417     These typically register a callback for some event, and call that
418     callback when the event occured. In a coroutine, however, you typically
419     want to just wait for the event, simplyifying things.
420    
421     For example "AnyEvent->child" registers a callback to be called when a
422     specific child has exited:
423    
424     my $child_watcher = AnyEvent->child (pid => $pid, cb => sub { ... });
425    
426     But from withina coroutine, you often just want to write this:
427    
428     my $status = wait_for_child $pid;
429    
430     Coro offers two functions specifically designed to make this easy,
431     "Coro::rouse_cb" and "Coro::rouse_wait".
432    
433     The first function, "rouse_cb", generates and returns a callback that,
434     when invoked, will save it's arguments and notify the coroutine that
435     created the callback.
436    
437     The second function, "rouse_wait", waits for the callback to be called
438     (by calling "schedule" to go to sleep) and returns the arguments
439     originally passed to the callback.
440    
441     Using these functions, it becomes easy to write the "wait_for_child"
442     function mentioned above:
443    
444     sub wait_for_child($) {
445     my ($pid) = @_;
446    
447     my $watcher = AnyEvent->child (pid => $pid, cb => Coro::rouse_cb);
448    
449     my ($rpid, $rstatus) = Coro::rouse_wait;
450     $rstatus
451     }
452    
453     In the case where "rouse_cb" and "rouse_wait" are not flexible enough,
454     you can roll your own, using "schedule":
455    
456     sub wait_for_child($) {
457     my ($pid) = @_;
458    
459     # store the current coroutine in $current,
460     # and provide result variables for the closure passed to ->child
461     my $current = $Coro::current;
462     my ($done, $rstatus);
463    
464     # pass a closure to ->child
465     my $watcher = AnyEvent->child (pid => $pid, cb => sub {
466     $rstatus = $_[1]; # remember rstatus
467     $done = 1; # mark $rstatus as valud
468     });
469    
470     # wait until the closure has been called
471     schedule while !$done;
472    
473     $rstatus
474     }
475    
476 root 1.1 BUGS/LIMITATIONS
477 root 1.18 fork with pthread backend
478     When Coro is compiled using the pthread backend (which isn't
479     recommended but required on many BSDs as their libcs are completely
480     broken), then coroutines will not survive a fork. There is no known
481     workaround except to fix your libc and use a saner backend.
482    
483     perl process emulation ("threads")
484     This module is not perl-pseudo-thread-safe. You should only ever use
485     this module from the same thread (this requirement might be removed
486     in the future to allow per-thread schedulers, but Coro::State does
487     not yet allow this). I recommend disabling thread support and using
488     processes, as having the windows process emulation enabled under
489     unix roughly halves perl performance, even when not used.
490    
491     coroutine switching not signal safe
492     You must not switch to another coroutine from within a signal
493     handler (only relevant with %SIG - most event libraries provide safe
494     signals).
495    
496     That means you *MUST NOT* call any function that might "block" the
497     current coroutine - "cede", "schedule" "Coro::Semaphore->down" or
498     anything that calls those. Everything else, including calling
499     "ready", works.
500 root 1.1
501     SEE ALSO
502 root 1.14 Event-Loop integration: Coro::AnyEvent, Coro::EV, Coro::Event.
503 root 1.12
504     Debugging: Coro::Debug.
505    
506     Support/Utility: Coro::Specific, Coro::Util.
507 root 1.2
508     Locking/IPC: Coro::Signal, Coro::Channel, Coro::Semaphore,
509     Coro::SemaphoreSet, Coro::RWLock.
510    
511 root 1.14 IO/Timers: Coro::Timer, Coro::Handle, Coro::Socket, Coro::AIO.
512    
513     Compatibility: Coro::LWP, Coro::BDB, Coro::Storable, Coro::Select.
514 root 1.12
515 root 1.14 XS API: Coro::MakeMaker.
516 root 1.2
517 root 1.14 Low level Configuration, Coroutine Environment: Coro::State.
518 root 1.1
519     AUTHOR
520     Marc Lehmann <schmorp@schmorp.de>
521     http://home.schmorp.de/
522