ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/Coro/Coro.pm
(Generate patch)

Comparing Coro/Coro.pm (file contents):
Revision 1.114 by root, Wed Jan 24 16:22:08 2007 UTC vs.
Revision 1.270 by root, Thu Oct 1 23:50:23 2009 UTC

1=head1 NAME 1=head1 NAME
2 2
3Coro - coroutine process abstraction 3Coro - the only real threads in perl
4 4
5=head1 SYNOPSIS 5=head1 SYNOPSIS
6 6
7 use Coro; 7 use Coro;
8 8
9 async { 9 async {
10 # some asynchronous thread of execution 10 # some asynchronous thread of execution
11 print "2\n";
12 cede; # yield back to main
13 print "4\n";
11 }; 14 };
12 15 print "1\n";
13 # alternatively create an async coroutine like this: 16 cede; # yield to coro
14 17 print "3\n";
15 sub some_func : Coro { 18 cede; # and again
16 # some more async code 19
17 } 20 # use locking
18 21 use Coro::Semaphore;
19 cede; 22 my $lock = new Coro::Semaphore;
23 my $locked;
24
25 $lock->down;
26 $locked = 1;
27 $lock->up;
20 28
21=head1 DESCRIPTION 29=head1 DESCRIPTION
22 30
23This module collection manages coroutines. Coroutines are similar 31For a tutorial-style introduction, please read the L<Coro::Intro>
24to threads but don't run in parallel at the same time even on SMP 32manpage. This manpage mainly contains reference information.
25machines. The specific flavor of coroutine use din this module also
26guarentees you that it will not switch between coroutines unless
27necessary, at easily-identified points in your program, so locking and
28parallel access are rarely an issue, making coroutine programming much
29safer than threads programming.
30 33
31(Perl, however, does not natively support real threads but instead does a 34This module collection manages continuations in general, most often in
32very slow and memory-intensive emulation of processes using threads. This 35the form of cooperative threads (also called coros, or simply "coro"
33is a performance win on Windows machines, and a loss everywhere else). 36in the documentation). They are similar to kernel threads but don't (in
37general) run in parallel at the same time even on SMP machines. The
38specific flavor of thread offered by this module also guarantees you that
39it will not switch between threads unless necessary, at easily-identified
40points in your program, so locking and parallel access are rarely an
41issue, making thread programming much safer and easier than using other
42thread models.
34 43
44Unlike the so-called "Perl threads" (which are not actually real threads
45but only the windows process emulation (see section of same name for more
46details) ported to unix, and as such act as processes), Coro provides
47a full shared address space, which makes communication between threads
48very easy. And Coro's threads are fast, too: disabling the Windows
49process emulation code in your perl and using Coro can easily result in
50a two to four times speed increase for your programs. A parallel matrix
51multiplication benchmark runs over 300 times faster on a single core than
52perl's pseudo-threads on a quad core using all four cores.
53
54Coro achieves that by supporting multiple running interpreters that share
55data, which is especially useful to code pseudo-parallel processes and
56for event-based programming, such as multiple HTTP-GET requests running
57concurrently. See L<Coro::AnyEvent> to learn more on how to integrate Coro
58into an event-based environment.
59
35In this module, coroutines are defined as "callchain + lexical variables + 60In this module, a thread is defined as "callchain + lexical variables +
36@_ + $_ + $@ + $/ + C stack), that is, a coroutine has its own callchain, 61some package variables + C stack), that is, a thread has its own callchain,
37its own set of lexicals and its own set of perls most important global 62its own set of lexicals and its own set of perls most important global
38variables. 63variables (see L<Coro::State> for more configuration and background info).
64
65See also the C<SEE ALSO> section at the end of this document - the Coro
66module family is quite large.
39 67
40=cut 68=cut
41 69
42package Coro; 70package Coro;
43 71
44use strict; 72use common::sense;
45no warnings "uninitialized"; 73
74use Carp ();
75
76use Guard ();
46 77
47use Coro::State; 78use Coro::State;
48 79
49use base qw(Coro::State Exporter); 80use base qw(Coro::State Exporter);
50 81
51our $idle; # idle handler 82our $idle; # idle handler
52our $main; # main coroutine 83our $main; # main coro
53our $current; # current coroutine 84our $current; # current coro
54 85
55our $VERSION = '3.5'; 86our $VERSION = 5.17;
56 87
57our @EXPORT = qw(async async_pool cede schedule terminate current unblock_sub); 88our @EXPORT = qw(async async_pool cede schedule terminate current unblock_sub);
58our %EXPORT_TAGS = ( 89our %EXPORT_TAGS = (
59 prio => [qw(PRIO_MAX PRIO_HIGH PRIO_NORMAL PRIO_LOW PRIO_IDLE PRIO_MIN)], 90 prio => [qw(PRIO_MAX PRIO_HIGH PRIO_NORMAL PRIO_LOW PRIO_IDLE PRIO_MIN)],
60); 91);
61our @EXPORT_OK = (@{$EXPORT_TAGS{prio}}, qw(nready)); 92our @EXPORT_OK = (@{$EXPORT_TAGS{prio}}, qw(nready));
62 93
63{ 94=head1 GLOBAL VARIABLES
64 my @async;
65 my $init;
66
67 # this way of handling attributes simply is NOT scalable ;()
68 sub import {
69 no strict 'refs';
70
71 Coro->export_to_level (1, @_);
72
73 my $old = *{(caller)[0]."::MODIFY_CODE_ATTRIBUTES"}{CODE};
74 *{(caller)[0]."::MODIFY_CODE_ATTRIBUTES"} = sub {
75 my ($package, $ref) = (shift, shift);
76 my @attrs;
77 for (@_) {
78 if ($_ eq "Coro") {
79 push @async, $ref;
80 unless ($init++) {
81 eval q{
82 sub INIT {
83 &async(pop @async) while @async;
84 }
85 };
86 }
87 } else {
88 push @attrs, $_;
89 }
90 }
91 return $old ? $old->($package, $ref, @attrs) : @attrs;
92 };
93 }
94
95}
96 95
97=over 4 96=over 4
98 97
99=item $main 98=item $Coro::main
100 99
101This coroutine represents the main program. 100This variable stores the Coro object that represents the main
101program. While you cna C<ready> it and do most other things you can do to
102coro, it is mainly useful to compare again C<$Coro::current>, to see
103whether you are running in the main program or not.
102 104
103=cut 105=cut
104 106
105$main = new Coro; 107# $main is now being initialised by Coro::State
106 108
107=item $current (or as function: current) 109=item $Coro::current
108 110
109The current coroutine (the last coroutine switched to). The initial value 111The Coro object representing the current coro (the last
112coro that the Coro scheduler switched to). The initial value is
110is C<$main> (of course). 113C<$Coro::main> (of course).
111 114
112This variable is B<strictly> I<read-only>. It is provided for performance 115This variable is B<strictly> I<read-only>. You can take copies of the
113reasons. If performance is not essentiel you are encouraged to use the 116value stored in it and use it as any other Coro object, but you must
114C<Coro::current> function instead. 117not otherwise modify the variable itself.
115 118
116=cut 119=cut
117 120
118# maybe some other module used Coro::Specific before...
119$main->{specific} = $current->{specific}
120 if $current;
121
122_set_current $main;
123
124sub current() { $current } 121sub current() { $current } # [DEPRECATED]
125 122
126=item $idle 123=item $Coro::idle
127 124
128A callback that is called whenever the scheduler finds no ready coroutines 125This variable is mainly useful to integrate Coro into event loops. It is
126usually better to rely on L<Coro::AnyEvent> or L<Coro::EV>, as this is
127pretty low-level functionality.
128
129This variable stores a Coro object that is put into the ready queue when
130there are no other ready threads (without invoking any ready hooks).
131
129to run. The default implementation prints "FATAL: deadlock detected" and 132The default implementation dies with "FATAL: deadlock detected.", followed
130exits, because the program has no other way to continue. 133by a thread listing, because the program has no other way to continue.
131 134
132This hook is overwritten by modules such as C<Coro::Timer> and 135This hook is overwritten by modules such as C<Coro::EV> and
133C<Coro::Event> to wait on an external event that hopefully wake up a 136C<Coro::AnyEvent> to wait on an external event that hopefully wake up a
134coroutine so the scheduler can run it. 137coro so the scheduler can run it.
135 138
136Please note that if your callback recursively invokes perl (e.g. for event 139See L<Coro::EV> or L<Coro::AnyEvent> for examples of using this technique.
137handlers), then it must be prepared to be called recursively.
138 140
139=cut 141=cut
140 142
141$idle = sub { 143$idle = new Coro sub {
142 require Carp; 144 require Coro::Debug;
143 Carp::croak ("FATAL: deadlock detected"); 145 die "FATAL: deadlock detected.\n"
146 . Coro::Debug::ps_listing ();
144}; 147};
145 148
146sub _cancel {
147 my ($self) = @_;
148
149 # free coroutine data and mark as destructed
150 $self->_destroy
151 or return;
152
153 # call all destruction callbacks
154 $_->(@{$self->{status}})
155 for @{(delete $self->{destroy_cb}) || []};
156}
157
158# this coroutine is necessary because a coroutine 149# this coro is necessary because a coro
159# cannot destroy itself. 150# cannot destroy itself.
160my @destroy; 151our @destroy;
161my $manager; 152our $manager;
162 153
163$manager = new Coro sub { 154$manager = new Coro sub {
164 while () { 155 while () {
165 (shift @destroy)->_cancel 156 Coro::State::cancel shift @destroy
166 while @destroy; 157 while @destroy;
167 158
168 &schedule; 159 &schedule;
169 } 160 }
170}; 161};
171 162$manager->{desc} = "[coro manager]";
172$manager->prio (PRIO_MAX); 163$manager->prio (PRIO_MAX);
173 164
174# static methods. not really.
175
176=back 165=back
177 166
178=head2 STATIC METHODS 167=head1 SIMPLE CORO CREATION
179
180Static methods are actually functions that operate on the current coroutine only.
181 168
182=over 4 169=over 4
183 170
184=item async { ... } [@args...] 171=item async { ... } [@args...]
185 172
186Create a new asynchronous coroutine and return it's coroutine object 173Create a new coro and return its Coro object (usually
187(usually unused). When the sub returns the new coroutine is automatically 174unused). The coro will be put into the ready queue, so
175it will start running automatically on the next scheduler run.
176
177The first argument is a codeblock/closure that should be executed in the
178coro. When it returns argument returns the coro is automatically
188terminated. 179terminated.
189 180
190Calling C<exit> in a coroutine will not work correctly, so do not do that. 181The remaining arguments are passed as arguments to the closure.
191 182
192When the coroutine dies, the program will exit, just as in the main 183See the C<Coro::State::new> constructor for info about the coro
193program. 184environment in which coro are executed.
194 185
186Calling C<exit> in a coro will do the same as calling exit outside
187the coro. Likewise, when the coro dies, the program will exit,
188just as it would in the main program.
189
190If you do not want that, you can provide a default C<die> handler, or
191simply avoid dieing (by use of C<eval>).
192
195 # create a new coroutine that just prints its arguments 193Example: Create a new coro that just prints its arguments.
194
196 async { 195 async {
197 print "@_\n"; 196 print "@_\n";
198 } 1,2,3,4; 197 } 1,2,3,4;
199 198
200=cut
201
202sub async(&@) {
203 my $coro = new Coro @_;
204 $coro->ready;
205 $coro
206}
207
208=item async_pool { ... } [@args...] 199=item async_pool { ... } [@args...]
209 200
210Similar to C<async>, but uses a coroutine pool, so you should not call 201Similar to C<async>, but uses a coro pool, so you should not call
211terminate or join (although you are allowed to), and you get a coroutine 202terminate or join on it (although you are allowed to), and you get a
212that might have executed other code already (which can be good or bad :). 203coro that might have executed other code already (which can be good
204or bad :).
213 205
206On the plus side, this function is about twice as fast as creating (and
207destroying) a completely new coro, so if you need a lot of generic
208coros in quick successsion, use C<async_pool>, not C<async>.
209
214Also, the block is executed in an C<eval> context and a warning will be 210The code block is executed in an C<eval> context and a warning will be
215issued in case of an exception instead of terminating the program, as 211issued in case of an exception instead of terminating the program, as
216C<async> does. As the coroutine is being reused, stuff like C<on_destroy> 212C<async> does. As the coro is being reused, stuff like C<on_destroy>
217will not work in the expected way, unless you call terminate or cancel, 213will not work in the expected way, unless you call terminate or cancel,
218which somehow defeats the purpose of pooling. 214which somehow defeats the purpose of pooling (but is fine in the
215exceptional case).
219 216
220The priority will be reset to C<0> after each job, otherwise the coroutine 217The priority will be reset to C<0> after each run, tracing will be
221will be re-used "as-is". 218disabled, the description will be reset and the default output filehandle
219gets restored, so you can change all these. Otherwise the coro will
220be re-used "as-is": most notably if you change other per-coro global
221stuff such as C<$/> you I<must needs> revert that change, which is most
222simply done by using local as in: C<< local $/ >>.
222 223
223The pool size is limited to 8 idle coroutines (this can be adjusted by 224The idle pool size is limited to C<8> idle coros (this can be
224changing $Coro::POOL_SIZE), and there can be as many non-idle coros as 225adjusted by changing $Coro::POOL_SIZE), but there can be as many non-idle
225required. 226coros as required.
226 227
227If you are concerned about pooled coroutines growing a lot because a 228If you are concerned about pooled coros growing a lot because a
228single C<async_pool> used a lot of stackspace you can e.g. C<async_pool { 229single C<async_pool> used a lot of stackspace you can e.g. C<async_pool
229terminate }> once per second or so to slowly replenish the pool. 230{ terminate }> once per second or so to slowly replenish the pool. In
231addition to that, when the stacks used by a handler grows larger than 32kb
232(adjustable via $Coro::POOL_RSS) it will also be destroyed.
230 233
231=cut 234=cut
232 235
233our $POOL_SIZE = 8; 236our $POOL_SIZE = 8;
237our $POOL_RSS = 32 * 1024;
234our @pool; 238our @async_pool;
235 239
236sub pool_handler { 240sub pool_handler {
237 while () { 241 while () {
238 eval { 242 eval {
239 my ($cb, @arg) = @{ delete $current->{_invoke} or return }; 243 &{&_pool_handler} while 1;
240 $cb->(@arg);
241 }; 244 };
245
242 warn $@ if $@; 246 warn $@ if $@;
243
244 last if @pool >= $POOL_SIZE;
245 push @pool, $current;
246
247 $current->prio (0);
248 schedule;
249 } 247 }
250} 248}
251 249
252sub async_pool(&@) { 250=back
253 # this is also inlined into the unlock_scheduler
254 my $coro = (pop @pool or new Coro \&pool_handler);
255 251
256 $coro->{_invoke} = [@_]; 252=head1 STATIC METHODS
257 $coro->ready;
258 253
259 $coro 254Static methods are actually functions that implicitly operate on the
255current coro.
256
257=over 4
258
259=item schedule
260
261Calls the scheduler. The scheduler will find the next coro that is
262to be run from the ready queue and switches to it. The next coro
263to be run is simply the one with the highest priority that is longest
264in its ready queue. If there is no coro ready, it will call the
265C<$Coro::idle> hook.
266
267Please note that the current coro will I<not> be put into the ready
268queue, so calling this function usually means you will never be called
269again unless something else (e.g. an event handler) calls C<< ->ready >>,
270thus waking you up.
271
272This makes C<schedule> I<the> generic method to use to block the current
273coro and wait for events: first you remember the current coro in
274a variable, then arrange for some callback of yours to call C<< ->ready
275>> on that once some event happens, and last you call C<schedule> to put
276yourself to sleep. Note that a lot of things can wake your coro up,
277so you need to check whether the event indeed happened, e.g. by storing the
278status in a variable.
279
280See B<HOW TO WAIT FOR A CALLBACK>, below, for some ways to wait for callbacks.
281
282=item cede
283
284"Cede" to other coros. This function puts the current coro into
285the ready queue and calls C<schedule>, which has the effect of giving
286up the current "timeslice" to other coros of the same or higher
287priority. Once your coro gets its turn again it will automatically be
288resumed.
289
290This function is often called C<yield> in other languages.
291
292=item Coro::cede_notself
293
294Works like cede, but is not exported by default and will cede to I<any>
295coro, regardless of priority. This is useful sometimes to ensure
296progress is made.
297
298=item terminate [arg...]
299
300Terminates the current coro with the given status values (see L<cancel>).
301
302=item Coro::on_enter BLOCK, Coro::on_leave BLOCK
303
304These function install enter and leave winders in the current scope. The
305enter block will be executed when on_enter is called and whenever the
306current coro is re-entered by the scheduler, while the leave block is
307executed whenever the current coro is blocked by the scheduler, and
308also when the containing scope is exited (by whatever means, be it exit,
309die, last etc.).
310
311I<Neither invoking the scheduler, nor exceptions, are allowed within those
312BLOCKs>. That means: do not even think about calling C<die> without an
313eval, and do not even think of entering the scheduler in any way.
314
315Since both BLOCKs are tied to the current scope, they will automatically
316be removed when the current scope exits.
317
318These functions implement the same concept as C<dynamic-wind> in scheme
319does, and are useful when you want to localise some resource to a specific
320coro.
321
322They slow down thread switching considerably for coros that use them
323(about 40% for a BLOCK with a single assignment, so thread switching is
324still reasonably fast if the handlers are fast).
325
326These functions are best understood by an example: The following function
327will change the current timezone to "Antarctica/South_Pole", which
328requires a call to C<tzset>, but by using C<on_enter> and C<on_leave>,
329which remember/change the current timezone and restore the previous
330value, respectively, the timezone is only changed for the coro that
331installed those handlers.
332
333 use POSIX qw(tzset);
334
335 async {
336 my $old_tz; # store outside TZ value here
337
338 Coro::on_enter {
339 $old_tz = $ENV{TZ}; # remember the old value
340
341 $ENV{TZ} = "Antarctica/South_Pole";
342 tzset; # enable new value
343 };
344
345 Coro::on_leave {
346 $ENV{TZ} = $old_tz;
347 tzset; # restore old value
348 };
349
350 # at this place, the timezone is Antarctica/South_Pole,
351 # without disturbing the TZ of any other coro.
352 };
353
354This can be used to localise about any resource (locale, uid, current
355working directory etc.) to a block, despite the existance of other
356coros.
357
358Another interesting example implements time-sliced multitasking using
359interval timers (this could obviously be optimised, but does the job):
360
361 # "timeslice" the given block
362 sub timeslice(&) {
363 use Time::HiRes ();
364
365 Coro::on_enter {
366 # on entering the thread, we set an VTALRM handler to cede
367 $SIG{VTALRM} = sub { cede };
368 # and then start the interval timer
369 Time::HiRes::setitimer &Time::HiRes::ITIMER_VIRTUAL, 0.01, 0.01;
370 };
371 Coro::on_leave {
372 # on leaving the thread, we stop the interval timer again
373 Time::HiRes::setitimer &Time::HiRes::ITIMER_VIRTUAL, 0, 0;
374 };
375
376 &{+shift};
377 }
378
379 # use like this:
380 timeslice {
381 # The following is an endless loop that would normally
382 # monopolise the process. Since it runs in a timesliced
383 # environment, it will regularly cede to other threads.
384 while () { }
385 };
386
387
388=item killall
389
390Kills/terminates/cancels all coros except the currently running one.
391
392Note that while this will try to free some of the main interpreter
393resources if the calling coro isn't the main coro, but one
394cannot free all of them, so if a coro that is not the main coro
395calls this function, there will be some one-time resource leak.
396
397=cut
398
399sub killall {
400 for (Coro::State::list) {
401 $_->cancel
402 if $_ != $current && UNIVERSAL::isa $_, "Coro";
403 }
260} 404}
261 405
262=item schedule 406=back
263 407
264Calls the scheduler. Please note that the current coroutine will not be put 408=head1 CORO OBJECT METHODS
265into the ready queue, so calling this function usually means you will
266never be called again unless something else (e.g. an event handler) calls
267ready.
268 409
269The canonical way to wait on external events is this: 410These are the methods you can call on coro objects (or to create
411them).
270 412
413=over 4
414
415=item new Coro \&sub [, @args...]
416
417Create a new coro and return it. When the sub returns, the coro
418automatically terminates as if C<terminate> with the returned values were
419called. To make the coro run you must first put it into the ready
420queue by calling the ready method.
421
422See C<async> and C<Coro::State::new> for additional info about the
423coro environment.
424
425=cut
426
427sub _coro_run {
428 terminate &{+shift};
429}
430
431=item $success = $coro->ready
432
433Put the given coro into the end of its ready queue (there is one
434queue for each priority) and return true. If the coro is already in
435the ready queue, do nothing and return false.
436
437This ensures that the scheduler will resume this coro automatically
438once all the coro of higher priority and all coro of the same
439priority that were put into the ready queue earlier have been resumed.
440
441=item $coro->suspend
442
443Suspends the specified coro. A suspended coro works just like any other
444coro, except that the scheduler will not select a suspended coro for
445execution.
446
447Suspending a coro can be useful when you want to keep the coro from
448running, but you don't want to destroy it, or when you want to temporarily
449freeze a coro (e.g. for debugging) to resume it later.
450
451A scenario for the former would be to suspend all (other) coros after a
452fork and keep them alive, so their destructors aren't called, but new
453coros can be created.
454
455=item $coro->resume
456
457If the specified coro was suspended, it will be resumed. Note that when
458the coro was in the ready queue when it was suspended, it might have been
459unreadied by the scheduler, so an activation might have been lost.
460
461To avoid this, it is best to put a suspended coro into the ready queue
462unconditionally, as every synchronisation mechanism must protect itself
463against spurious wakeups, and the one in the Coro family certainly do
464that.
465
466=item $is_ready = $coro->is_ready
467
468Returns true iff the Coro object is in the ready queue. Unless the Coro
469object gets destroyed, it will eventually be scheduled by the scheduler.
470
471=item $is_running = $coro->is_running
472
473Returns true iff the Coro object is currently running. Only one Coro object
474can ever be in the running state (but it currently is possible to have
475multiple running Coro::States).
476
477=item $is_suspended = $coro->is_suspended
478
479Returns true iff this Coro object has been suspended. Suspended Coros will
480not ever be scheduled.
481
482=item $coro->cancel (arg...)
483
484Terminates the given Coro and makes it return the given arguments as
485status (default: the empty list). Never returns if the Coro is the
486current Coro.
487
488=cut
489
490sub cancel {
491 my $self = shift;
492
493 if ($current == $self) {
494 terminate @_;
495 } else {
496 $self->{_status} = [@_];
497 Coro::State::cancel $self;
271 { 498 }
272 # remember current coroutine 499}
500
501=item $coro->schedule_to
502
503Puts the current coro to sleep (like C<Coro::schedule>), but instead
504of continuing with the next coro from the ready queue, always switch to
505the given coro object (regardless of priority etc.). The readyness
506state of that coro isn't changed.
507
508This is an advanced method for special cases - I'd love to hear about any
509uses for this one.
510
511=item $coro->cede_to
512
513Like C<schedule_to>, but puts the current coro into the ready
514queue. This has the effect of temporarily switching to the given
515coro, and continuing some time later.
516
517This is an advanced method for special cases - I'd love to hear about any
518uses for this one.
519
520=item $coro->throw ([$scalar])
521
522If C<$throw> is specified and defined, it will be thrown as an exception
523inside the coro at the next convenient point in time. Otherwise
524clears the exception object.
525
526Coro will check for the exception each time a schedule-like-function
527returns, i.e. after each C<schedule>, C<cede>, C<< Coro::Semaphore->down
528>>, C<< Coro::Handle->readable >> and so on. Most of these functions
529detect this case and return early in case an exception is pending.
530
531The exception object will be thrown "as is" with the specified scalar in
532C<$@>, i.e. if it is a string, no line number or newline will be appended
533(unlike with C<die>).
534
535This can be used as a softer means than C<cancel> to ask a coro to
536end itself, although there is no guarantee that the exception will lead to
537termination, and if the exception isn't caught it might well end the whole
538program.
539
540You might also think of C<throw> as being the moral equivalent of
541C<kill>ing a coro with a signal (in this case, a scalar).
542
543=item $coro->join
544
545Wait until the coro terminates and return any values given to the
546C<terminate> or C<cancel> functions. C<join> can be called concurrently
547from multiple coro, and all will be resumed and given the status
548return once the C<$coro> terminates.
549
550=cut
551
552sub join {
553 my $self = shift;
554
555 unless ($self->{_status}) {
273 my $current = $Coro::current; 556 my $current = $current;
274 557
275 # register a hypothetical event handler 558 push @{$self->{_on_destroy}}, sub {
276 on_event_invoke sub {
277 # wake up sleeping coroutine
278 $current->ready; 559 $current->ready;
279 undef $current; 560 undef $current;
280 }; 561 };
281 562
282 # call schedule until event occured.
283 # in case we are woken up for other reasons
284 # (current still defined), loop.
285 Coro::schedule while $current;
286 }
287
288=item cede
289
290"Cede" to other coroutines. This function puts the current coroutine into the
291ready queue and calls C<schedule>, which has the effect of giving up the
292current "timeslice" to other coroutines of the same or higher priority.
293
294Returns true if at least one coroutine switch has happened.
295
296=item Coro::cede_notself
297
298Works like cede, but is not exported by default and will cede to any
299coroutine, regardless of priority, once.
300
301Returns true if at least one coroutine switch has happened.
302
303=item terminate [arg...]
304
305Terminates the current coroutine with the given status values (see L<cancel>).
306
307=cut
308
309sub terminate {
310 $current->cancel (@_);
311}
312
313=back
314
315# dynamic methods
316
317=head2 COROUTINE METHODS
318
319These are the methods you can call on coroutine objects.
320
321=over 4
322
323=item new Coro \&sub [, @args...]
324
325Create a new coroutine and return it. When the sub returns the coroutine
326automatically terminates as if C<terminate> with the returned values were
327called. To make the coroutine run you must first put it into the ready queue
328by calling the ready method.
329
330Calling C<exit> in a coroutine will not work correctly, so do not do that.
331
332=cut
333
334sub _run_coro {
335 terminate &{+shift};
336}
337
338sub new {
339 my $class = shift;
340
341 $class->SUPER::new (\&_run_coro, @_)
342}
343
344=item $success = $coroutine->ready
345
346Put the given coroutine into the ready queue (according to it's priority)
347and return true. If the coroutine is already in the ready queue, do nothing
348and return false.
349
350=item $is_ready = $coroutine->is_ready
351
352Return wether the coroutine is currently the ready queue or not,
353
354=item $coroutine->cancel (arg...)
355
356Terminates the given coroutine and makes it return the given arguments as
357status (default: the empty list). Never returns if the coroutine is the
358current coroutine.
359
360=cut
361
362sub cancel {
363 my $self = shift;
364 $self->{status} = [@_];
365
366 if ($current == $self) {
367 push @destroy, $self;
368 $manager->ready;
369 &schedule while 1;
370 } else {
371 $self->_cancel;
372 }
373}
374
375=item $coroutine->join
376
377Wait until the coroutine terminates and return any values given to the
378C<terminate> or C<cancel> functions. C<join> can be called multiple times
379from multiple coroutine.
380
381=cut
382
383sub join {
384 my $self = shift;
385
386 unless ($self->{status}) {
387 my $current = $current;
388
389 push @{$self->{destroy_cb}}, sub {
390 $current->ready;
391 undef $current;
392 };
393
394 &schedule while $current; 563 &schedule while $current;
395 } 564 }
396 565
397 wantarray ? @{$self->{status}} : $self->{status}[0]; 566 wantarray ? @{$self->{_status}} : $self->{_status}[0];
398} 567}
399 568
400=item $coroutine->on_destroy (\&cb) 569=item $coro->on_destroy (\&cb)
401 570
402Registers a callback that is called when this coroutine gets destroyed, 571Registers a callback that is called when this coro gets destroyed,
403but before it is joined. The callback gets passed the terminate arguments, 572but before it is joined. The callback gets passed the terminate arguments,
404if any. 573if any, and I<must not> die, under any circumstances.
405 574
406=cut 575=cut
407 576
408sub on_destroy { 577sub on_destroy {
409 my ($self, $cb) = @_; 578 my ($self, $cb) = @_;
410 579
411 push @{ $self->{destroy_cb} }, $cb; 580 push @{ $self->{_on_destroy} }, $cb;
412} 581}
413 582
414=item $oldprio = $coroutine->prio ($newprio) 583=item $oldprio = $coro->prio ($newprio)
415 584
416Sets (or gets, if the argument is missing) the priority of the 585Sets (or gets, if the argument is missing) the priority of the
417coroutine. Higher priority coroutines get run before lower priority 586coro. Higher priority coro get run before lower priority
418coroutines. Priorities are small signed integers (currently -4 .. +3), 587coro. Priorities are small signed integers (currently -4 .. +3),
419that you can refer to using PRIO_xxx constants (use the import tag :prio 588that you can refer to using PRIO_xxx constants (use the import tag :prio
420to get then): 589to get then):
421 590
422 PRIO_MAX > PRIO_HIGH > PRIO_NORMAL > PRIO_LOW > PRIO_IDLE > PRIO_MIN 591 PRIO_MAX > PRIO_HIGH > PRIO_NORMAL > PRIO_LOW > PRIO_IDLE > PRIO_MIN
423 3 > 1 > 0 > -1 > -3 > -4 592 3 > 1 > 0 > -1 > -3 > -4
424 593
425 # set priority to HIGH 594 # set priority to HIGH
426 current->prio(PRIO_HIGH); 595 current->prio (PRIO_HIGH);
427 596
428The idle coroutine ($Coro::idle) always has a lower priority than any 597The idle coro ($Coro::idle) always has a lower priority than any
429existing coroutine. 598existing coro.
430 599
431Changing the priority of the current coroutine will take effect immediately, 600Changing the priority of the current coro will take effect immediately,
432but changing the priority of coroutines in the ready queue (but not 601but changing the priority of coro in the ready queue (but not
433running) will only take effect after the next schedule (of that 602running) will only take effect after the next schedule (of that
434coroutine). This is a bug that will be fixed in some future version. 603coro). This is a bug that will be fixed in some future version.
435 604
436=item $newprio = $coroutine->nice ($change) 605=item $newprio = $coro->nice ($change)
437 606
438Similar to C<prio>, but subtract the given value from the priority (i.e. 607Similar to C<prio>, but subtract the given value from the priority (i.e.
439higher values mean lower priority, just as in unix). 608higher values mean lower priority, just as in unix).
440 609
441=item $olddesc = $coroutine->desc ($newdesc) 610=item $olddesc = $coro->desc ($newdesc)
442 611
443Sets (or gets in case the argument is missing) the description for this 612Sets (or gets in case the argument is missing) the description for this
444coroutine. This is just a free-form string you can associate with a coroutine. 613coro. This is just a free-form string you can associate with a
614coro.
615
616This method simply sets the C<< $coro->{desc} >> member to the given
617string. You can modify this member directly if you wish.
445 618
446=cut 619=cut
447 620
448sub desc { 621sub desc {
449 my $old = $_[0]{desc}; 622 my $old = $_[0]{desc};
450 $_[0]{desc} = $_[1] if @_ > 1; 623 $_[0]{desc} = $_[1] if @_ > 1;
451 $old; 624 $old;
452} 625}
453 626
627sub transfer {
628 require Carp;
629 Carp::croak ("You must not call ->transfer on Coro objects. Use Coro::State objects or the ->schedule_to method. Caught");
630}
631
454=back 632=back
455 633
456=head2 GLOBAL FUNCTIONS 634=head1 GLOBAL FUNCTIONS
457 635
458=over 4 636=over 4
459 637
460=item Coro::nready 638=item Coro::nready
461 639
462Returns the number of coroutines that are currently in the ready state, 640Returns the number of coro that are currently in the ready state,
463i.e. that can be swicthed to. The value C<0> means that the only runnable 641i.e. that can be switched to by calling C<schedule> directory or
642indirectly. The value C<0> means that the only runnable coro is the
464coroutine is the currently running one, so C<cede> would have no effect, 643currently running one, so C<cede> would have no effect, and C<schedule>
465and C<schedule> would cause a deadlock unless there is an idle handler 644would cause a deadlock unless there is an idle handler that wakes up some
466that wakes up some coroutines. 645coro.
467 646
468=item my $guard = Coro::guard { ... } 647=item my $guard = Coro::guard { ... }
469 648
470This creates and returns a guard object. Nothing happens until the objetc 649This function still exists, but is deprecated. Please use the
471gets destroyed, in which case the codeblock given as argument will be 650C<Guard::guard> function instead.
472executed. This is useful to free locks or other resources in case of a
473runtime error or when the coroutine gets canceled, as in both cases the
474guard block will be executed. The guard object supports only one method,
475C<< ->cancel >>, which will keep the codeblock from being executed.
476 651
477Example: set some flag and clear it again when the coroutine gets canceled
478or the function returns:
479
480 sub do_something {
481 my $guard = Coro::guard { $busy = 0 };
482 $busy = 1;
483
484 # do something that requires $busy to be true
485 }
486
487=cut 652=cut
488 653
489sub guard(&) { 654BEGIN { *guard = \&Guard::guard }
490 bless \(my $cb = $_[0]), "Coro::guard"
491}
492
493sub Coro::guard::cancel {
494 ${$_[0]} = sub { };
495}
496
497sub Coro::guard::DESTROY {
498 ${$_[0]}->();
499}
500
501 655
502=item unblock_sub { ... } 656=item unblock_sub { ... }
503 657
504This utility function takes a BLOCK or code reference and "unblocks" it, 658This utility function takes a BLOCK or code reference and "unblocks" it,
505returning the new coderef. This means that the new coderef will return 659returning a new coderef. Unblocking means that calling the new coderef
506immediately without blocking, returning nothing, while the original code 660will return immediately without blocking, returning nothing, while the
507ref will be called (with parameters) from within its own coroutine. 661original code ref will be called (with parameters) from within another
662coro.
508 663
509The reason this fucntion exists is that many event libraries (such as the 664The reason this function exists is that many event libraries (such as the
510venerable L<Event|Event> module) are not coroutine-safe (a weaker form 665venerable L<Event|Event> module) are not thread-safe (a weaker form
511of thread-safety). This means you must not block within event callbacks, 666of reentrancy). This means you must not block within event callbacks,
512otherwise you might suffer from crashes or worse. 667otherwise you might suffer from crashes or worse. The only event library
668currently known that is safe to use without C<unblock_sub> is L<EV>.
513 669
514This function allows your callbacks to block by executing them in another 670This function allows your callbacks to block by executing them in another
515coroutine where it is safe to block. One example where blocking is handy 671coro where it is safe to block. One example where blocking is handy
516is when you use the L<Coro::AIO|Coro::AIO> functions to save results to 672is when you use the L<Coro::AIO|Coro::AIO> functions to save results to
517disk. 673disk, for example.
518 674
519In short: simply use C<unblock_sub { ... }> instead of C<sub { ... }> when 675In short: simply use C<unblock_sub { ... }> instead of C<sub { ... }> when
520creating event callbacks that want to block. 676creating event callbacks that want to block.
677
678If your handler does not plan to block (e.g. simply sends a message to
679another coro, or puts some other coro into the ready queue), there is
680no reason to use C<unblock_sub>.
681
682Note that you also need to use C<unblock_sub> for any other callbacks that
683are indirectly executed by any C-based event loop. For example, when you
684use a module that uses L<AnyEvent> (and you use L<Coro::AnyEvent>) and it
685provides callbacks that are the result of some event callback, then you
686must not block either, or use C<unblock_sub>.
521 687
522=cut 688=cut
523 689
524our @unblock_queue; 690our @unblock_queue;
525 691
526# we create a special coro because we want to cede, 692# we create a special coro because we want to cede,
527# to reduce pressure on the coro pool (because most callbacks 693# to reduce pressure on the coro pool (because most callbacks
528# return immediately and can be reused) and because we cannot cede 694# return immediately and can be reused) and because we cannot cede
529# inside an event callback. 695# inside an event callback.
530our $unblock_scheduler = async { 696our $unblock_scheduler = new Coro sub {
531 while () { 697 while () {
532 while (my $cb = pop @unblock_queue) { 698 while (my $cb = pop @unblock_queue) {
533 # this is an inlined copy of async_pool 699 &async_pool (@$cb);
534 my $coro = (pop @pool or new Coro \&pool_handler);
535 700
536 $coro->{_invoke} = $cb;
537 $coro->ready;
538 cede; # for short-lived callbacks, this reduces pressure on the coro pool 701 # for short-lived callbacks, this reduces pressure on the coro pool
702 # as the chance is very high that the async_poll coro will be back
703 # in the idle state when cede returns
704 cede;
539 } 705 }
540 schedule; # sleep well 706 schedule; # sleep well
541 } 707 }
542}; 708};
709$unblock_scheduler->{desc} = "[unblock_sub scheduler]";
543 710
544sub unblock_sub(&) { 711sub unblock_sub(&) {
545 my $cb = shift; 712 my $cb = shift;
546 713
547 sub { 714 sub {
548 unshift @unblock_queue, [$cb, @_]; 715 unshift @unblock_queue, [$cb, @_];
549 $unblock_scheduler->ready; 716 $unblock_scheduler->ready;
550 } 717 }
551} 718}
552 719
720=item $cb = Coro::rouse_cb
721
722Create and return a "rouse callback". That's a code reference that,
723when called, will remember a copy of its arguments and notify the owner
724coro of the callback.
725
726See the next function.
727
728=item @args = Coro::rouse_wait [$cb]
729
730Wait for the specified rouse callback (or the last one that was created in
731this coro).
732
733As soon as the callback is invoked (or when the callback was invoked
734before C<rouse_wait>), it will return the arguments originally passed to
735the rouse callback. In scalar context, that means you get the I<last>
736argument, just as if C<rouse_wait> had a C<return ($a1, $a2, $a3...)>
737statement at the end.
738
739See the section B<HOW TO WAIT FOR A CALLBACK> for an actual usage example.
740
553=back 741=back
554 742
555=cut 743=cut
556 744
5571; 7451;
558 746
747=head1 HOW TO WAIT FOR A CALLBACK
748
749It is very common for a coro to wait for some callback to be
750called. This occurs naturally when you use coro in an otherwise
751event-based program, or when you use event-based libraries.
752
753These typically register a callback for some event, and call that callback
754when the event occured. In a coro, however, you typically want to
755just wait for the event, simplyifying things.
756
757For example C<< AnyEvent->child >> registers a callback to be called when
758a specific child has exited:
759
760 my $child_watcher = AnyEvent->child (pid => $pid, cb => sub { ... });
761
762But from within a coro, you often just want to write this:
763
764 my $status = wait_for_child $pid;
765
766Coro offers two functions specifically designed to make this easy,
767C<Coro::rouse_cb> and C<Coro::rouse_wait>.
768
769The first function, C<rouse_cb>, generates and returns a callback that,
770when invoked, will save its arguments and notify the coro that
771created the callback.
772
773The second function, C<rouse_wait>, waits for the callback to be called
774(by calling C<schedule> to go to sleep) and returns the arguments
775originally passed to the callback.
776
777Using these functions, it becomes easy to write the C<wait_for_child>
778function mentioned above:
779
780 sub wait_for_child($) {
781 my ($pid) = @_;
782
783 my $watcher = AnyEvent->child (pid => $pid, cb => Coro::rouse_cb);
784
785 my ($rpid, $rstatus) = Coro::rouse_wait;
786 $rstatus
787 }
788
789In the case where C<rouse_cb> and C<rouse_wait> are not flexible enough,
790you can roll your own, using C<schedule>:
791
792 sub wait_for_child($) {
793 my ($pid) = @_;
794
795 # store the current coro in $current,
796 # and provide result variables for the closure passed to ->child
797 my $current = $Coro::current;
798 my ($done, $rstatus);
799
800 # pass a closure to ->child
801 my $watcher = AnyEvent->child (pid => $pid, cb => sub {
802 $rstatus = $_[1]; # remember rstatus
803 $done = 1; # mark $rstatus as valud
804 });
805
806 # wait until the closure has been called
807 schedule while !$done;
808
809 $rstatus
810 }
811
812
559=head1 BUGS/LIMITATIONS 813=head1 BUGS/LIMITATIONS
560 814
561 - you must make very sure that no coro is still active on global 815=over 4
562 destruction. very bad things might happen otherwise (usually segfaults).
563 816
817=item fork with pthread backend
818
819When Coro is compiled using the pthread backend (which isn't recommended
820but required on many BSDs as their libcs are completely broken), then
821coro will not survive a fork. There is no known workaround except to
822fix your libc and use a saner backend.
823
824=item perl process emulation ("threads")
825
564 - this module is not thread-safe. You should only ever use this module 826This module is not perl-pseudo-thread-safe. You should only ever use this
565 from the same thread (this requirement might be losened in the future 827module from the first thread (this requirement might be removed in the
566 to allow per-thread schedulers, but Coro::State does not yet allow 828future to allow per-thread schedulers, but Coro::State does not yet allow
567 this). 829this). I recommend disabling thread support and using processes, as having
830the windows process emulation enabled under unix roughly halves perl
831performance, even when not used.
832
833=item coro switching is not signal safe
834
835You must not switch to another coro from within a signal handler
836(only relevant with %SIG - most event libraries provide safe signals).
837
838That means you I<MUST NOT> call any function that might "block" the
839current coro - C<cede>, C<schedule> C<< Coro::Semaphore->down >> or
840anything that calls those. Everything else, including calling C<ready>,
841works.
842
843=back
844
845
846=head1 WINDOWS PROCESS EMULATION
847
848A great many people seem to be confused about ithreads (for example, Chip
849Salzenberg called me unintelligent, incapable, stupid and gullible,
850while in the same mail making rather confused statements about perl
851ithreads (for example, that memory or files would be shared), showing his
852lack of understanding of this area - if it is hard to understand for Chip,
853it is probably not obvious to everybody).
854
855What follows is an ultra-condensed version of my talk about threads in
856scripting languages given onthe perl workshop 2009:
857
858The so-called "ithreads" were originally implemented for two reasons:
859first, to (badly) emulate unix processes on native win32 perls, and
860secondly, to replace the older, real thread model ("5.005-threads").
861
862It does that by using threads instead of OS processes. The difference
863between processes and threads is that threads share memory (and other
864state, such as files) between threads within a single process, while
865processes do not share anything (at least not semantically). That
866means that modifications done by one thread are seen by others, while
867modifications by one process are not seen by other processes.
868
869The "ithreads" work exactly like that: when creating a new ithreads
870process, all state is copied (memory is copied physically, files and code
871is copied logically). Afterwards, it isolates all modifications. On UNIX,
872the same behaviour can be achieved by using operating system processes,
873except that UNIX typically uses hardware built into the system to do this
874efficiently, while the windows process emulation emulates this hardware in
875software (rather efficiently, but of course it is still much slower than
876dedicated hardware).
877
878As mentioned before, loading code, modifying code, modifying data
879structures and so on is only visible in the ithreads process doing the
880modification, not in other ithread processes within the same OS process.
881
882This is why "ithreads" do not implement threads for perl at all, only
883processes. What makes it so bad is that on non-windows platforms, you can
884actually take advantage of custom hardware for this purpose (as evidenced
885by the forks module, which gives you the (i-) threads API, just much
886faster).
887
888Sharing data is in the i-threads model is done by transfering data
889structures between threads using copying semantics, which is very slow -
890shared data simply does not exist. Benchmarks using i-threads which are
891communication-intensive show extremely bad behaviour with i-threads (in
892fact, so bad that Coro, which cannot take direct advantage of multiple
893CPUs, is often orders of magnitude faster because it shares data using
894real threads, refer to my talk for details).
895
896As summary, i-threads *use* threads to implement processes, while
897the compatible forks module *uses* processes to emulate, uhm,
898processes. I-threads slow down every perl program when enabled, and
899outside of windows, serve no (or little) practical purpose, but
900disadvantages every single-threaded Perl program.
901
902This is the reason that I try to avoid the name "ithreads", as it is
903misleading as it implies that it implements some kind of thread model for
904perl, and prefer the name "windows process emulation", which describes the
905actual use and behaviour of it much better.
568 906
569=head1 SEE ALSO 907=head1 SEE ALSO
570 908
909Event-Loop integration: L<Coro::AnyEvent>, L<Coro::EV>, L<Coro::Event>.
910
911Debugging: L<Coro::Debug>.
912
571Support/Utility: L<Coro::Cont>, L<Coro::Specific>, L<Coro::State>, L<Coro::Util>. 913Support/Utility: L<Coro::Specific>, L<Coro::Util>.
572 914
573Locking/IPC: L<Coro::Signal>, L<Coro::Channel>, L<Coro::Semaphore>, L<Coro::SemaphoreSet>, L<Coro::RWLock>. 915Locking and IPC: L<Coro::Signal>, L<Coro::Channel>, L<Coro::Semaphore>,
916L<Coro::SemaphoreSet>, L<Coro::RWLock>.
574 917
575Event/IO: L<Coro::Timer>, L<Coro::Event>, L<Coro::Handle>, L<Coro::Socket>, L<Coro::Select>. 918I/O and Timers: L<Coro::Timer>, L<Coro::Handle>, L<Coro::Socket>, L<Coro::AIO>.
576 919
577Embedding: L<Coro:MakeMaker> 920Compatibility with other modules: L<Coro::LWP> (but see also L<AnyEvent::HTTP> for
921a better-working alternative), L<Coro::BDB>, L<Coro::Storable>,
922L<Coro::Select>.
923
924XS API: L<Coro::MakeMaker>.
925
926Low level Configuration, Thread Environment, Continuations: L<Coro::State>.
578 927
579=head1 AUTHOR 928=head1 AUTHOR
580 929
581 Marc Lehmann <schmorp@schmorp.de> 930 Marc Lehmann <schmorp@schmorp.de>
582 http://home.schmorp.de/ 931 http://home.schmorp.de/

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines