ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/cvsroot/Coro/Coro.pm
(Generate patch)

Comparing cvsroot/Coro/Coro.pm (file contents):
Revision 1.248 by root, Mon Dec 15 15:03:31 2008 UTC vs.
Revision 1.278 by root, Sat Oct 23 09:28:48 2010 UTC

40points in your program, so locking and parallel access are rarely an 40points in your program, so locking and parallel access are rarely an
41issue, making thread programming much safer and easier than using other 41issue, making thread programming much safer and easier than using other
42thread models. 42thread models.
43 43
44Unlike the so-called "Perl threads" (which are not actually real threads 44Unlike the so-called "Perl threads" (which are not actually real threads
45but only the windows process emulation ported to unix), Coro provides a 45but only the windows process emulation (see section of same name for more
46details) ported to unix, and as such act as processes), Coro provides
46full shared address space, which makes communication between threads 47a full shared address space, which makes communication between threads
47very easy. And threads are fast, too: disabling the Windows process 48very easy. And Coro's threads are fast, too: disabling the Windows
48emulation code in your perl and using Coro can easily result in a two to 49process emulation code in your perl and using Coro can easily result in
49four times speed increase for your programs. 50a two to four times speed increase for your programs. A parallel matrix
51multiplication benchmark runs over 300 times faster on a single core than
52perl's pseudo-threads on a quad core using all four cores.
50 53
51Coro achieves that by supporting multiple running interpreters that share 54Coro achieves that by supporting multiple running interpreters that share
52data, which is especially useful to code pseudo-parallel processes and 55data, which is especially useful to code pseudo-parallel processes and
53for event-based programming, such as multiple HTTP-GET requests running 56for event-based programming, such as multiple HTTP-GET requests running
54concurrently. See L<Coro::AnyEvent> to learn more on how to integrate Coro 57concurrently. See L<Coro::AnyEvent> to learn more on how to integrate Coro
55into an event-based environment. 58into an event-based environment.
56 59
57In this module, a thread is defined as "callchain + lexical variables + 60In this module, a thread is defined as "callchain + lexical variables +
58@_ + $_ + $@ + $/ + C stack), that is, a thread has its own callchain, 61some package variables + C stack), that is, a thread has its own callchain,
59its own set of lexicals and its own set of perls most important global 62its own set of lexicals and its own set of perls most important global
60variables (see L<Coro::State> for more configuration and background info). 63variables (see L<Coro::State> for more configuration and background info).
61 64
62See also the C<SEE ALSO> section at the end of this document - the Coro 65See also the C<SEE ALSO> section at the end of this document - the Coro
63module family is quite large. 66module family is quite large.
64 67
65=cut 68=cut
66 69
67package Coro; 70package Coro;
68 71
69use strict qw(vars subs); 72use common::sense;
70no warnings "uninitialized"; 73
74use Carp ();
71 75
72use Guard (); 76use Guard ();
73 77
74use Coro::State; 78use Coro::State;
75 79
77 81
78our $idle; # idle handler 82our $idle; # idle handler
79our $main; # main coro 83our $main; # main coro
80our $current; # current coro 84our $current; # current coro
81 85
82our $VERSION = 5.13; 86our $VERSION = 5.24;
83 87
84our @EXPORT = qw(async async_pool cede schedule terminate current unblock_sub); 88our @EXPORT = qw(async async_pool cede schedule terminate current unblock_sub rouse_cb rouse_wait);
85our %EXPORT_TAGS = ( 89our %EXPORT_TAGS = (
86 prio => [qw(PRIO_MAX PRIO_HIGH PRIO_NORMAL PRIO_LOW PRIO_IDLE PRIO_MIN)], 90 prio => [qw(PRIO_MAX PRIO_HIGH PRIO_NORMAL PRIO_LOW PRIO_IDLE PRIO_MIN)],
87); 91);
88our @EXPORT_OK = (@{$EXPORT_TAGS{prio}}, qw(nready)); 92our @EXPORT_OK = (@{$EXPORT_TAGS{prio}}, qw(nready));
89 93
120 124
121This variable is mainly useful to integrate Coro into event loops. It is 125This variable is mainly useful to integrate Coro into event loops. It is
122usually better to rely on L<Coro::AnyEvent> or L<Coro::EV>, as this is 126usually better to rely on L<Coro::AnyEvent> or L<Coro::EV>, as this is
123pretty low-level functionality. 127pretty low-level functionality.
124 128
125This variable stores either a Coro object or a callback. 129This variable stores a Coro object that is put into the ready queue when
130there are no other ready threads (without invoking any ready hooks).
126 131
127If it is a callback, the it is called whenever the scheduler finds no 132The default implementation dies with "FATAL: deadlock detected.", followed
128ready coros to run. The default implementation prints "FATAL: 133by a thread listing, because the program has no other way to continue.
129deadlock detected" and exits, because the program has no other way to
130continue.
131
132If it is a coro object, then this object will be readied (without
133invoking any ready hooks, however) when the scheduler finds no other ready
134coros to run.
135 134
136This hook is overwritten by modules such as C<Coro::EV> and 135This hook is overwritten by modules such as C<Coro::EV> and
137C<Coro::AnyEvent> to wait on an external event that hopefully wake up a 136C<Coro::AnyEvent> to wait on an external event that hopefully wake up a
138coro so the scheduler can run it. 137coro so the scheduler can run it.
139 138
140Note that the callback I<must not>, under any circumstances, block
141the current coro. Normally, this is achieved by having an "idle
142coro" that calls the event loop and then blocks again, and then
143readying that coro in the idle handler, or by simply placing the idle
144coro in this variable.
145
146See L<Coro::Event> or L<Coro::AnyEvent> for examples of using this 139See L<Coro::EV> or L<Coro::AnyEvent> for examples of using this technique.
147technique.
148 140
149Please note that if your callback recursively invokes perl (e.g. for event
150handlers), then it must be prepared to be called recursively itself.
151
152=cut 141=cut
153 142
154$idle = sub { 143# ||= because other modules could have provided their own by now
155 require Carp; 144$idle ||= new Coro sub {
156 Carp::croak ("FATAL: deadlock detected"); 145 require Coro::Debug;
146 die "FATAL: deadlock detected.\n"
147 . Coro::Debug::ps_listing ();
157}; 148};
158 149
159# this coro is necessary because a coro 150# this coro is necessary because a coro
160# cannot destroy itself. 151# cannot destroy itself.
161our @destroy; 152our @destroy;
203Example: Create a new coro that just prints its arguments. 194Example: Create a new coro that just prints its arguments.
204 195
205 async { 196 async {
206 print "@_\n"; 197 print "@_\n";
207 } 1,2,3,4; 198 } 1,2,3,4;
208
209=cut
210
211sub async(&@) {
212 my $coro = new Coro @_;
213 $coro->ready;
214 $coro
215}
216 199
217=item async_pool { ... } [@args...] 200=item async_pool { ... } [@args...]
218 201
219Similar to C<async>, but uses a coro pool, so you should not call 202Similar to C<async>, but uses a coro pool, so you should not call
220terminate or join on it (although you are allowed to), and you get a 203terminate or join on it (although you are allowed to), and you get a
277=item schedule 260=item schedule
278 261
279Calls the scheduler. The scheduler will find the next coro that is 262Calls the scheduler. The scheduler will find the next coro that is
280to be run from the ready queue and switches to it. The next coro 263to be run from the ready queue and switches to it. The next coro
281to be run is simply the one with the highest priority that is longest 264to be run is simply the one with the highest priority that is longest
282in its ready queue. If there is no coro ready, it will clal the 265in its ready queue. If there is no coro ready, it will call the
283C<$Coro::idle> hook. 266C<$Coro::idle> hook.
284 267
285Please note that the current coro will I<not> be put into the ready 268Please note that the current coro will I<not> be put into the ready
286queue, so calling this function usually means you will never be called 269queue, so calling this function usually means you will never be called
287again unless something else (e.g. an event handler) calls C<< ->ready >>, 270again unless something else (e.g. an event handler) calls C<< ->ready >>,
335 318
336These functions implement the same concept as C<dynamic-wind> in scheme 319These functions implement the same concept as C<dynamic-wind> in scheme
337does, and are useful when you want to localise some resource to a specific 320does, and are useful when you want to localise some resource to a specific
338coro. 321coro.
339 322
340They slow down coro switching considerably for coros that use 323They slow down thread switching considerably for coros that use them
341them (But coro switching is still reasonably fast if the handlers are 324(about 40% for a BLOCK with a single assignment, so thread switching is
342fast). 325still reasonably fast if the handlers are fast).
343 326
344These functions are best understood by an example: The following function 327These functions are best understood by an example: The following function
345will change the current timezone to "Antarctica/South_Pole", which 328will change the current timezone to "Antarctica/South_Pole", which
346requires a call to C<tzset>, but by using C<on_enter> and C<on_leave>, 329requires a call to C<tzset>, but by using C<on_enter> and C<on_leave>,
347which remember/change the current timezone and restore the previous 330which remember/change the current timezone and restore the previous
348value, respectively, the timezone is only changes for the coro that 331value, respectively, the timezone is only changed for the coro that
349installed those handlers. 332installed those handlers.
350 333
351 use POSIX qw(tzset); 334 use POSIX qw(tzset);
352 335
353 async { 336 async {
370 }; 353 };
371 354
372This can be used to localise about any resource (locale, uid, current 355This can be used to localise about any resource (locale, uid, current
373working directory etc.) to a block, despite the existance of other 356working directory etc.) to a block, despite the existance of other
374coros. 357coros.
358
359Another interesting example implements time-sliced multitasking using
360interval timers (this could obviously be optimised, but does the job):
361
362 # "timeslice" the given block
363 sub timeslice(&) {
364 use Time::HiRes ();
365
366 Coro::on_enter {
367 # on entering the thread, we set an VTALRM handler to cede
368 $SIG{VTALRM} = sub { cede };
369 # and then start the interval timer
370 Time::HiRes::setitimer &Time::HiRes::ITIMER_VIRTUAL, 0.01, 0.01;
371 };
372 Coro::on_leave {
373 # on leaving the thread, we stop the interval timer again
374 Time::HiRes::setitimer &Time::HiRes::ITIMER_VIRTUAL, 0, 0;
375 };
376
377 &{+shift};
378 }
379
380 # use like this:
381 timeslice {
382 # The following is an endless loop that would normally
383 # monopolise the process. Since it runs in a timesliced
384 # environment, it will regularly cede to other threads.
385 while () { }
386 };
387
375 388
376=item killall 389=item killall
377 390
378Kills/terminates/cancels all coros except the currently running one. 391Kills/terminates/cancels all coros except the currently running one.
379 392
423the ready queue, do nothing and return false. 436the ready queue, do nothing and return false.
424 437
425This ensures that the scheduler will resume this coro automatically 438This ensures that the scheduler will resume this coro automatically
426once all the coro of higher priority and all coro of the same 439once all the coro of higher priority and all coro of the same
427priority that were put into the ready queue earlier have been resumed. 440priority that were put into the ready queue earlier have been resumed.
441
442=item $coro->suspend
443
444Suspends the specified coro. A suspended coro works just like any other
445coro, except that the scheduler will not select a suspended coro for
446execution.
447
448Suspending a coro can be useful when you want to keep the coro from
449running, but you don't want to destroy it, or when you want to temporarily
450freeze a coro (e.g. for debugging) to resume it later.
451
452A scenario for the former would be to suspend all (other) coros after a
453fork and keep them alive, so their destructors aren't called, but new
454coros can be created.
455
456=item $coro->resume
457
458If the specified coro was suspended, it will be resumed. Note that when
459the coro was in the ready queue when it was suspended, it might have been
460unreadied by the scheduler, so an activation might have been lost.
461
462To avoid this, it is best to put a suspended coro into the ready queue
463unconditionally, as every synchronisation mechanism must protect itself
464against spurious wakeups, and the one in the Coro family certainly do
465that.
428 466
429=item $is_ready = $coro->is_ready 467=item $is_ready = $coro->is_ready
430 468
431Returns true iff the Coro object is in the ready queue. Unless the Coro 469Returns true iff the Coro object is in the ready queue. Unless the Coro
432object gets destroyed, it will eventually be scheduled by the scheduler. 470object gets destroyed, it will eventually be scheduled by the scheduler.
627The reason this function exists is that many event libraries (such as the 665The reason this function exists is that many event libraries (such as the
628venerable L<Event|Event> module) are not thread-safe (a weaker form 666venerable L<Event|Event> module) are not thread-safe (a weaker form
629of reentrancy). This means you must not block within event callbacks, 667of reentrancy). This means you must not block within event callbacks,
630otherwise you might suffer from crashes or worse. The only event library 668otherwise you might suffer from crashes or worse. The only event library
631currently known that is safe to use without C<unblock_sub> is L<EV>. 669currently known that is safe to use without C<unblock_sub> is L<EV>.
670
671Coro will try to catch you when you block in the event loop
672("FATAL:$Coro::IDLE blocked itself"), but this is just best effort and
673only works when you do not run your own event loop.
632 674
633This function allows your callbacks to block by executing them in another 675This function allows your callbacks to block by executing them in another
634coro where it is safe to block. One example where blocking is handy 676coro where it is safe to block. One example where blocking is handy
635is when you use the L<Coro::AIO|Coro::AIO> functions to save results to 677is when you use the L<Coro::AIO|Coro::AIO> functions to save results to
636disk, for example. 678disk, for example.
678 unshift @unblock_queue, [$cb, @_]; 720 unshift @unblock_queue, [$cb, @_];
679 $unblock_scheduler->ready; 721 $unblock_scheduler->ready;
680 } 722 }
681} 723}
682 724
683=item $cb = Coro::rouse_cb 725=item $cb = rouse_cb
684 726
685Create and return a "rouse callback". That's a code reference that, 727Create and return a "rouse callback". That's a code reference that,
686when called, will remember a copy of its arguments and notify the owner 728when called, will remember a copy of its arguments and notify the owner
687coro of the callback. 729coro of the callback.
688 730
689See the next function. 731See the next function.
690 732
691=item @args = Coro::rouse_wait [$cb] 733=item @args = rouse_wait [$cb]
692 734
693Wait for the specified rouse callback (or the last one that was created in 735Wait for the specified rouse callback (or the last one that was created in
694this coro). 736this coro).
695 737
696As soon as the callback is invoked (or when the callback was invoked 738As soon as the callback is invoked (or when the callback was invoked
697before C<rouse_wait>), it will return the arguments originally passed to 739before C<rouse_wait>), it will return the arguments originally passed to
698the rouse callback. 740the rouse callback. In scalar context, that means you get the I<last>
741argument, just as if C<rouse_wait> had a C<return ($a1, $a2, $a3...)>
742statement at the end.
699 743
700See the section B<HOW TO WAIT FOR A CALLBACK> for an actual usage example. 744See the section B<HOW TO WAIT FOR A CALLBACK> for an actual usage example.
701 745
702=back 746=back
703 747
791the windows process emulation enabled under unix roughly halves perl 835the windows process emulation enabled under unix roughly halves perl
792performance, even when not used. 836performance, even when not used.
793 837
794=item coro switching is not signal safe 838=item coro switching is not signal safe
795 839
796You must not switch to another coro from within a signal handler 840You must not switch to another coro from within a signal handler (only
797(only relevant with %SIG - most event libraries provide safe signals). 841relevant with %SIG - most event libraries provide safe signals), I<unless>
842you are sure you are not interrupting a Coro function.
798 843
799That means you I<MUST NOT> call any function that might "block" the 844That means you I<MUST NOT> call any function that might "block" the
800current coro - C<cede>, C<schedule> C<< Coro::Semaphore->down >> or 845current coro - C<cede>, C<schedule> C<< Coro::Semaphore->down >> or
801anything that calls those. Everything else, including calling C<ready>, 846anything that calls those. Everything else, including calling C<ready>,
802works. 847works.
803 848
804=back 849=back
805 850
806 851
852=head1 WINDOWS PROCESS EMULATION
853
854A great many people seem to be confused about ithreads (for example, Chip
855Salzenberg called me unintelligent, incapable, stupid and gullible,
856while in the same mail making rather confused statements about perl
857ithreads (for example, that memory or files would be shared), showing his
858lack of understanding of this area - if it is hard to understand for Chip,
859it is probably not obvious to everybody).
860
861What follows is an ultra-condensed version of my talk about threads in
862scripting languages given onthe perl workshop 2009:
863
864The so-called "ithreads" were originally implemented for two reasons:
865first, to (badly) emulate unix processes on native win32 perls, and
866secondly, to replace the older, real thread model ("5.005-threads").
867
868It does that by using threads instead of OS processes. The difference
869between processes and threads is that threads share memory (and other
870state, such as files) between threads within a single process, while
871processes do not share anything (at least not semantically). That
872means that modifications done by one thread are seen by others, while
873modifications by one process are not seen by other processes.
874
875The "ithreads" work exactly like that: when creating a new ithreads
876process, all state is copied (memory is copied physically, files and code
877is copied logically). Afterwards, it isolates all modifications. On UNIX,
878the same behaviour can be achieved by using operating system processes,
879except that UNIX typically uses hardware built into the system to do this
880efficiently, while the windows process emulation emulates this hardware in
881software (rather efficiently, but of course it is still much slower than
882dedicated hardware).
883
884As mentioned before, loading code, modifying code, modifying data
885structures and so on is only visible in the ithreads process doing the
886modification, not in other ithread processes within the same OS process.
887
888This is why "ithreads" do not implement threads for perl at all, only
889processes. What makes it so bad is that on non-windows platforms, you can
890actually take advantage of custom hardware for this purpose (as evidenced
891by the forks module, which gives you the (i-) threads API, just much
892faster).
893
894Sharing data is in the i-threads model is done by transfering data
895structures between threads using copying semantics, which is very slow -
896shared data simply does not exist. Benchmarks using i-threads which are
897communication-intensive show extremely bad behaviour with i-threads (in
898fact, so bad that Coro, which cannot take direct advantage of multiple
899CPUs, is often orders of magnitude faster because it shares data using
900real threads, refer to my talk for details).
901
902As summary, i-threads *use* threads to implement processes, while
903the compatible forks module *uses* processes to emulate, uhm,
904processes. I-threads slow down every perl program when enabled, and
905outside of windows, serve no (or little) practical purpose, but
906disadvantages every single-threaded Perl program.
907
908This is the reason that I try to avoid the name "ithreads", as it is
909misleading as it implies that it implements some kind of thread model for
910perl, and prefer the name "windows process emulation", which describes the
911actual use and behaviour of it much better.
912
807=head1 SEE ALSO 913=head1 SEE ALSO
808 914
809Event-Loop integration: L<Coro::AnyEvent>, L<Coro::EV>, L<Coro::Event>. 915Event-Loop integration: L<Coro::AnyEvent>, L<Coro::EV>, L<Coro::Event>.
810 916
811Debugging: L<Coro::Debug>. 917Debugging: L<Coro::Debug>.

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines