ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/cvsroot/Coro/Coro.pm
(Generate patch)

Comparing cvsroot/Coro/Coro.pm (file contents):
Revision 1.248 by root, Mon Dec 15 15:03:31 2008 UTC vs.
Revision 1.280 by root, Thu Nov 11 15:07:16 2010 UTC

40points in your program, so locking and parallel access are rarely an 40points in your program, so locking and parallel access are rarely an
41issue, making thread programming much safer and easier than using other 41issue, making thread programming much safer and easier than using other
42thread models. 42thread models.
43 43
44Unlike the so-called "Perl threads" (which are not actually real threads 44Unlike the so-called "Perl threads" (which are not actually real threads
45but only the windows process emulation ported to unix), Coro provides a 45but only the windows process emulation (see section of same name for more
46details) ported to unix, and as such act as processes), Coro provides
46full shared address space, which makes communication between threads 47a full shared address space, which makes communication between threads
47very easy. And threads are fast, too: disabling the Windows process 48very easy. And Coro's threads are fast, too: disabling the Windows
48emulation code in your perl and using Coro can easily result in a two to 49process emulation code in your perl and using Coro can easily result in
49four times speed increase for your programs. 50a two to four times speed increase for your programs. A parallel matrix
51multiplication benchmark runs over 300 times faster on a single core than
52perl's pseudo-threads on a quad core using all four cores.
50 53
51Coro achieves that by supporting multiple running interpreters that share 54Coro achieves that by supporting multiple running interpreters that share
52data, which is especially useful to code pseudo-parallel processes and 55data, which is especially useful to code pseudo-parallel processes and
53for event-based programming, such as multiple HTTP-GET requests running 56for event-based programming, such as multiple HTTP-GET requests running
54concurrently. See L<Coro::AnyEvent> to learn more on how to integrate Coro 57concurrently. See L<Coro::AnyEvent> to learn more on how to integrate Coro
55into an event-based environment. 58into an event-based environment.
56 59
57In this module, a thread is defined as "callchain + lexical variables + 60In this module, a thread is defined as "callchain + lexical variables +
58@_ + $_ + $@ + $/ + C stack), that is, a thread has its own callchain, 61some package variables + C stack), that is, a thread has its own callchain,
59its own set of lexicals and its own set of perls most important global 62its own set of lexicals and its own set of perls most important global
60variables (see L<Coro::State> for more configuration and background info). 63variables (see L<Coro::State> for more configuration and background info).
61 64
62See also the C<SEE ALSO> section at the end of this document - the Coro 65See also the C<SEE ALSO> section at the end of this document - the Coro
63module family is quite large. 66module family is quite large.
64 67
65=cut 68=cut
66 69
67package Coro; 70package Coro;
68 71
69use strict qw(vars subs); 72use common::sense;
70no warnings "uninitialized"; 73
74use Carp ();
71 75
72use Guard (); 76use Guard ();
73 77
74use Coro::State; 78use Coro::State;
75 79
77 81
78our $idle; # idle handler 82our $idle; # idle handler
79our $main; # main coro 83our $main; # main coro
80our $current; # current coro 84our $current; # current coro
81 85
82our $VERSION = 5.13; 86our $VERSION = 5.25;
83 87
84our @EXPORT = qw(async async_pool cede schedule terminate current unblock_sub); 88our @EXPORT = qw(async async_pool cede schedule terminate current unblock_sub rouse_cb rouse_wait);
85our %EXPORT_TAGS = ( 89our %EXPORT_TAGS = (
86 prio => [qw(PRIO_MAX PRIO_HIGH PRIO_NORMAL PRIO_LOW PRIO_IDLE PRIO_MIN)], 90 prio => [qw(PRIO_MAX PRIO_HIGH PRIO_NORMAL PRIO_LOW PRIO_IDLE PRIO_MIN)],
87); 91);
88our @EXPORT_OK = (@{$EXPORT_TAGS{prio}}, qw(nready)); 92our @EXPORT_OK = (@{$EXPORT_TAGS{prio}}, qw(nready));
89 93
120 124
121This variable is mainly useful to integrate Coro into event loops. It is 125This variable is mainly useful to integrate Coro into event loops. It is
122usually better to rely on L<Coro::AnyEvent> or L<Coro::EV>, as this is 126usually better to rely on L<Coro::AnyEvent> or L<Coro::EV>, as this is
123pretty low-level functionality. 127pretty low-level functionality.
124 128
125This variable stores either a Coro object or a callback. 129This variable stores a Coro object that is put into the ready queue when
130there are no other ready threads (without invoking any ready hooks).
126 131
127If it is a callback, the it is called whenever the scheduler finds no 132The default implementation dies with "FATAL: deadlock detected.", followed
128ready coros to run. The default implementation prints "FATAL: 133by a thread listing, because the program has no other way to continue.
129deadlock detected" and exits, because the program has no other way to
130continue.
131
132If it is a coro object, then this object will be readied (without
133invoking any ready hooks, however) when the scheduler finds no other ready
134coros to run.
135 134
136This hook is overwritten by modules such as C<Coro::EV> and 135This hook is overwritten by modules such as C<Coro::EV> and
137C<Coro::AnyEvent> to wait on an external event that hopefully wake up a 136C<Coro::AnyEvent> to wait on an external event that hopefully wake up a
138coro so the scheduler can run it. 137coro so the scheduler can run it.
139 138
140Note that the callback I<must not>, under any circumstances, block
141the current coro. Normally, this is achieved by having an "idle
142coro" that calls the event loop and then blocks again, and then
143readying that coro in the idle handler, or by simply placing the idle
144coro in this variable.
145
146See L<Coro::Event> or L<Coro::AnyEvent> for examples of using this 139See L<Coro::EV> or L<Coro::AnyEvent> for examples of using this technique.
147technique.
148 140
149Please note that if your callback recursively invokes perl (e.g. for event
150handlers), then it must be prepared to be called recursively itself.
151
152=cut 141=cut
153 142
154$idle = sub { 143# ||= because other modules could have provided their own by now
155 require Carp; 144$idle ||= new Coro sub {
156 Carp::croak ("FATAL: deadlock detected"); 145 require Coro::Debug;
146 die "FATAL: deadlock detected.\n"
147 . Coro::Debug::ps_listing ();
157}; 148};
158 149
159# this coro is necessary because a coro 150# this coro is necessary because a coro
160# cannot destroy itself. 151# cannot destroy itself.
161our @destroy; 152our @destroy;
203Example: Create a new coro that just prints its arguments. 194Example: Create a new coro that just prints its arguments.
204 195
205 async { 196 async {
206 print "@_\n"; 197 print "@_\n";
207 } 1,2,3,4; 198 } 1,2,3,4;
208
209=cut
210
211sub async(&@) {
212 my $coro = new Coro @_;
213 $coro->ready;
214 $coro
215}
216 199
217=item async_pool { ... } [@args...] 200=item async_pool { ... } [@args...]
218 201
219Similar to C<async>, but uses a coro pool, so you should not call 202Similar to C<async>, but uses a coro pool, so you should not call
220terminate or join on it (although you are allowed to), and you get a 203terminate or join on it (although you are allowed to), and you get a
277=item schedule 260=item schedule
278 261
279Calls the scheduler. The scheduler will find the next coro that is 262Calls the scheduler. The scheduler will find the next coro that is
280to be run from the ready queue and switches to it. The next coro 263to be run from the ready queue and switches to it. The next coro
281to be run is simply the one with the highest priority that is longest 264to be run is simply the one with the highest priority that is longest
282in its ready queue. If there is no coro ready, it will clal the 265in its ready queue. If there is no coro ready, it will call the
283C<$Coro::idle> hook. 266C<$Coro::idle> hook.
284 267
285Please note that the current coro will I<not> be put into the ready 268Please note that the current coro will I<not> be put into the ready
286queue, so calling this function usually means you will never be called 269queue, so calling this function usually means you will never be called
287again unless something else (e.g. an event handler) calls C<< ->ready >>, 270again unless something else (e.g. an event handler) calls C<< ->ready >>,
335 318
336These functions implement the same concept as C<dynamic-wind> in scheme 319These functions implement the same concept as C<dynamic-wind> in scheme
337does, and are useful when you want to localise some resource to a specific 320does, and are useful when you want to localise some resource to a specific
338coro. 321coro.
339 322
340They slow down coro switching considerably for coros that use 323They slow down thread switching considerably for coros that use them
341them (But coro switching is still reasonably fast if the handlers are 324(about 40% for a BLOCK with a single assignment, so thread switching is
342fast). 325still reasonably fast if the handlers are fast).
343 326
344These functions are best understood by an example: The following function 327These functions are best understood by an example: The following function
345will change the current timezone to "Antarctica/South_Pole", which 328will change the current timezone to "Antarctica/South_Pole", which
346requires a call to C<tzset>, but by using C<on_enter> and C<on_leave>, 329requires a call to C<tzset>, but by using C<on_enter> and C<on_leave>,
347which remember/change the current timezone and restore the previous 330which remember/change the current timezone and restore the previous
348value, respectively, the timezone is only changes for the coro that 331value, respectively, the timezone is only changed for the coro that
349installed those handlers. 332installed those handlers.
350 333
351 use POSIX qw(tzset); 334 use POSIX qw(tzset);
352 335
353 async { 336 async {
370 }; 353 };
371 354
372This can be used to localise about any resource (locale, uid, current 355This can be used to localise about any resource (locale, uid, current
373working directory etc.) to a block, despite the existance of other 356working directory etc.) to a block, despite the existance of other
374coros. 357coros.
358
359Another interesting example implements time-sliced multitasking using
360interval timers (this could obviously be optimised, but does the job):
361
362 # "timeslice" the given block
363 sub timeslice(&) {
364 use Time::HiRes ();
365
366 Coro::on_enter {
367 # on entering the thread, we set an VTALRM handler to cede
368 $SIG{VTALRM} = sub { cede };
369 # and then start the interval timer
370 Time::HiRes::setitimer &Time::HiRes::ITIMER_VIRTUAL, 0.01, 0.01;
371 };
372 Coro::on_leave {
373 # on leaving the thread, we stop the interval timer again
374 Time::HiRes::setitimer &Time::HiRes::ITIMER_VIRTUAL, 0, 0;
375 };
376
377 &{+shift};
378 }
379
380 # use like this:
381 timeslice {
382 # The following is an endless loop that would normally
383 # monopolise the process. Since it runs in a timesliced
384 # environment, it will regularly cede to other threads.
385 while () { }
386 };
387
375 388
376=item killall 389=item killall
377 390
378Kills/terminates/cancels all coros except the currently running one. 391Kills/terminates/cancels all coros except the currently running one.
379 392
423the ready queue, do nothing and return false. 436the ready queue, do nothing and return false.
424 437
425This ensures that the scheduler will resume this coro automatically 438This ensures that the scheduler will resume this coro automatically
426once all the coro of higher priority and all coro of the same 439once all the coro of higher priority and all coro of the same
427priority that were put into the ready queue earlier have been resumed. 440priority that were put into the ready queue earlier have been resumed.
441
442=item $coro->suspend
443
444Suspends the specified coro. A suspended coro works just like any other
445coro, except that the scheduler will not select a suspended coro for
446execution.
447
448Suspending a coro can be useful when you want to keep the coro from
449running, but you don't want to destroy it, or when you want to temporarily
450freeze a coro (e.g. for debugging) to resume it later.
451
452A scenario for the former would be to suspend all (other) coros after a
453fork and keep them alive, so their destructors aren't called, but new
454coros can be created.
455
456=item $coro->resume
457
458If the specified coro was suspended, it will be resumed. Note that when
459the coro was in the ready queue when it was suspended, it might have been
460unreadied by the scheduler, so an activation might have been lost.
461
462To avoid this, it is best to put a suspended coro into the ready queue
463unconditionally, as every synchronisation mechanism must protect itself
464against spurious wakeups, and the one in the Coro family certainly do
465that.
428 466
429=item $is_ready = $coro->is_ready 467=item $is_ready = $coro->is_ready
430 468
431Returns true iff the Coro object is in the ready queue. Unless the Coro 469Returns true iff the Coro object is in the ready queue. Unless the Coro
432object gets destroyed, it will eventually be scheduled by the scheduler. 470object gets destroyed, it will eventually be scheduled by the scheduler.
575Sets (or gets in case the argument is missing) the description for this 613Sets (or gets in case the argument is missing) the description for this
576coro. This is just a free-form string you can associate with a 614coro. This is just a free-form string you can associate with a
577coro. 615coro.
578 616
579This method simply sets the C<< $coro->{desc} >> member to the given 617This method simply sets the C<< $coro->{desc} >> member to the given
580string. You can modify this member directly if you wish. 618string. You can modify this member directly if you wish, and in fact, this
619is often preferred to indicate major processing states that cna then be
620seen for example in a L<Coro::Debug> session:
621
622 sub my_long_function {
623 local $Coro::current->{desc} = "now in my_long_function";
624 ...
625 $Coro::current->{desc} = "my_long_function: phase 1";
626 ...
627 $Coro::current->{desc} = "my_long_function: phase 2";
628 ...
629 }
581 630
582=cut 631=cut
583 632
584sub desc { 633sub desc {
585 my $old = $_[0]{desc}; 634 my $old = $_[0]{desc};
627The reason this function exists is that many event libraries (such as the 676The reason this function exists is that many event libraries (such as the
628venerable L<Event|Event> module) are not thread-safe (a weaker form 677venerable L<Event|Event> module) are not thread-safe (a weaker form
629of reentrancy). This means you must not block within event callbacks, 678of reentrancy). This means you must not block within event callbacks,
630otherwise you might suffer from crashes or worse. The only event library 679otherwise you might suffer from crashes or worse. The only event library
631currently known that is safe to use without C<unblock_sub> is L<EV>. 680currently known that is safe to use without C<unblock_sub> is L<EV>.
681
682Coro will try to catch you when you block in the event loop
683("FATAL:$Coro::IDLE blocked itself"), but this is just best effort and
684only works when you do not run your own event loop.
632 685
633This function allows your callbacks to block by executing them in another 686This function allows your callbacks to block by executing them in another
634coro where it is safe to block. One example where blocking is handy 687coro where it is safe to block. One example where blocking is handy
635is when you use the L<Coro::AIO|Coro::AIO> functions to save results to 688is when you use the L<Coro::AIO|Coro::AIO> functions to save results to
636disk, for example. 689disk, for example.
678 unshift @unblock_queue, [$cb, @_]; 731 unshift @unblock_queue, [$cb, @_];
679 $unblock_scheduler->ready; 732 $unblock_scheduler->ready;
680 } 733 }
681} 734}
682 735
683=item $cb = Coro::rouse_cb 736=item $cb = rouse_cb
684 737
685Create and return a "rouse callback". That's a code reference that, 738Create and return a "rouse callback". That's a code reference that,
686when called, will remember a copy of its arguments and notify the owner 739when called, will remember a copy of its arguments and notify the owner
687coro of the callback. 740coro of the callback.
688 741
689See the next function. 742See the next function.
690 743
691=item @args = Coro::rouse_wait [$cb] 744=item @args = rouse_wait [$cb]
692 745
693Wait for the specified rouse callback (or the last one that was created in 746Wait for the specified rouse callback (or the last one that was created in
694this coro). 747this coro).
695 748
696As soon as the callback is invoked (or when the callback was invoked 749As soon as the callback is invoked (or when the callback was invoked
697before C<rouse_wait>), it will return the arguments originally passed to 750before C<rouse_wait>), it will return the arguments originally passed to
698the rouse callback. 751the rouse callback. In scalar context, that means you get the I<last>
752argument, just as if C<rouse_wait> had a C<return ($a1, $a2, $a3...)>
753statement at the end.
699 754
700See the section B<HOW TO WAIT FOR A CALLBACK> for an actual usage example. 755See the section B<HOW TO WAIT FOR A CALLBACK> for an actual usage example.
701 756
702=back 757=back
703 758
791the windows process emulation enabled under unix roughly halves perl 846the windows process emulation enabled under unix roughly halves perl
792performance, even when not used. 847performance, even when not used.
793 848
794=item coro switching is not signal safe 849=item coro switching is not signal safe
795 850
796You must not switch to another coro from within a signal handler 851You must not switch to another coro from within a signal handler (only
797(only relevant with %SIG - most event libraries provide safe signals). 852relevant with %SIG - most event libraries provide safe signals), I<unless>
853you are sure you are not interrupting a Coro function.
798 854
799That means you I<MUST NOT> call any function that might "block" the 855That means you I<MUST NOT> call any function that might "block" the
800current coro - C<cede>, C<schedule> C<< Coro::Semaphore->down >> or 856current coro - C<cede>, C<schedule> C<< Coro::Semaphore->down >> or
801anything that calls those. Everything else, including calling C<ready>, 857anything that calls those. Everything else, including calling C<ready>,
802works. 858works.
803 859
804=back 860=back
805 861
806 862
863=head1 WINDOWS PROCESS EMULATION
864
865A great many people seem to be confused about ithreads (for example, Chip
866Salzenberg called me unintelligent, incapable, stupid and gullible,
867while in the same mail making rather confused statements about perl
868ithreads (for example, that memory or files would be shared), showing his
869lack of understanding of this area - if it is hard to understand for Chip,
870it is probably not obvious to everybody).
871
872What follows is an ultra-condensed version of my talk about threads in
873scripting languages given onthe perl workshop 2009:
874
875The so-called "ithreads" were originally implemented for two reasons:
876first, to (badly) emulate unix processes on native win32 perls, and
877secondly, to replace the older, real thread model ("5.005-threads").
878
879It does that by using threads instead of OS processes. The difference
880between processes and threads is that threads share memory (and other
881state, such as files) between threads within a single process, while
882processes do not share anything (at least not semantically). That
883means that modifications done by one thread are seen by others, while
884modifications by one process are not seen by other processes.
885
886The "ithreads" work exactly like that: when creating a new ithreads
887process, all state is copied (memory is copied physically, files and code
888is copied logically). Afterwards, it isolates all modifications. On UNIX,
889the same behaviour can be achieved by using operating system processes,
890except that UNIX typically uses hardware built into the system to do this
891efficiently, while the windows process emulation emulates this hardware in
892software (rather efficiently, but of course it is still much slower than
893dedicated hardware).
894
895As mentioned before, loading code, modifying code, modifying data
896structures and so on is only visible in the ithreads process doing the
897modification, not in other ithread processes within the same OS process.
898
899This is why "ithreads" do not implement threads for perl at all, only
900processes. What makes it so bad is that on non-windows platforms, you can
901actually take advantage of custom hardware for this purpose (as evidenced
902by the forks module, which gives you the (i-) threads API, just much
903faster).
904
905Sharing data is in the i-threads model is done by transfering data
906structures between threads using copying semantics, which is very slow -
907shared data simply does not exist. Benchmarks using i-threads which are
908communication-intensive show extremely bad behaviour with i-threads (in
909fact, so bad that Coro, which cannot take direct advantage of multiple
910CPUs, is often orders of magnitude faster because it shares data using
911real threads, refer to my talk for details).
912
913As summary, i-threads *use* threads to implement processes, while
914the compatible forks module *uses* processes to emulate, uhm,
915processes. I-threads slow down every perl program when enabled, and
916outside of windows, serve no (or little) practical purpose, but
917disadvantages every single-threaded Perl program.
918
919This is the reason that I try to avoid the name "ithreads", as it is
920misleading as it implies that it implements some kind of thread model for
921perl, and prefer the name "windows process emulation", which describes the
922actual use and behaviour of it much better.
923
807=head1 SEE ALSO 924=head1 SEE ALSO
808 925
809Event-Loop integration: L<Coro::AnyEvent>, L<Coro::EV>, L<Coro::Event>. 926Event-Loop integration: L<Coro::AnyEvent>, L<Coro::EV>, L<Coro::Event>.
810 927
811Debugging: L<Coro::Debug>. 928Debugging: L<Coro::Debug>.

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines