ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/Coro/Coro.pm
(Generate patch)

Comparing Coro/Coro.pm (file contents):
Revision 1.255 by root, Wed Jun 17 21:36:35 2009 UTC vs.
Revision 1.281 by root, Tue Dec 7 17:13:43 2010 UTC

40points in your program, so locking and parallel access are rarely an 40points in your program, so locking and parallel access are rarely an
41issue, making thread programming much safer and easier than using other 41issue, making thread programming much safer and easier than using other
42thread models. 42thread models.
43 43
44Unlike the so-called "Perl threads" (which are not actually real threads 44Unlike the so-called "Perl threads" (which are not actually real threads
45but only the windows process emulation ported to unix, and as such act 45but only the windows process emulation (see section of same name for more
46as processes), Coro provides a full shared address space, which makes 46details) ported to unix, and as such act as processes), Coro provides
47communication between threads very easy. And Coro's threads are fast, 47a full shared address space, which makes communication between threads
48too: disabling the Windows process emulation code in your perl and using 48very easy. And Coro's threads are fast, too: disabling the Windows
49Coro can easily result in a two to four times speed increase for your 49process emulation code in your perl and using Coro can easily result in
50programs. A parallel matrix multiplication benchmark runs over 300 times 50a two to four times speed increase for your programs. A parallel matrix
51faster on a single core than perl's pseudo-threads on a quad core using 51multiplication benchmark runs over 300 times faster on a single core than
52all four cores. 52perl's pseudo-threads on a quad core using all four cores.
53 53
54Coro achieves that by supporting multiple running interpreters that share 54Coro achieves that by supporting multiple running interpreters that share
55data, which is especially useful to code pseudo-parallel processes and 55data, which is especially useful to code pseudo-parallel processes and
56for event-based programming, such as multiple HTTP-GET requests running 56for event-based programming, such as multiple HTTP-GET requests running
57concurrently. See L<Coro::AnyEvent> to learn more on how to integrate Coro 57concurrently. See L<Coro::AnyEvent> to learn more on how to integrate Coro
67 67
68=cut 68=cut
69 69
70package Coro; 70package Coro;
71 71
72use strict qw(vars subs); 72use common::sense;
73no warnings "uninitialized"; 73
74use Carp ();
74 75
75use Guard (); 76use Guard ();
76 77
77use Coro::State; 78use Coro::State;
78 79
80 81
81our $idle; # idle handler 82our $idle; # idle handler
82our $main; # main coro 83our $main; # main coro
83our $current; # current coro 84our $current; # current coro
84 85
85our $VERSION = 5.132; 86our $VERSION = 5.25;
86 87
87our @EXPORT = qw(async async_pool cede schedule terminate current unblock_sub); 88our @EXPORT = qw(async async_pool cede schedule terminate current unblock_sub rouse_cb rouse_wait);
88our %EXPORT_TAGS = ( 89our %EXPORT_TAGS = (
89 prio => [qw(PRIO_MAX PRIO_HIGH PRIO_NORMAL PRIO_LOW PRIO_IDLE PRIO_MIN)], 90 prio => [qw(PRIO_MAX PRIO_HIGH PRIO_NORMAL PRIO_LOW PRIO_IDLE PRIO_MIN)],
90); 91);
91our @EXPORT_OK = (@{$EXPORT_TAGS{prio}}, qw(nready)); 92our @EXPORT_OK = (@{$EXPORT_TAGS{prio}}, qw(nready));
92 93
123 124
124This variable is mainly useful to integrate Coro into event loops. It is 125This variable is mainly useful to integrate Coro into event loops. It is
125usually better to rely on L<Coro::AnyEvent> or L<Coro::EV>, as this is 126usually better to rely on L<Coro::AnyEvent> or L<Coro::EV>, as this is
126pretty low-level functionality. 127pretty low-level functionality.
127 128
128This variable stores either a Coro object or a callback. 129This variable stores a Coro object that is put into the ready queue when
130there are no other ready threads (without invoking any ready hooks).
129 131
130If it is a callback, the it is called whenever the scheduler finds no 132The default implementation dies with "FATAL: deadlock detected.", followed
131ready coros to run. The default implementation prints "FATAL: 133by a thread listing, because the program has no other way to continue.
132deadlock detected" and exits, because the program has no other way to
133continue.
134
135If it is a coro object, then this object will be readied (without
136invoking any ready hooks, however) when the scheduler finds no other ready
137coros to run.
138 134
139This hook is overwritten by modules such as C<Coro::EV> and 135This hook is overwritten by modules such as C<Coro::EV> and
140C<Coro::AnyEvent> to wait on an external event that hopefully wake up a 136C<Coro::AnyEvent> to wait on an external event that hopefully wake up a
141coro so the scheduler can run it. 137coro so the scheduler can run it.
142 138
143Note that the callback I<must not>, under any circumstances, block
144the current coro. Normally, this is achieved by having an "idle
145coro" that calls the event loop and then blocks again, and then
146readying that coro in the idle handler, or by simply placing the idle
147coro in this variable.
148
149See L<Coro::Event> or L<Coro::AnyEvent> for examples of using this 139See L<Coro::EV> or L<Coro::AnyEvent> for examples of using this technique.
150technique.
151 140
152Please note that if your callback recursively invokes perl (e.g. for event
153handlers), then it must be prepared to be called recursively itself.
154
155=cut 141=cut
156 142
157$idle = sub { 143# ||= because other modules could have provided their own by now
158 require Carp; 144$idle ||= new Coro sub {
159 Carp::croak ("FATAL: deadlock detected"); 145 require Coro::Debug;
146 die "FATAL: deadlock detected.\n"
147 . Coro::Debug::ps_listing ();
160}; 148};
161 149
162# this coro is necessary because a coro 150# this coro is necessary because a coro
163# cannot destroy itself. 151# cannot destroy itself.
164our @destroy; 152our @destroy;
206Example: Create a new coro that just prints its arguments. 194Example: Create a new coro that just prints its arguments.
207 195
208 async { 196 async {
209 print "@_\n"; 197 print "@_\n";
210 } 1,2,3,4; 198 } 1,2,3,4;
211
212=cut
213
214sub async(&@) {
215 my $coro = new Coro @_;
216 $coro->ready;
217 $coro
218}
219 199
220=item async_pool { ... } [@args...] 200=item async_pool { ... } [@args...]
221 201
222Similar to C<async>, but uses a coro pool, so you should not call 202Similar to C<async>, but uses a coro pool, so you should not call
223terminate or join on it (although you are allowed to), and you get a 203terminate or join on it (although you are allowed to), and you get a
280=item schedule 260=item schedule
281 261
282Calls the scheduler. The scheduler will find the next coro that is 262Calls the scheduler. The scheduler will find the next coro that is
283to be run from the ready queue and switches to it. The next coro 263to be run from the ready queue and switches to it. The next coro
284to be run is simply the one with the highest priority that is longest 264to be run is simply the one with the highest priority that is longest
285in its ready queue. If there is no coro ready, it will clal the 265in its ready queue. If there is no coro ready, it will call the
286C<$Coro::idle> hook. 266C<$Coro::idle> hook.
287 267
288Please note that the current coro will I<not> be put into the ready 268Please note that the current coro will I<not> be put into the ready
289queue, so calling this function usually means you will never be called 269queue, so calling this function usually means you will never be called
290again unless something else (e.g. an event handler) calls C<< ->ready >>, 270again unless something else (e.g. an event handler) calls C<< ->ready >>,
398 } 378 }
399 379
400 # use like this: 380 # use like this:
401 timeslice { 381 timeslice {
402 # The following is an endless loop that would normally 382 # The following is an endless loop that would normally
403 # monopolise the process. Sicne it runs in a timeslice 383 # monopolise the process. Since it runs in a timesliced
404 # environment, it will regularly cede to other threads. 384 # environment, it will regularly cede to other threads.
405 while () { } 385 while () { }
406 }; 386 };
407 387
408 388
633Sets (or gets in case the argument is missing) the description for this 613Sets (or gets in case the argument is missing) the description for this
634coro. This is just a free-form string you can associate with a 614coro. This is just a free-form string you can associate with a
635coro. 615coro.
636 616
637This method simply sets the C<< $coro->{desc} >> member to the given 617This method simply sets the C<< $coro->{desc} >> member to the given
638string. You can modify this member directly if you wish. 618string. You can modify this member directly if you wish, and in fact, this
619is often preferred to indicate major processing states that cna then be
620seen for example in a L<Coro::Debug> session:
621
622 sub my_long_function {
623 local $Coro::current->{desc} = "now in my_long_function";
624 ...
625 $Coro::current->{desc} = "my_long_function: phase 1";
626 ...
627 $Coro::current->{desc} = "my_long_function: phase 2";
628 ...
629 }
639 630
640=cut 631=cut
641 632
642sub desc { 633sub desc {
643 my $old = $_[0]{desc}; 634 my $old = $_[0]{desc};
685The reason this function exists is that many event libraries (such as the 676The reason this function exists is that many event libraries (such as the
686venerable L<Event|Event> module) are not thread-safe (a weaker form 677venerable L<Event|Event> module) are not thread-safe (a weaker form
687of reentrancy). This means you must not block within event callbacks, 678of reentrancy). This means you must not block within event callbacks,
688otherwise you might suffer from crashes or worse. The only event library 679otherwise you might suffer from crashes or worse. The only event library
689currently known that is safe to use without C<unblock_sub> is L<EV>. 680currently known that is safe to use without C<unblock_sub> is L<EV>.
681
682Coro will try to catch you when you block in the event loop
683("FATAL:$Coro::IDLE blocked itself"), but this is just best effort and
684only works when you do not run your own event loop.
690 685
691This function allows your callbacks to block by executing them in another 686This function allows your callbacks to block by executing them in another
692coro where it is safe to block. One example where blocking is handy 687coro where it is safe to block. One example where blocking is handy
693is when you use the L<Coro::AIO|Coro::AIO> functions to save results to 688is when you use the L<Coro::AIO|Coro::AIO> functions to save results to
694disk, for example. 689disk, for example.
736 unshift @unblock_queue, [$cb, @_]; 731 unshift @unblock_queue, [$cb, @_];
737 $unblock_scheduler->ready; 732 $unblock_scheduler->ready;
738 } 733 }
739} 734}
740 735
741=item $cb = Coro::rouse_cb 736=item $cb = rouse_cb
742 737
743Create and return a "rouse callback". That's a code reference that, 738Create and return a "rouse callback". That's a code reference that,
744when called, will remember a copy of its arguments and notify the owner 739when called, will remember a copy of its arguments and notify the owner
745coro of the callback. 740coro of the callback.
746 741
747See the next function. 742See the next function.
748 743
749=item @args = Coro::rouse_wait [$cb] 744=item @args = rouse_wait [$cb]
750 745
751Wait for the specified rouse callback (or the last one that was created in 746Wait for the specified rouse callback (or the last one that was created in
752this coro). 747this coro).
753 748
754As soon as the callback is invoked (or when the callback was invoked 749As soon as the callback is invoked (or when the callback was invoked
755before C<rouse_wait>), it will return the arguments originally passed to 750before C<rouse_wait>), it will return the arguments originally passed to
756the rouse callback. 751the rouse callback. In scalar context, that means you get the I<last>
752argument, just as if C<rouse_wait> had a C<return ($a1, $a2, $a3...)>
753statement at the end.
757 754
758See the section B<HOW TO WAIT FOR A CALLBACK> for an actual usage example. 755See the section B<HOW TO WAIT FOR A CALLBACK> for an actual usage example.
759 756
760=back 757=back
761 758
849the windows process emulation enabled under unix roughly halves perl 846the windows process emulation enabled under unix roughly halves perl
850performance, even when not used. 847performance, even when not used.
851 848
852=item coro switching is not signal safe 849=item coro switching is not signal safe
853 850
854You must not switch to another coro from within a signal handler 851You must not switch to another coro from within a signal handler (only
855(only relevant with %SIG - most event libraries provide safe signals). 852relevant with %SIG - most event libraries provide safe signals), I<unless>
853you are sure you are not interrupting a Coro function.
856 854
857That means you I<MUST NOT> call any function that might "block" the 855That means you I<MUST NOT> call any function that might "block" the
858current coro - C<cede>, C<schedule> C<< Coro::Semaphore->down >> or 856current coro - C<cede>, C<schedule> C<< Coro::Semaphore->down >> or
859anything that calls those. Everything else, including calling C<ready>, 857anything that calls those. Everything else, including calling C<ready>,
860works. 858works.
861 859
862=back 860=back
863 861
864 862
863=head1 WINDOWS PROCESS EMULATION
864
865A great many people seem to be confused about ithreads (for example, Chip
866Salzenberg called me unintelligent, incapable, stupid and gullible,
867while in the same mail making rather confused statements about perl
868ithreads (for example, that memory or files would be shared), showing his
869lack of understanding of this area - if it is hard to understand for Chip,
870it is probably not obvious to everybody).
871
872What follows is an ultra-condensed version of my talk about threads in
873scripting languages given on the perl workshop 2009:
874
875The so-called "ithreads" were originally implemented for two reasons:
876first, to (badly) emulate unix processes on native win32 perls, and
877secondly, to replace the older, real thread model ("5.005-threads").
878
879It does that by using threads instead of OS processes. The difference
880between processes and threads is that threads share memory (and other
881state, such as files) between threads within a single process, while
882processes do not share anything (at least not semantically). That
883means that modifications done by one thread are seen by others, while
884modifications by one process are not seen by other processes.
885
886The "ithreads" work exactly like that: when creating a new ithreads
887process, all state is copied (memory is copied physically, files and code
888is copied logically). Afterwards, it isolates all modifications. On UNIX,
889the same behaviour can be achieved by using operating system processes,
890except that UNIX typically uses hardware built into the system to do this
891efficiently, while the windows process emulation emulates this hardware in
892software (rather efficiently, but of course it is still much slower than
893dedicated hardware).
894
895As mentioned before, loading code, modifying code, modifying data
896structures and so on is only visible in the ithreads process doing the
897modification, not in other ithread processes within the same OS process.
898
899This is why "ithreads" do not implement threads for perl at all, only
900processes. What makes it so bad is that on non-windows platforms, you can
901actually take advantage of custom hardware for this purpose (as evidenced
902by the forks module, which gives you the (i-) threads API, just much
903faster).
904
905Sharing data is in the i-threads model is done by transfering data
906structures between threads using copying semantics, which is very slow -
907shared data simply does not exist. Benchmarks using i-threads which are
908communication-intensive show extremely bad behaviour with i-threads (in
909fact, so bad that Coro, which cannot take direct advantage of multiple
910CPUs, is often orders of magnitude faster because it shares data using
911real threads, refer to my talk for details).
912
913As summary, i-threads *use* threads to implement processes, while
914the compatible forks module *uses* processes to emulate, uhm,
915processes. I-threads slow down every perl program when enabled, and
916outside of windows, serve no (or little) practical purpose, but
917disadvantages every single-threaded Perl program.
918
919This is the reason that I try to avoid the name "ithreads", as it is
920misleading as it implies that it implements some kind of thread model for
921perl, and prefer the name "windows process emulation", which describes the
922actual use and behaviour of it much better.
923
865=head1 SEE ALSO 924=head1 SEE ALSO
866 925
867Event-Loop integration: L<Coro::AnyEvent>, L<Coro::EV>, L<Coro::Event>. 926Event-Loop integration: L<Coro::AnyEvent>, L<Coro::EV>, L<Coro::Event>.
868 927
869Debugging: L<Coro::Debug>. 928Debugging: L<Coro::Debug>.

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines