… | |
… | |
40 | points in your program, so locking and parallel access are rarely an |
40 | points in your program, so locking and parallel access are rarely an |
41 | issue, making thread programming much safer and easier than using other |
41 | issue, making thread programming much safer and easier than using other |
42 | thread models. |
42 | thread models. |
43 | |
43 | |
44 | Unlike the so-called "Perl threads" (which are not actually real threads |
44 | Unlike the so-called "Perl threads" (which are not actually real threads |
45 | but only the windows process emulation ported to unix, and as such act |
45 | but only the windows process emulation (see section of same name for more |
46 | as processes), Coro provides a full shared address space, which makes |
46 | details) ported to unix, and as such act as processes), Coro provides |
47 | communication between threads very easy. And Coro's threads are fast, |
47 | a full shared address space, which makes communication between threads |
48 | too: disabling the Windows process emulation code in your perl and using |
48 | very easy. And Coro's threads are fast, too: disabling the Windows |
49 | Coro can easily result in a two to four times speed increase for your |
49 | process emulation code in your perl and using Coro can easily result in |
50 | programs. A parallel matrix multiplication benchmark runs over 300 times |
50 | a two to four times speed increase for your programs. A parallel matrix |
51 | faster on a single core than perl's pseudo-threads on a quad core using |
51 | multiplication benchmark runs over 300 times faster on a single core than |
52 | all four cores. |
52 | perl's pseudo-threads on a quad core using all four cores. |
53 | |
53 | |
54 | Coro achieves that by supporting multiple running interpreters that share |
54 | Coro achieves that by supporting multiple running interpreters that share |
55 | data, which is especially useful to code pseudo-parallel processes and |
55 | data, which is especially useful to code pseudo-parallel processes and |
56 | for event-based programming, such as multiple HTTP-GET requests running |
56 | for event-based programming, such as multiple HTTP-GET requests running |
57 | concurrently. See L<Coro::AnyEvent> to learn more on how to integrate Coro |
57 | concurrently. See L<Coro::AnyEvent> to learn more on how to integrate Coro |
… | |
… | |
80 | |
80 | |
81 | our $idle; # idle handler |
81 | our $idle; # idle handler |
82 | our $main; # main coro |
82 | our $main; # main coro |
83 | our $current; # current coro |
83 | our $current; # current coro |
84 | |
84 | |
85 | our $VERSION = 5.132; |
85 | our $VERSION = 5.17; |
86 | |
86 | |
87 | our @EXPORT = qw(async async_pool cede schedule terminate current unblock_sub); |
87 | our @EXPORT = qw(async async_pool cede schedule terminate current unblock_sub); |
88 | our %EXPORT_TAGS = ( |
88 | our %EXPORT_TAGS = ( |
89 | prio => [qw(PRIO_MAX PRIO_HIGH PRIO_NORMAL PRIO_LOW PRIO_IDLE PRIO_MIN)], |
89 | prio => [qw(PRIO_MAX PRIO_HIGH PRIO_NORMAL PRIO_LOW PRIO_IDLE PRIO_MIN)], |
90 | ); |
90 | ); |
… | |
… | |
206 | Example: Create a new coro that just prints its arguments. |
206 | Example: Create a new coro that just prints its arguments. |
207 | |
207 | |
208 | async { |
208 | async { |
209 | print "@_\n"; |
209 | print "@_\n"; |
210 | } 1,2,3,4; |
210 | } 1,2,3,4; |
211 | |
|
|
212 | =cut |
|
|
213 | |
|
|
214 | sub async(&@) { |
|
|
215 | my $coro = new Coro @_; |
|
|
216 | $coro->ready; |
|
|
217 | $coro |
|
|
218 | } |
|
|
219 | |
211 | |
220 | =item async_pool { ... } [@args...] |
212 | =item async_pool { ... } [@args...] |
221 | |
213 | |
222 | Similar to C<async>, but uses a coro pool, so you should not call |
214 | Similar to C<async>, but uses a coro pool, so you should not call |
223 | terminate or join on it (although you are allowed to), and you get a |
215 | terminate or join on it (although you are allowed to), and you get a |
… | |
… | |
338 | |
330 | |
339 | These functions implement the same concept as C<dynamic-wind> in Scheme, |
331 | These functions implement the same concept as C<dynamic-wind> in Scheme, |
340 | and are useful when you want to localise some resource to a specific |
332 | and are useful when you want to localise some resource to a specific |
341 | coro. |
333 | coro. |
342 | |
334 | |
343 | They slow down coro switching considerably for coros that use |
335 | They slow down thread switching considerably for coros that use them |
344 | them (But coro switching is still reasonably fast if the handlers are |
336 | (about 40% for a BLOCK with a single assignment, so thread switching is |
345 | fast). |
337 | still reasonably fast if the handlers are fast). |
346 | |
338 | |
347 | These functions are best understood by an example: The following function |
339 | These functions are best understood by an example: The following function |
348 | will change the current timezone to "Antarctica/South_Pole", which |
340 | will change the current timezone to "Antarctica/South_Pole", which |
349 | requires a call to C<tzset>, but by using C<on_enter> and C<on_leave>, |
341 | requires a call to C<tzset>, but by using C<on_enter> and C<on_leave>, |
350 | which remember/change the current timezone and restore the previous |
342 | which remember/change the current timezone and restore the previous |
… | |
… | |
373 | }; |
365 | }; |
374 | |
366 | |
375 | This can be used to localise just about any resource (locale, uid, current |
367 | This can be used to localise just about any resource (locale, uid, current |
376 | working directory etc.) to a block, despite the existence of other |
368 | working directory etc.) to a block, despite the existence of other |
377 | coros. |
369 | coros. |
|
|
370 | |
|
|
371 | Another interesting example implements time-sliced multitasking using |
|
|
372 | interval timers (this could obviously be optimised, but does the job): |
|
|
373 | |
|
|
374 | # "timeslice" the given block |
|
|
375 | sub timeslice(&) { |
|
|
376 | use Time::HiRes (); |
|
|
377 | |
|
|
378 | Coro::on_enter { |
|
|
379 | # on entering the thread, we set a VTALRM handler to cede |
|
|
380 | $SIG{VTALRM} = sub { cede }; |
|
|
381 | # and then start the interval timer |
|
|
382 | Time::HiRes::setitimer &Time::HiRes::ITIMER_VIRTUAL, 0.01, 0.01; |
|
|
383 | }; |
|
|
384 | Coro::on_leave { |
|
|
385 | # on leaving the thread, we stop the interval timer again |
|
|
386 | Time::HiRes::setitimer &Time::HiRes::ITIMER_VIRTUAL, 0, 0; |
|
|
387 | }; |
|
|
388 | |
|
|
389 | &{+shift}; |
|
|
390 | } |
|
|
391 | |
|
|
392 | # use like this: |
|
|
393 | timeslice { |
|
|
394 | # The following is an endless loop that would normally |
|
|
395 | # monopolise the process. Since it runs in a timesliced |
|
|
396 | # environment, it will regularly cede to other threads. |
|
|
397 | while () { } |
|
|
398 | }; |
|
|
399 | |
378 | |
400 | |
379 | =item killall |
401 | =item killall |
380 | |
402 | |
381 | Kills/terminates/cancels all coros except the currently running one. |
403 | Kills/terminates/cancels all coros except the currently running one. |
382 | |
404 | |
… | |
… | |
721 | Wait for the specified rouse callback (or the last one that was created in |
743 | Wait for the specified rouse callback (or the last one that was created in |
722 | this coro). |
744 | this coro). |
723 | |
745 | |
724 | As soon as the callback is invoked (or when the callback was invoked |
746 | As soon as the callback is invoked (or when the callback was invoked |
725 | before C<rouse_wait>), it will return the arguments originally passed to |
747 | before C<rouse_wait>), it will return the arguments originally passed to |
726 | the rouse callback. |
748 | the rouse callback. In scalar context, that means you get the I<last> |
|
|
749 | argument, just as if C<rouse_wait> had a C<return ($a1, $a2, $a3...)> |
|
|
750 | statement at the end. |
727 | |
751 | |
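The scalar-context rule described above is ordinary Perl list-return behaviour. The following is a minimal sketch in plain Perl, not Coro code: C<fake_rouse_wait> is a hypothetical stand-in (it is not part of Coro) that simply ends with the same kind of C<return ($a1, $a2, $a3)> statement, so the effect of the calling context can be seen in isolation.

```perl
use strict;
use warnings;

# Hypothetical stand-in for rouse_wait (NOT part of Coro): it returns its
# arguments as a parenthesised list, like the "return ($a1, $a2, $a3...)"
# statement mentioned in the text.
sub fake_rouse_wait {
    my ($a1, $a2, $a3) = @_;
    return ($a1, $a2, $a3);
}

# List context: all arguments come back.
my @all = fake_rouse_wait("status", "headers", "body");

# Scalar context: the comma operator yields the last argument.
my $last = fake_rouse_wait("status", "headers", "body");

# A one-element list assignment picks the first argument instead.
my ($first) = fake_rouse_wait("status", "headers", "body");

print "all=@all last=$last first=$first\n";
```

If the first argument is the one you need, assigning to a parenthesised list as in C<my ($first) = ...> forces list context and avoids the surprise.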
728 | See the section B<HOW TO WAIT FOR A CALLBACK> for an actual usage example. |
752 | See the section B<HOW TO WAIT FOR A CALLBACK> for an actual usage example. |
729 | |
753 | |
730 | =back |
754 | =back |
731 | |
755 | |
… | |
… | |
830 | works. |
854 | works. |
831 | |
855 | |
832 | =back |
856 | =back |
833 | |
857 | |
834 | |
858 | |
|
|
859 | =head1 WINDOWS PROCESS EMULATION |
|
|
860 | |
|
|
861 | A great many people seem to be confused about ithreads (for example, Chip |
|
|
862 | Salzenberg called me unintelligent, incapable, stupid and ingullible, |
|
|
863 | while in the same mail making rather confused statements about perl |
|
|
864 | ithreads (for example, that memory or files would be shared), showing his |
|
|
865 | lack of understanding of this area - if it is hard to understand for Chip, |
|
|
866 | it is probably not obvious to everybody). |
|
|
867 | |
|
|
868 | What follows is an ultra-condensed version of my talk about threads in |
|
|
869 | scripting languages given at the perl workshop 2009: |
|
|
870 | |
|
|
871 | The so-called "ithreads" were originally implemented for two reasons: |
|
|
872 | first, to (badly) emulate unix processes on native win32 perls, and |
|
|
873 | secondly, to replace the older, real thread model ("5.005-threads"). |
|
|
874 | |
|
|
875 | It does that by using threads instead of OS processes. The difference |
|
|
876 | between processes and threads is that threads share memory (and other |
|
|
877 | state, such as files) between threads within a single process, while |
|
|
878 | processes do not share anything (at least not semantically). That |
|
|
879 | means that modifications done by one thread are seen by others, while |
|
|
880 | modifications by one process are not seen by other processes. |
|
|
881 | |
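The process half of that distinction can be observed with plain C<fork> on a UNIX-like system. This is a minimal sketch under that assumption; the variable names and the pipe used to report the child's value are illustrative only and are not taken from Coro.

```perl
use strict;
use warnings;

my $counter = 1;

# A pipe lets the child report what it sees back to the parent.
pipe(my $reader, my $writer) or die "pipe failed: $!";

my $pid = fork();
die "fork failed: $!" unless defined $pid;

if ($pid == 0) {
    # Child process: modify its own copy of $counter and report it.
    close $reader;
    $counter = 42;
    print {$writer} "$counter\n";
    close $writer;
    exit 0;
}

# Parent process: read the child's value, then wait for the child.
close $writer;
chomp(my $child_value = <$reader>);
close $reader;
waitpid($pid, 0);

# The child's assignment is not visible in the parent.
print "child saw $child_value, parent still sees $counter\n";
```

The child's assignment changes only its own copy, so the parent still sees the original value. With threads that genuinely share memory, the same assignment would be visible to every thread.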
|
|
882 | The "ithreads" work exactly like that: when creating a new ithreads |
|
|
883 | process, all state is copied (memory is copied physically, files and code |
|
|
884 | are copied logically). Afterwards, it isolates all modifications. On UNIX, |
|
|
885 | the same behaviour can be achieved by using operating system processes, |
|
|
886 | except that UNIX typically uses hardware built into the system to do this |
|
|
887 | efficiently, while the windows process emulation emulates this hardware in |
|
|
888 | software (rather efficiently, but of course it is still much slower than |
|
|
889 | dedicated hardware). |
|
|
890 | |
|
|
891 | As mentioned before, loading code, modifying code, modifying data |
|
|
892 | structures and so on is only visible in the ithreads process doing the |
|
|
893 | modification, not in other ithread processes within the same OS process. |
|
|
894 | |
|
|
895 | This is why "ithreads" do not implement threads for perl at all, only |
|
|
896 | processes. What makes it so bad is that on non-windows platforms, you can |
|
|
897 | actually take advantage of custom hardware for this purpose (as evidenced |
|
|
898 | by the forks module, which gives you the (i-) threads API, just much |
|
|
899 | faster). |
|
|
900 | |
|
|
901 | Sharing data in the i-threads model is done by transferring data |
|
|
902 | structures between threads using copying semantics, which is very slow - |
|
|
903 | shared data simply does not exist. Benchmarks using i-threads which are |
|
|
904 | communication-intensive show extremely bad behaviour with i-threads (in |
|
|
905 | fact, so bad that Coro, which cannot take direct advantage of multiple |
|
|
906 | CPUs, is often orders of magnitude faster because it shares data using |
|
|
907 | real threads; refer to my talk for details). |
|
|
908 | |
|
|
909 | In summary, i-threads *use* threads to implement processes, while |
|
|
910 | the compatible forks module *uses* processes to emulate, uhm, |
|
|
911 | processes. I-threads slow down every perl program when enabled, and |
|
|
912 | outside of windows, serve no (or little) practical purpose, but |
|
|
913 | disadvantage every single-threaded Perl program. |
|
|
914 | |
|
|
915 | This is the reason that I try to avoid the name "ithreads", as it is |
|
|
916 | misleading: it implies that it implements some kind of thread model for |
|
|
917 | perl, and prefer the name "windows process emulation", which describes the |
|
|
918 | actual use and behaviour of it much better. |
|
|
919 | |
835 | =head1 SEE ALSO |
920 | =head1 SEE ALSO |
836 | |
921 | |
837 | Event-Loop integration: L<Coro::AnyEvent>, L<Coro::EV>, L<Coro::Event>. |
922 | Event-Loop integration: L<Coro::AnyEvent>, L<Coro::EV>, L<Coro::Event>. |
838 | |
923 | |
839 | Debugging: L<Coro::Debug>. |
924 | Debugging: L<Coro::Debug>. |