ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent-Fork/Fork.pm
(Generate patch)

Comparing AnyEvent-Fork/Fork.pm (file contents):
Revision 1.25 by root, Sat Apr 6 08:55:16 2013 UTC vs.
Revision 1.37 by root, Sat Apr 6 20:06:39 2013 UTC

125becomes very hard to use the event loop from a child program, as the 125becomes very hard to use the event loop from a child program, as the
126watchers already exist but are only meaningful in the parent. Worse, a 126watchers already exist but are only meaningful in the parent. Worse, a
127module might want to use such a module, not knowing whether another module 127module might want to use such a module, not knowing whether another module
128or the main program also does, leading to problems. 128or the main program also does, leading to problems.
129 129
130Apart from event loops, graphical toolkits also commonly fall into the
131"unsafe module" category, or just about anything that communicates with
132the external world, such as network libraries and file I/O modules, which
133usually don't like being copied and then allowed to continue in two
134processes.
135
130With this module only the main program is allowed to create new processes 136With this module only the main program is allowed to create new processes
131by forking (because only the main program can know when it is still safe 137by forking (because only the main program can know when it is still safe
132to do so) - all other processes are created via fork+exec, which makes it 138to do so) - all other processes are created via fork+exec, which makes it
133possible to use modules such as event loops or window interfaces safely. 139possible to use modules such as event loops or window interfaces safely.
134 140
146 152
147 # now $master_filehandle is connected to the 153 # now $master_filehandle is connected to the
148 # $slave_filehandle in the new process. 154 # $slave_filehandle in the new process.
149 }); 155 });
150 156
151 # MyModule::worker might look like this 157C<MyModule> might look like this:
158
159 package MyModule;
160
152 sub MyModule::worker { 161 sub worker {
153 my ($slave_filehandle) = @_; 162 my ($slave_filehandle) = @_;
154 163
155 # now $slave_filehandle is connected to the $master_filehandle 164 # now $slave_filehandle is connected to the $master_filehandle
156 # in the original prorcess. have fun! 165 # in the original prorcess. have fun!
157 } 166 }
176 } 185 }
177 186
178 # now do other things - maybe use the filehandle provided by run 187 # now do other things - maybe use the filehandle provided by run
179 # to wait for the processes to die. or whatever. 188 # to wait for the processes to die. or whatever.
180 189
181 # My::Server::run might look like this 190C<My::Server> might look like this:
182 sub My::Server::run { 191
192 package My::Server;
193
194 sub run {
183 my ($slave, $listener, $id) = @_; 195 my ($slave, $listener, $id) = @_;
184 196
185 close $slave; # we do not use the socket, so close it to save resources 197 close $slave; # we do not use the socket, so close it to save resources
186 198
187 # we could go ballistic and use e.g. AnyEvent here, or IO::AIO, 199 # we could go ballistic and use e.g. AnyEvent here, or IO::AIO,
191 } 203 }
192 } 204 }
193 205
194=head2 use AnyEvent::Fork as a faster fork+exec 206=head2 use AnyEvent::Fork as a faster fork+exec
195 207
196This runs /bin/echo hi, with stdout redirected to /tmp/log and stderr to 208This runs C</bin/echo hi>, with stdandard output redirected to /tmp/log
197the communications socket. It is usually faster than fork+exec, but still 209and standard error redirected to the communications socket. It is usually
198let's you prepare the environment. 210faster than fork+exec, but still lets you prepare the environment.
199 211
200 open my $output, ">/tmp/log" or die "$!"; 212 open my $output, ">/tmp/log" or die "$!";
201 213
202 AnyEvent::Fork 214 AnyEvent::Fork
203 ->new 215 ->new
303 my ($fork_fh) = @_; 315 my ($fork_fh) = @_;
304 }); 316 });
305 317
306=back 318=back
307 319
308=head1 FUNCTIONS 320=head1 THE C<AnyEvent::Fork> CLASS
321
322This module exports nothing, and only implements a single class -
323C<AnyEvent::Fork>.
324
325There are two class constructors that both create new processes - C<new>
326and C<new_exec>. The C<fork> method creates a new process by forking an
327existing one and could be considered a third constructor.
328
329Most of the remaining methods deal with preparing the new process, by
330loading code, evaluating code and sending data to the new process. They
331usually return the process object, so you can chain method calls.
332
333If a process object is destroyed before calling its C<run> method, then
334the process simply exits. After C<run> is called, all responsibility is
335passed to the specified function.
336
337As long as there is any outstanding work to be done, process objects
338resist being destroyed, so there is no reason to store them unless you
339need them later - configure and forget works just fine.
309 340
310=over 4 341=over 4
311 342
312=cut 343=cut
313 344
323use IO::FDPass; 354use IO::FDPass;
324 355
325our $VERSION = 0.5; 356our $VERSION = 0.5;
326 357
327our $PERL; # the path to the perl interpreter, deduces with various forms of magic 358our $PERL; # the path to the perl interpreter, deduces with various forms of magic
328
329=item my $pool = new AnyEvent::Fork key => value...
330
331Create a new process pool. The following named parameters are supported:
332 359
333=over 4 360=over 4
334 361
335=back 362=back
336 363
432Create a new "empty" perl interpreter process and returns its process 459Create a new "empty" perl interpreter process and returns its process
433object for further manipulation. 460object for further manipulation.
434 461
435The new process is forked from a template process that is kept around 462The new process is forked from a template process that is kept around
436for this purpose. When it doesn't exist yet, it is created by a call to 463for this purpose. When it doesn't exist yet, it is created by a call to
437C<new_exec> and kept around for future calls. 464C<new_exec> first and then stays around for future calls.
438
439When the process object is destroyed, it will release the file handle
440that connects it with the new process. When the new process has not yet
441called C<run>, then the process will exit. Otherwise, what happens depends
442entirely on the code that is executed.
443 465
444=cut 466=cut
445 467
446sub new { 468sub new {
447 my $class = shift; 469 my $class = shift;
537} 559}
538 560
539=item $pid = $proc->pid 561=item $pid = $proc->pid
540 562
541Returns the process id of the process I<iff it is a direct child of the 563Returns the process id of the process I<iff it is a direct child of the
542process> running AnyEvent::Fork, and C<undef> otherwise. 564process running AnyEvent::Fork>, and C<undef> otherwise.
543 565
544Normally, only processes created via C<< AnyEvent::Fork->new_exec >> and 566Normally, only processes created via C<< AnyEvent::Fork->new_exec >> and
545L<AnyEvent::Fork::Template> are direct children, and you are responsible 567L<AnyEvent::Fork::Template> are direct children, and you are responsible
546to clean up their zombies when they die. 568to clean up their zombies when they die.
547 569
548All other processes are not direct children, and will be cleaned up by 570All other processes are not direct children, and will be cleaned up by
549AnyEvent::Fork. 571AnyEvent::Fork itself.
550 572
551=cut 573=cut
552 574
553sub pid { 575sub pid {
554 $_[0][0] 576 $_[0][0]
565 587
566The code will usually be executed after this call returns, and there is no 588The code will usually be executed after this call returns, and there is no
567way to pass anything back to the calling process. Any evaluation errors 589way to pass anything back to the calling process. Any evaluation errors
568will be reported to stderr and cause the process to exit. 590will be reported to stderr and cause the process to exit.
569 591
570If you want to execute some code to take over the process (see the 592If you want to execute some code (that isn't in a module) to take over the
571"fork+exec" example in the SYNOPSIS), you should compile a function via 593process, you should compile a function via C<eval> first, and then call
572C<eval> first, and then call it via C<run>. This also gives you access to 594it via C<run>. This also gives you access to any arguments passed via the
573any arguments passed via the C<send_xxx> methods, such as file handles. 595C<send_xxx> methods, such as file handles. See the L<use AnyEvent::Fork as
596a faster fork+exec> example to see it in action.
574 597
575Returns the process object for easy chaining of method calls. 598Returns the process object for easy chaining of method calls.
576 599
577=cut 600=cut
578 601
604=item $proc = $proc->send_fh ($handle, ...) 627=item $proc = $proc->send_fh ($handle, ...)
605 628
606Send one or more file handles (I<not> file descriptors) to the process, 629Send one or more file handles (I<not> file descriptors) to the process,
607to prepare a call to C<run>. 630to prepare a call to C<run>.
608 631
609The process object keeps a reference to the handles until this is done, 632The process object keeps a reference to the handles until they have
610so you must not explicitly close the handles. This is most easily 633been passed over to the process, so you must not explicitly close the
611accomplished by simply not storing the file handles anywhere after passing 634handles. This is most easily accomplished by simply not storing the file
612them to this method. 635handles anywhere after passing them to this method - when AnyEvent::Fork
636is finished using them, perl will automatically close them.
613 637
614Returns the process object for easy chaining of method calls. 638Returns the process object for easy chaining of method calls.
615 639
616Example: pass a file handle to a process, and release it without 640Example: pass a file handle to a process, and release it without
617closing. It will be closed automatically when it is no longer used. 641closing. It will be closed automatically when it is no longer used.
633} 657}
634 658
635=item $proc = $proc->send_arg ($string, ...) 659=item $proc = $proc->send_arg ($string, ...)
636 660
637Send one or more argument strings to the process, to prepare a call to 661Send one or more argument strings to the process, to prepare a call to
638C<run>. The strings can be any octet string. 662C<run>. The strings can be any octet strings.
639 663
640The protocol is optimised to pass a moderate number of relatively short 664The protocol is optimised to pass a moderate number of relatively short
641strings - while you can pass up to 4GB of data in one go, this is more 665strings - while you can pass up to 4GB of data in one go, this is more
642meant to pass some ID information or other startup info, not big chunks of 666meant to pass some ID information or other startup info, not big chunks of
643data. 667data.
659Enter the function specified by the function name in C<$func> in the 683Enter the function specified by the function name in C<$func> in the
660process. The function is called with the communication socket as first 684process. The function is called with the communication socket as first
661argument, followed by all file handles and string arguments sent earlier 685argument, followed by all file handles and string arguments sent earlier
662via C<send_fh> and C<send_arg> methods, in the order they were called. 686via C<send_fh> and C<send_arg> methods, in the order they were called.
663 687
688The process object becomes unusable on return from this function - any
689further method calls result in undefined behaviour.
690
664The function name should be fully qualified, but if it isn't, it will be 691The function name should be fully qualified, but if it isn't, it will be
665looked up in the main package. 692looked up in the C<main> package.
666 693
667If the called function returns, doesn't exist, or any error occurs, the 694If the called function returns, doesn't exist, or any error occurs, the
668process exits. 695process exits.
669 696
670Preparing the process is done in the background - when all commands have 697Preparing the process is done in the background - when all commands have
671been sent, the callback is invoked with the local communications socket 698been sent, the callback is invoked with the local communications socket
672as argument. At this point you can start using the socket in any way you 699as argument. At this point you can start using the socket in any way you
673like. 700like.
674
675The process object becomes unusable on return from this function - any
676further method calls result in undefined behaviour.
677 701
678If the communication socket isn't used, it should be closed on both sides, 702If the communication socket isn't used, it should be closed on both sides,
679to save on kernel memory. 703to save on kernel memory.
680 704
681The socket is non-blocking in the parent, and blocking in the newly 705The socket is non-blocking in the parent, and blocking in the newly
756 479 vfork+execs per second, using AnyEvent::Fork->new_exec 780 479 vfork+execs per second, using AnyEvent::Fork->new_exec
757 781
758So how can C<< AnyEvent->new >> be faster than a standard fork, even 782So how can C<< AnyEvent->new >> be faster than a standard fork, even
759though it uses the same operations, but adds a lot of overhead? 783though it uses the same operations, but adds a lot of overhead?
760 784
761The difference is simply the process size: forking the 6MB process takes 785The difference is simply the process size: forking the 5MB process takes
762so much longer than forking the 2.5MB template process that the overhead 786so much longer than forking the 2.5MB template process that the extra
763introduced is canceled out. 787overhead introduced is canceled out.
764 788
765If the benchmark process grows, the normal fork becomes even slower: 789If the benchmark process grows, the normal fork becomes even slower:
766 790
767 1340 new processes, manual fork in a 20MB process 791 1340 new processes, manual fork of a 20MB process
768 731 new processes, manual fork in a 200MB process 792 731 new processes, manual fork of a 200MB process
769 235 new processes, manual fork in a 2000MB process 793 235 new processes, manual fork of a 2000MB process
770 794
771What that means (to me) is that I can use this module without having a 795What that means (to me) is that I can use this module without having a bad
772very bad conscience because of the extra overhead required to start new 796conscience because of the extra overhead required to start new processes.
773processes.
774 797
775=head1 TYPICAL PROBLEMS 798=head1 TYPICAL PROBLEMS
776 799
777This section lists typical problems that remain. I hope by recognising 800This section lists typical problems that remain. I hope by recognising
778them, most can be avoided. 801them, most can be avoided.
779 802
780=over 4 803=over 4
781 804
782=item "leaked" file descriptors for exec'ed processes 805=item leaked file descriptors for exec'ed processes
783 806
784POSIX systems inherit file descriptors by default when exec'ing a new 807POSIX systems inherit file descriptors by default when exec'ing a new
785process. While perl itself laudably sets the close-on-exec flags on new 808process. While perl itself laudably sets the close-on-exec flags on new
786file handles, most C libraries don't care, and even if all cared, it's 809file handles, most C libraries don't care, and even if all cared, it's
787often not possible to set the flag in a race-free manner. 810often not possible to set the flag in a race-free manner.
807libraries or the code that leaks those file descriptors. 830libraries or the code that leaks those file descriptors.
808 831
809Fortunately, most of these leaked descriptors do no harm, other than 832Fortunately, most of these leaked descriptors do no harm, other than
810sitting on some resources. 833sitting on some resources.
811 834
812=item "leaked" file descriptors for fork'ed processes 835=item leaked file descriptors for fork'ed processes
813 836
814Normally, L<AnyEvent::Fork> does start new processes by exec'ing them, 837Normally, L<AnyEvent::Fork> does start new processes by exec'ing them,
815which closes file descriptors not marked for being inherited. 838which closes file descriptors not marked for being inherited.
816 839
817However, L<AnyEvent::Fork::Early> and L<AnyEvent::Fork::Template> offer 840However, L<AnyEvent::Fork::Early> and L<AnyEvent::Fork::Template> offer
826 849
827The solution is to either not load these modules before use'ing 850The solution is to either not load these modules before use'ing
828L<AnyEvent::Fork::Early> or L<AnyEvent::Fork::Template>, or to delay 851L<AnyEvent::Fork::Early> or L<AnyEvent::Fork::Template>, or to delay
829initialising them, for example, by calling C<init Gtk2> manually. 852initialising them, for example, by calling C<init Gtk2> manually.
830 853
831=item exit runs destructors 854=item exiting calls object destructors
832 855
833This only applies to users of Lc<AnyEvent::Fork:Early> and 856This only applies to users of Lc<AnyEvent::Fork:Early> and
834L<AnyEvent::Fork::Template>. 857L<AnyEvent::Fork::Template>.
835 858
836When a process created by AnyEvent::Fork exits, it might do so by calling 859When a process created by AnyEvent::Fork exits, it might do so by calling
858to make it so, mostly due to the bloody broken perl that nobody seems to 881to make it so, mostly due to the bloody broken perl that nobody seems to
859care about. The fork emulation is a bad joke - I have yet to see something 882care about. The fork emulation is a bad joke - I have yet to see something
860useful that you can do with it without running into memory corruption 883useful that you can do with it without running into memory corruption
861issues or other braindamage. Hrrrr. 884issues or other braindamage. Hrrrr.
862 885
863Cygwin perl is not supported at the moment, as it should implement fd 886Cygwin perl is not supported at the moment due to some hilarious
864passing, but doesn't, and rolling my own is hard, as cygwin doesn't 887shortcomings of its API - see L<IO::FDPoll> for more details.
865support enough functionality to do it.
866 888
867=head1 SEE ALSO 889=head1 SEE ALSO
868 890
869L<AnyEvent::Fork::Early> (to avoid executing a perl interpreter), 891L<AnyEvent::Fork::Early> (to avoid executing a perl interpreter),
870L<AnyEvent::Fork::Template> (to create a process by forking the main 892L<AnyEvent::Fork::Template> (to create a process by forking the main

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines