ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent-Fork-RPC/RPC.pm
(Generate patch)

Comparing AnyEvent-Fork-RPC/RPC.pm (file contents):
Revision 1.10 by root, Wed Apr 17 22:04:49 2013 UTC vs.
Revision 1.16 by root, Thu Apr 18 14:07:15 2013 UTC

12 ->require ("MyModule") 12 ->require ("MyModule")
13 ->AnyEvent::Fork::RPC::run ( 13 ->AnyEvent::Fork::RPC::run (
14 "MyModule::server", 14 "MyModule::server",
15 ); 15 );
16 16
17 use AnyEvent;
18
17 my $cv = AE::cv; 19 my $cv = AE::cv;
18 20
19 $rpc->(1, 2, 3, sub { 21 $rpc->(1, 2, 3, sub {
20 print "MyModule::server returned @_\n"; 22 print "MyModule::server returned @_\n";
21 $cv->send; 23 $cv->send;
49silly, but illustrates the use of events. 51silly, but illustrates the use of events.
50 52
51First the parent process: 53First the parent process:
52 54
53 use AnyEvent; 55 use AnyEvent;
54 use AnyEvent::Fork;
55 use AnyEvent::Fork::RPC; 56 use AnyEvent::Fork::RPC;
56 57
57 my $done = AE::cv; 58 my $done = AE::cv;
58 59
59 my $rpc = AnyEvent::Fork 60 my $rpc = AnyEvent::Fork
174child process may exit at any time, so you should call C<$done> only when 175child process may exit at any time, so you should call C<$done> only when
175you really I<are> done. 176you really I<are> done.
176 177
177=head2 Example 2: Asynchronous Backend 178=head2 Example 2: Asynchronous Backend
178 179
179#TODO 180This example implements multiple count-downs in the child, using
181L<AnyEvent> timers. While this is a bit silly (one could use timers in te
182parent just as well), it illustrates the ability to use AnyEvent in the
183child and the fact that responses can arrive in a different order then the
184requests.
185
186It also shows how to embed the actual child code into a C<__DATA__>
187section, so it doesn't need any external files at all.
188
189And when your parent process is often busy, and you have stricter timing
190requirements, then running timers in a child process suddenly doesn't look
191so silly anymore.
192
193Without further ado, here is the code:
194
195 use AnyEvent;
196 use AnyEvent::Fork::RPC;
197
198 my $done = AE::cv;
199
200 my $rpc = AnyEvent::Fork
201 ->new
202 ->require ("AnyEvent::Fork::RPC::Async")
203 ->eval (do { local $/; <DATA> })
204 ->AnyEvent::Fork::RPC::run ("run",
205 async => 1,
206 on_error => sub { warn "FATAL: $_[0]"; exit 1 },
207 on_event => sub { print $_[0] },
208 on_destroy => $done,
209 );
210
211 for my $count (3, 2, 1) {
212 $rpc->($count, sub {
213 warn "job $count finished\n";
214 });
215 }
216
217 undef $rpc;
218
219 $done->recv;
220
221 __DATA__
222
223 # this ends up in main, as we don't use a package declaration
224
225 use AnyEvent;
226
227 sub run {
228 my ($done, $count) = @_;
229
230 my $n;
231
232 AnyEvent::Fork::RPC::event "starting to count up to $count\n";
233
234 my $w; $w = AE::timer 1, 1, sub {
235 ++$n;
236
237 AnyEvent::Fork::RPC::event "count $n of $count\n";
238
239 if ($n == $count) {
240 undef $w;
241 $done->();
242 }
243 };
244 }
245
246The parent part (the one before the C<__DATA__> section) isn't very
247different from the earlier examples. It sets async mode, preloads
248the backend module (so the C<AnyEvent::Fork::RPC::event> function is
249declared), uses a slightly different C<on_event> handler (which we use
250simply for logging purposes) and then, instead of loading a module with
251the actual worker code, it C<eval>'s the code from the data section in the
252child process.
253
254It then starts three countdowns, from 3 to 1 seconds downwards, destroys
255the rpc object so the example finishes eventually, and then just waits for
256the stuff to trickle in.
257
258The worker code uses the event function to log some progress messages, but
259mostly just creates a recurring one-second timer.
260
261The timer callback increments a counter, logs a message, and eventually,
262when the count has been reached, calls the finish callback.
263
264On my system, this results in the following output. Since all timers fire
265at roughly the same time, the actual order isn't guaranteed, but the order
266shown is very likely what you would get, too.
267
268 starting to count up to 3
269 starting to count up to 2
270 starting to count up to 1
271 count 1 of 3
272 count 1 of 2
273 count 1 of 1
274 job 1 finished
275 count 2 of 2
276 job 2 finished
277 count 2 of 3
278 count 3 of 3
279 job 3 finished
280
281While the overall ordering isn't guaranteed, the async backend still
282guarantees that events and responses are delivered to the parent process
283in the exact same ordering as they were generated in the child process.
284
285And unless your system is I<very> busy, it should clearly show that the
286job started last will finish first, as it has the lowest count.
287
288This concludes the async example. Since L<AnyEvent::Fork> does not
289actually fork, you are free to use about any module in the child, not just
290L<AnyEvent>, but also L<IO::AIO>, or L<Tk> for example.
180 291
181=head1 PARENT PROCESS USAGE 292=head1 PARENT PROCESS USAGE
182 293
183This module exports nothing, and only implements a single function: 294This module exports nothing, and only implements a single function:
184 295
263 374
264The default server used in the child does all I/O blockingly, and only 375The default server used in the child does all I/O blockingly, and only
265allows a single RPC call to execute concurrently. 376allows a single RPC call to execute concurrently.
266 377
267Setting C<async> to a true value switches to another implementation that 378Setting C<async> to a true value switches to another implementation that
268uses L<AnyEvent> in the child and allows multiple concurrent RPC calls. 379uses L<AnyEvent> in the child and allows multiple concurrent RPC calls (it
380does not support recursion in the event loop however, blocking condvar
381calls will fail).
269 382
270The actual API in the child is documented in the section that describes 383The actual API in the child is documented in the section that describes
271the calling semantics of the returned C<$rpc> function. 384the calling semantics of the returned C<$rpc> function.
272 385
273If you want to pre-load the actual back-end modules to enable memory 386If you want to pre-load the actual back-end modules to enable memory
275synchronous, and C<AnyEvent::Fork::RPC::Async> for asynchronous mode. 388synchronous, and C<AnyEvent::Fork::RPC::Async> for asynchronous mode.
276 389
277If you use a template process and want to fork both sync and async 390If you use a template process and want to fork both sync and async
278children, then it is permissible to load both modules. 391children, then it is permissible to load both modules.
279 392
280=item serialiser => $string (default: '(sub { pack "(w/a*)*", @_ }, sub { unpack "(w/a*)*", shift })') 393=item serialiser => $string (default: $AnyEvent::Fork::RPC::STRING_SERIALISER)
281 394
282All arguments, result data and event data have to be serialised to be 395All arguments, result data and event data have to be serialised to be
283transferred between the processes. For this, they have to be frozen and 396transferred between the processes. For this, they have to be frozen and
284thawed in both parent and child processes. 397thawed in both parent and child processes.
285 398
286By default, only octet strings can be passed between the processes, which 399By default, only octet strings can be passed between the processes, which
287is reasonably fast and efficient. 400is reasonably fast and efficient and requires no extra modules.
288 401
289For more complicated use cases, you can provide your own freeze and thaw 402For more complicated use cases, you can provide your own freeze and thaw
290functions, by specifying a string with perl source code. It's supposed to 403functions, by specifying a string with perl source code. It's supposed to
291return two code references when evaluated: the first receives a list of 404return two code references when evaluated: the first receives a list of
292perl values and must return an octet string. The second receives the octet 405perl values and must return an octet string. The second receives the octet
294 407
295If you need an external module for serialisation, then you can either 408If you need an external module for serialisation, then you can either
296pre-load it into your L<AnyEvent::Fork> process, or you can add a C<use> 409pre-load it into your L<AnyEvent::Fork> process, or you can add a C<use>
297or C<require> statement into the serialiser string. Or both. 410or C<require> statement into the serialiser string. Or both.
298 411
412Here are some examples - some of them are also available as global
413variables that make them easier to use.
414
415=over 4
416
417=item octet strings - C<$AnyEvent::Fork::RPC::STRING_SERIALISER>
418
419This serialiser concatenates length-prefixes octet strings, and is the
420default.
421
422Implementation:
423
424 (
425 sub { pack "(w/a*)*", @_ },
426 sub { unpack "(w/a*)*", shift }
427 )
428
429=item json - C<$AnyEvent::Fork::RPC::JSON_SERIALISER>
430
431This serialiser creates JSON arrays - you have to make sure the L<JSON>
432module is installed for this serialiser to work. It can be beneficial for
433sharing when you preload the L<JSON> module in a template process.
434
435L<JSON> (with L<JSON::XS> installed) is slower than the octet string
436serialiser, but usually much faster than L<Storable>, unless big chunks of
437binary data need to be transferred.
438
439Implementation:
440
441 use JSON ();
442 (
443 sub { JSON::encode_json \@_ },
444 sub { @{ JSON::decode_json shift } }
445 )
446
447=item storable - C<$AnyEvent::Fork::RPC::STORABLE_SERIALISER>
448
449This serialiser uses L<Storable>, which means it has high chance of
450serialising just about anything you throw at it, at the cost of having
451very high overhead per operation. It also comes with perl.
452
453Implementation:
454
455 use Storable ();
456 (
457 sub { Storable::freeze \@_ },
458 sub { @{ Storable::thaw shift } }
459 )
460
461=back
462
299=back 463=back
300 464
301See the examples section earlier in this document for some actual 465See the examples section earlier in this document for some actual
302examples. 466examples.
303 467
304=cut 468=cut
305 469
306our $STRING_SERIALISER = '(sub { pack "(w/a*)*", @_ }, sub { unpack "(w/a*)*", shift })'; 470our $STRING_SERIALISER = '(sub { pack "(w/a*)*", @_ }, sub { unpack "(w/a*)*", shift })';
471our $JSON_SERIALISER = 'use JSON (); (sub { JSON::encode_json \@_ }, sub { @{ JSON::decode_json shift } })';
472our $STORABLE_SERIALISER = 'use Storable (); (sub { Storable::freeze \@_ }, sub { @{ Storable::thaw shift } })';
307 473
308sub run { 474sub run {
309 my ($self, $function, %arg) = @_; 475 my ($self, $function, %arg) = @_;
310 476
311 my $serialiser = delete $arg{serialiser} || $STRING_SERIALISER; 477 my $serialiser = delete $arg{serialiser} || $STRING_SERIALISER;
380 } 546 }
381 } elsif (defined $len) { 547 } elsif (defined $len) {
382 undef $rw; undef $ww; # it ends here 548 undef $rw; undef $ww; # it ends here
383 549
384 if (@rcb || %rcb) { 550 if (@rcb || %rcb) {
385 use Data::Dump;ddx[\@rcb,\%rcb];#d#
386 $on_error->("unexpected eof"); 551 $on_error->("unexpected eof");
387 } else { 552 } else {
388 $on_destroy->(); 553 $on_destroy->();
389 } 554 }
390 } elsif ($! != Errno::EAGAIN && $! != Errno::EWOULDBLOCK) { 555 } elsif ($! != Errno::EAGAIN && $! != Errno::EWOULDBLOCK) {
470See the examples section earlier in this document for some actual 635See the examples section earlier in this document for some actual
471examples. 636examples.
472 637
473=back 638=back
474 639
640=head1 ADVANCED TOPICS
641
642=head2 Choosing a backend
643
644So how do you decide which backend to use? Well, that's your problem to
645solve, but here are some thoughts on the matter:
646
647=over 4
648
649=item Synchronous
650
651The synchronous backend does not rely on any external modules (well,
652except L<common::sense>, which works around a bug in how perl's warning
653system works). This keeps the process very small, for example, on my
654system, an empty perl interpreter uses 1492kB RSS, which becomes 2020kB
655after C<use warnings; use strict> (for people who grew up with C64s around
656them this is probably shocking every single time they see it). The worker
657process in the first example in this document uses 1792kB.
658
659Since the calls are done synchronously, slow jobs will keep newer jobs
660from executing.
661
662The synchronous backend also has no overhead due to running an event loop
663- reading requests is therefore very efficient, while writing responses is
664less so, as every response results in a write syscall.
665
666If the parent process is busy and a bit slow reading responses, the child
667waits instead of processing further requests. This also limits the amount
668of memory needed for buffering, as never more than one response has to be
669buffered.
670
671The API in the child is simple - you just have to define a function that
672does something and returns something.
673
674It's hard to use modules or code that relies on an event loop, as the
675child cannot execute anything while it waits for more input.
676
677=item Asynchronous
678
679The asynchronous backend relies on L<AnyEvent>, which tries to be small,
680but still comes at a price: On my system, the worker from example 1a uses
6813420kB RSS (for L<AnyEvent>, which loads L<EV>, which needs L<XSLoader>
682which in turn loads a lot of other modules such as L<warnings>, L<strict>,
683L<vars>, L<Exporter>...).
684
685It batches requests and responses reasonably efficiently, doing only as
686few reads and writes as needed, but needs to poll for events via the event
687loop.
688
689Responses are queued when the parent process is busy. This means the child
690can continue to execute any queued requests. It also means that a child
691might queue a lot of responses in memory when it generates them and the
692parent process is slow accepting them.
693
694The API is not a straightforward RPC pattern - you have to call a
695"done" callback to pass return values and signal completion. Also, more
696importantly, the API starts jobs as fast as possible - when 1000 jobs
697are queued and the jobs are slow, they will all run concurrently. The
698child must implement some queueing/limiting mechanism if this causes
699problems. Alternatively, the parent could limit the amount of rpc calls
700that are outstanding.
701
702Blocking use of condvars is not supported.
703
704Using event-based modules such as L<IO::AIO>, L<Gtk2>, L<Tk> and so on is
705easy.
706
707=back
708
709=head2 Passing file descriptors
710
711Unlike L<AnyEvent::Fork>, this module has no in-built file handle or file
712descriptor passing abilities.
713
714The reason is that passing file descriptors is extraordinary tricky
715business, and conflicts with efficient batching of messages.
716
717There still is a method you can use: Create a
718C<AnyEvent::Util::portable_socketpair> and C<send_fh> one half of it to
719the process before you pass control to C<AnyEvent::Fork::RPC::run>.
720
721Whenever you want to pass a file descriptor, send an rpc request to the
722child process (so it expects the descriptor), then send it over the other
723half of the socketpair. The child should fetch the descriptor from the
724half it has passed earlier.
725
726Here is some (untested) pseudocode to that effect:
727
728 use AnyEvent::Util;
729 use AnyEvent::Fork::RPC;
730 use IO::FDPass;
731
732 my ($s1, $s2) = AnyEvent::Util::portable_socketpair;
733
734 my $rpc = AnyEvent::Fork
735 ->new
736 ->send_fh ($s2)
737 ->require ("MyWorker")
738 ->AnyEvent::Fork::RPC::run ("MyWorker::run"
739 init => "MyWorker::init",
740 );
741
742 undef $s2; # no need to keep it around
743
744 # pass an fd
745 $rpc->("i'll send some fd now, please expect it!", my $cv = AE::cv);
746
747 IO::FDPass fileno $s1, fileno $handle_to_pass;
748
749 $cv->recv;
750
751The MyWorker module could look like this:
752
753 package MyWorker;
754
755 use IO::FDPass;
756
757 my $s2;
758
759 sub init {
760 $s2 = $_[0];
761 }
762
763 sub run {
764 if ($_[0] eq "i'll send some fd now, please expect it!") {
765 my $fd = IO::FDPass::recv fileno $s2;
766 ...
767 }
768 }
769
770Of course, this might be blocking if you pass a lot of file descriptors,
771so you might want to look into L<AnyEvent::FDpasser> which can handle the
772gory details.
773
475=head1 SEE ALSO 774=head1 SEE ALSO
476 775
477L<AnyEvent::Fork> (to create the processes in the first place), 776L<AnyEvent::Fork>, to create the processes in the first place.
777
478L<AnyEvent::Fork::Pool> (to manage whole pools of processes). 778L<AnyEvent::Fork::Pool>, to manage whole pools of processes.
479 779
480=head1 AUTHOR AND CONTACT INFORMATION 780=head1 AUTHOR AND CONTACT INFORMATION
481 781
482 Marc Lehmann <schmorp@schmorp.de> 782 Marc Lehmann <schmorp@schmorp.de>
483 http://software.schmorp.de/pkg/AnyEvent-Fork-RPC 783 http://software.schmorp.de/pkg/AnyEvent-Fork-RPC

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines