ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent-Fork-RPC/RPC.pm
(Generate patch)

Comparing AnyEvent-Fork-RPC/RPC.pm (file contents):
Revision 1.8 by root, Wed Apr 17 20:24:36 2013 UTC vs.
Revision 1.29 by root, Sun Aug 25 21:52:15 2013 UTC

2 2
3AnyEvent::Fork::RPC - simple RPC extension for AnyEvent::Fork 3AnyEvent::Fork::RPC - simple RPC extension for AnyEvent::Fork
4 4
5=head1 SYNOPSIS 5=head1 SYNOPSIS
6 6
7 use AnyEvent::Fork;
7 use AnyEvent::Fork::RPC; 8 use AnyEvent::Fork::RPC;
8 # use AnyEvent::Fork is not needed
9 9
10 my $rpc = AnyEvent::Fork 10 my $rpc = AnyEvent::Fork
11 ->new 11 ->new
12 ->require ("MyModule") 12 ->require ("MyModule")
13 ->AnyEvent::Fork::RPC::run ( 13 ->AnyEvent::Fork::RPC::run (
14 "MyModule::server", 14 "MyModule::server",
15 ); 15 );
16 16
17 use AnyEvent;
18
17 my $cv = AE::cv; 19 my $cv = AE::cv;
18 20
19 $rpc->(1, 2, 3, sub { 21 $rpc->(1, 2, 3, sub {
20 print "MyModule::server returned @_\n"; 22 print "MyModule::server returned @_\n";
21 $cv->send; 23 $cv->send;
24 $cv->recv; 26 $cv->recv;
25 27
26=head1 DESCRIPTION 28=head1 DESCRIPTION
27 29
28This module implements a simple RPC protocol and backend for processes 30This module implements a simple RPC protocol and backend for processes
29created via L<AnyEvent::Fork>, allowing you to call a function in the 31created via L<AnyEvent::Fork> or L<AnyEvent::Fork::Remote>, allowing you
30child process and receive its return values (up to 4GB serialised). 32to call a function in the child process and receive its return values (up
33to 4GB serialised).
31 34
32It implements two different backends: a synchronous one that works like a 35It implements two different backends: a synchronous one that works like a
33normal function call, and an asynchronous one that can run multiple jobs 36normal function call, and an asynchronous one that can run multiple jobs
34concurrently in the child, using AnyEvent. 37concurrently in the child, using AnyEvent.
35 38
36It also implements an asynchronous event mechanism from the child to the 39It also implements an asynchronous event mechanism from the child to the
37parent, that could be used for progress indications or other information. 40parent, that could be used for progress indications or other information.
38 41
39Loading this module also always loads L<AnyEvent::Fork>, so you can make a
40separate C<use AnyEvent::Fork> if you wish, but you don't have to.
41
42=head1 EXAMPLES 42=head1 EXAMPLES
43 43
44=head2 Synchronous Backend 44=head2 Example 1: Synchronous Backend
45 45
46Here is a simple example that implements a backend that executes C<unlink> 46Here is a simple example that implements a backend that executes C<unlink>
47and C<rmdir> calls, and reports their status back. It also reports the 47and C<rmdir> calls, and reports their status back. It also reports the
48number of requests it has processed every three requests, which is clearly 48number of requests it has processed every three requests, which is clearly
49silly, but illustrates the use of events. 49silly, but illustrates the use of events.
58 58
59 my $rpc = AnyEvent::Fork 59 my $rpc = AnyEvent::Fork
60 ->new 60 ->new
61 ->require ("MyWorker") 61 ->require ("MyWorker")
62 ->AnyEvent::Fork::RPC::run ("MyWorker::run", 62 ->AnyEvent::Fork::RPC::run ("MyWorker::run",
63 on_error => sub { warn "FATAL: $_[0]"; exit 1 }, 63 on_error => sub { warn "ERROR: $_[0]"; exit 1 },
64 on_event => sub { warn "$_[0] requests handled\n" }, 64 on_event => sub { warn "$_[0] requests handled\n" },
65 on_destroy => $done, 65 on_destroy => $done,
66 ); 66 );
67 67
68 for my $id (1..6) { 68 for my $id (1..6) {
137 137
138And as a final remark, there is a fine module on CPAN that can 138And as a final remark, there is a fine module on CPAN that can
139asynchronously C<rmdir> and C<unlink> and a lot more, and more efficiently 139asynchronously C<rmdir> and C<unlink> and a lot more, and more efficiently
140than this example, namely L<IO::AIO>. 140than this example, namely L<IO::AIO>.
141 141
142=head3 Example 1a: the same with the asynchronous backend
143
144This example only shows what needs to be changed to use the async backend
145instead. Doing this is not very useful, the purpose of this example is
146to show the minimum amount of change that is required to go from the
147synchronous to the asynchronous backend.
148
149To use the async backend in the previous example, you need to add the
150C<async> parameter to the C<AnyEvent::Fork::RPC::run> call:
151
152 ->AnyEvent::Fork::RPC::run ("MyWorker::run",
153 async => 1,
154 ...
155
156And since the function call protocol is now changed, you need to adopt
157C<MyWorker::run> to the async API.
158
159First, you need to accept the extra initial C<$done> callback:
160
161 sub run {
162 my ($done, $cmd, $path) = @_;
163
164And since a response is now generated when C<$done> is called, as opposed
165to when the function returns, we need to call the C<$done> function with
166the status:
167
168 $done->($status or (0, "$!"));
169
170A few remarks are in order. First, it's quite pointless to use the async
171backend for this example - but it I<is> possible. Second, you can call
172C<$done> before or after returning from the function. Third, having both
173returned from the function and having called the C<$done> callback, the
174child process may exit at any time, so you should call C<$done> only when
175you really I<are> done.
176
177=head2 Example 2: Asynchronous Backend
178
179This example implements multiple count-downs in the child, using
180L<AnyEvent> timers. While this is a bit silly (one could use timers in te
181parent just as well), it illustrates the ability to use AnyEvent in the
182child and the fact that responses can arrive in a different order then the
183requests.
184
185It also shows how to embed the actual child code into a C<__DATA__>
186section, so it doesn't need any external files at all.
187
188And when your parent process is often busy, and you have stricter timing
189requirements, then running timers in a child process suddenly doesn't look
190so silly anymore.
191
192Without further ado, here is the code:
193
194 use AnyEvent;
195 use AnyEvent::Fork;
196 use AnyEvent::Fork::RPC;
197
198 my $done = AE::cv;
199
200 my $rpc = AnyEvent::Fork
201 ->new
202 ->require ("AnyEvent::Fork::RPC::Async")
203 ->eval (do { local $/; <DATA> })
204 ->AnyEvent::Fork::RPC::run ("run",
205 async => 1,
206 on_error => sub { warn "ERROR: $_[0]"; exit 1 },
207 on_event => sub { print $_[0] },
208 on_destroy => $done,
209 );
210
211 for my $count (3, 2, 1) {
212 $rpc->($count, sub {
213 warn "job $count finished\n";
214 });
215 }
216
217 undef $rpc;
218
219 $done->recv;
220
221 __DATA__
222
223 # this ends up in main, as we don't use a package declaration
224
225 use AnyEvent;
226
227 sub run {
228 my ($done, $count) = @_;
229
230 my $n;
231
232 AnyEvent::Fork::RPC::event "starting to count up to $count\n";
233
234 my $w; $w = AE::timer 1, 1, sub {
235 ++$n;
236
237 AnyEvent::Fork::RPC::event "count $n of $count\n";
238
239 if ($n == $count) {
240 undef $w;
241 $done->();
242 }
243 };
244 }
245
246The parent part (the one before the C<__DATA__> section) isn't very
247different from the earlier examples. It sets async mode, preloads
248the backend module (so the C<AnyEvent::Fork::RPC::event> function is
249declared), uses a slightly different C<on_event> handler (which we use
250simply for logging purposes) and then, instead of loading a module with
251the actual worker code, it C<eval>'s the code from the data section in the
252child process.
253
254It then starts three countdowns, from 3 to 1 seconds downwards, destroys
255the rpc object so the example finishes eventually, and then just waits for
256the stuff to trickle in.
257
258The worker code uses the event function to log some progress messages, but
259mostly just creates a recurring one-second timer.
260
261The timer callback increments a counter, logs a message, and eventually,
262when the count has been reached, calls the finish callback.
263
264On my system, this results in the following output. Since all timers fire
265at roughly the same time, the actual order isn't guaranteed, but the order
266shown is very likely what you would get, too.
267
268 starting to count up to 3
269 starting to count up to 2
270 starting to count up to 1
271 count 1 of 3
272 count 1 of 2
273 count 1 of 1
274 job 1 finished
275 count 2 of 2
276 job 2 finished
277 count 2 of 3
278 count 3 of 3
279 job 3 finished
280
281While the overall ordering isn't guaranteed, the async backend still
282guarantees that events and responses are delivered to the parent process
283in the exact same ordering as they were generated in the child process.
284
285And unless your system is I<very> busy, it should clearly show that the
286job started last will finish first, as it has the lowest count.
287
288This concludes the async example. Since L<AnyEvent::Fork> does not
289actually fork, you are free to use about any module in the child, not just
290L<AnyEvent>, but also L<IO::AIO>, or L<Tk> for example.
291
292=head2 Example 3: Asynchronous backend with Coro
293
294With L<Coro> you can create a nice asynchronous backend implementation by
295defining an rpc server function that creates a new Coro thread for every
296request that calls a function "normally", i.e. the parameters from the
297parent process are passed to it, and any return values are returned to the
298parent process, e.g.:
299
300 package My::Arith;
301
302 sub add {
303 return $_[0] + $_[1];
304 }
305
306 sub mul {
307 return $_[0] * $_[1];
308 }
309
310 sub run {
311 my ($done, $func, @arg) = @_;
312
313 Coro::async_pool {
314 $done->($func->(@arg));
315 };
316 }
317
318The C<run> function creates a new thread for every invocation, using the
319first argument as function name, and calls the C<$done> callback on it's
320return values. This makes it quite natural to define the C<add> and C<mul>
321functions to add or multiply two numbers and return the result.
322
323Since this is the asynchronous backend, it's quite possible to define RPC
324function that do I/O or wait for external events - their execution will
325overlap as needed.
326
327The above could be used like this:
328
329 my $rpc = AnyEvent::Fork
330 ->new
331 ->require ("MyWorker")
332 ->AnyEvent::Fork::RPC::run ("My::Arith::run",
333 on_error => ..., on_event => ..., on_destroy => ...,
334 );
335
336 $rpc->(add => 1, 3, Coro::rouse_cb); say Coro::rouse_wait;
337 $rpc->(mul => 3, 2, Coro::rouse_cb); say Coro::rouse_wait;
338
339The C<say>'s will print C<4> and C<6>.
340
142=head1 PARENT PROCESS USAGE 341=head1 PARENT PROCESS USAGE
143 342
144This module exports nothing, and only implements a single function: 343This module exports nothing, and only implements a single function:
145 344
146=over 4 345=over 4
153 352
154use Errno (); 353use Errno ();
155use Guard (); 354use Guard ();
156 355
157use AnyEvent; 356use AnyEvent;
158use AnyEvent::Fork; # we don't actually depend on it, this is for convenience
159 357
160our $VERSION = 0.1; 358our $VERSION = 1.1;
161 359
162=item my $rpc = AnyEvent::Fork::RPC::run $fork, $function, [key => value...] 360=item my $rpc = AnyEvent::Fork::RPC::run $fork, $function, [key => value...]
163 361
164The traditional way to call it. But it is way cooler to call it in the 362The traditional way to call it. But it is way cooler to call it in the
165following way: 363following way:
185Called on (fatal) errors, with a descriptive (hopefully) message. If 383Called on (fatal) errors, with a descriptive (hopefully) message. If
186this callback is not provided, but C<on_event> is, then the C<on_event> 384this callback is not provided, but C<on_event> is, then the C<on_event>
187callback is called with the first argument being the string C<error>, 385callback is called with the first argument being the string C<error>,
188followed by the error message. 386followed by the error message.
189 387
190If neither handler is provided it prints the error to STDERR and will 388If neither handler is provided, then the error is reported with loglevel
191start failing badly. 389C<error> via C<AE::log>.
192 390
193=item on_event => $cb->(...) 391=item on_event => $cb->(...)
194 392
195Called for every call to the C<AnyEvent::Fork::RPC::event> function in the 393Called for every call to the C<AnyEvent::Fork::RPC::event> function in the
196child, with the arguments of that function passed to the callback. 394child, with the arguments of that function passed to the callback.
224 422
225The default server used in the child does all I/O blockingly, and only 423The default server used in the child does all I/O blockingly, and only
226allows a single RPC call to execute concurrently. 424allows a single RPC call to execute concurrently.
227 425
228Setting C<async> to a true value switches to another implementation that 426Setting C<async> to a true value switches to another implementation that
229uses L<AnyEvent> in the child and allows multiple concurrent RPC calls. 427uses L<AnyEvent> in the child and allows multiple concurrent RPC calls (it
428does not support recursion in the event loop however, blocking condvar
429calls will fail).
230 430
231The actual API in the child is documented in the section that describes 431The actual API in the child is documented in the section that describes
232the calling semantics of the returned C<$rpc> function. 432the calling semantics of the returned C<$rpc> function.
233 433
234If you want to pre-load the actual back-end modules to enable memory 434If you want to pre-load the actual back-end modules to enable memory
236synchronous, and C<AnyEvent::Fork::RPC::Async> for asynchronous mode. 436synchronous, and C<AnyEvent::Fork::RPC::Async> for asynchronous mode.
237 437
238If you use a template process and want to fork both sync and async 438If you use a template process and want to fork both sync and async
239children, then it is permissible to load both modules. 439children, then it is permissible to load both modules.
240 440
241=item serialiser => $string (default: '(sub { pack "(w/a*)*", @_ }, sub { unpack "(w/a*)*", shift })') 441=item serialiser => $string (default: $AnyEvent::Fork::RPC::STRING_SERIALISER)
242 442
243All arguments, result data and event data have to be serialised to be 443All arguments, result data and event data have to be serialised to be
244transferred between the processes. For this, they have to be frozen and 444transferred between the processes. For this, they have to be frozen and
245thawed in both parent and child processes. 445thawed in both parent and child processes.
246 446
247By default, only octet strings can be passed between the processes, which 447By default, only octet strings can be passed between the processes, which
248is reasonably fast and efficient. 448is reasonably fast and efficient and requires no extra modules.
249 449
250For more complicated use cases, you can provide your own freeze and thaw 450For more complicated use cases, you can provide your own freeze and thaw
251functions, by specifying a string with perl source code. It's supposed to 451functions, by specifying a string with perl source code. It's supposed to
252return two code references when evaluated: the first receives a list of 452return two code references when evaluated: the first receives a list of
253perl values and must return an octet string. The second receives the octet 453perl values and must return an octet string. The second receives the octet
255 455
256If you need an external module for serialisation, then you can either 456If you need an external module for serialisation, then you can either
257pre-load it into your L<AnyEvent::Fork> process, or you can add a C<use> 457pre-load it into your L<AnyEvent::Fork> process, or you can add a C<use>
258or C<require> statement into the serialiser string. Or both. 458or C<require> statement into the serialiser string. Or both.
259 459
460Here are some examples - some of them are also available as global
461variables that make them easier to use.
462
463=over 4
464
465=item octet strings - C<$AnyEvent::Fork::RPC::STRING_SERIALISER>
466
467This serialiser concatenates length-prefixes octet strings, and is the
468default. That means you can only pass (and return) strings containing
469character codes 0-255.
470
471Implementation:
472
473 (
474 sub { pack "(w/a*)*", @_ },
475 sub { unpack "(w/a*)*", shift }
476 )
477
478=item json - C<$AnyEvent::Fork::RPC::JSON_SERIALISER>
479
480This serialiser creates JSON arrays - you have to make sure the L<JSON>
481module is installed for this serialiser to work. It can be beneficial for
482sharing when you preload the L<JSON> module in a template process.
483
484L<JSON> (with L<JSON::XS> installed) is slower than the octet string
485serialiser, but usually much faster than L<Storable>, unless big chunks of
486binary data need to be transferred.
487
488Implementation:
489
490 use JSON ();
491 (
492 sub { JSON::encode_json \@_ },
493 sub { @{ JSON::decode_json shift } }
494 )
495
496=item storable - C<$AnyEvent::Fork::RPC::STORABLE_SERIALISER>
497
498This serialiser uses L<Storable>, which means it has high chance of
499serialising just about anything you throw at it, at the cost of having
500very high overhead per operation. It also comes with perl. It should be
501used when you need to serialise complex data structures.
502
503Implementation:
504
505 use Storable ();
506 (
507 sub { Storable::freeze \@_ },
508 sub { @{ Storable::thaw shift } }
509 )
510
511=item portable storable - C<$AnyEvent::Fork::RPC::NSTORABLE_SERIALISER>
512
513This serialiser also uses L<Storable>, but uses it's "network" format
514to serialise data, which makes it possible to talk to different
515perl binaries (for example, when talking to a process created with
516L<AnyEvent::Fork::Remote>).
517
518Implementation:
519
520 use Storable ();
521 (
522 sub { Storable::nfreeze \@_ },
523 sub { @{ Storable::thaw shift } }
524 )
525
260=back 526=back
261 527
528=back
529
262See the examples section earlier in this document for some actual examples. 530See the examples section earlier in this document for some actual
531examples.
263 532
264=cut 533=cut
265 534
266our $STRING_SERIALISER = '(sub { pack "(w/a*)*", @_ }, sub { unpack "(w/a*)*", shift })'; 535our $STRING_SERIALISER = '(sub { pack "(w/a*)*", @_ }, sub { unpack "(w/a*)*", shift })';
536our $JSON_SERIALISER = 'use JSON (); (sub { JSON::encode_json \@_ }, sub { @{ JSON::decode_json shift } })';
537our $STORABLE_SERIALISER = 'use Storable (); (sub { Storable::freeze \@_ }, sub { @{ Storable::thaw shift } })';
538our $NSTORABLE_SERIALISER = 'use Storable (); (sub { Storable::nfreeze \@_ }, sub { @{ Storable::thaw shift } })';
267 539
268sub run { 540sub run {
269 my ($self, $function, %arg) = @_; 541 my ($self, $function, %arg) = @_;
270 542
271 my $serialiser = delete $arg{serialiser} || $STRING_SERIALISER; 543 my $serialiser = delete $arg{serialiser} || $STRING_SERIALISER;
274 my $on_destroy = delete $arg{on_destroy}; 546 my $on_destroy = delete $arg{on_destroy};
275 547
276 # default for on_error is to on_event, if specified 548 # default for on_error is to on_event, if specified
277 $on_error ||= $on_event 549 $on_error ||= $on_event
278 ? sub { $on_event->(error => shift) } 550 ? sub { $on_event->(error => shift) }
279 : sub { die "AnyEvent::Fork::RPC: uncaught error: $_[0].\n" }; 551 : sub { AE::log die => "AnyEvent::Fork::RPC: uncaught error: $_[0]." };
280 552
281 # default for on_event is to raise an error 553 # default for on_event is to raise an error
282 $on_event ||= sub { $on_error->("event received, but no on_event handler") }; 554 $on_event ||= sub { $on_error->("event received, but no on_event handler") };
283 555
284 my ($f, $t) = eval $serialiser; die $@ if $@; 556 my ($f, $t) = eval $serialiser; die $@ if $@;
285 557
286 my (@rcb, $fh, $shutdown, $wbuf, $ww, $rw); 558 my (@rcb, %rcb, $fh, $shutdown, $wbuf, $ww);
287 my ($rlen, $rbuf) = 512 - 16; 559 my ($rlen, $rbuf, $rw) = 512 - 16;
288 560
289 my $wcb = sub { 561 my $wcb = sub {
290 my $len = syswrite $fh, $wbuf; 562 my $len = syswrite $fh, $wbuf;
291 563
292 if (!defined $len) { 564 unless (defined $len) {
293 if ($! != Errno::EAGAIN && $! != Errno::EWOULDBLOCK) { 565 if ($! != Errno::EAGAIN && $! != Errno::EWOULDBLOCK) {
294 undef $rw; undef $ww; # it ends here 566 undef $rw; undef $ww; # it ends here
295 $on_error->("$!"); 567 $on_error->("$!");
296 } 568 }
297 } 569 }
308 580
309 $self->require ($module) 581 $self->require ($module)
310 ->send_arg ($function, $arg{init}, $serialiser) 582 ->send_arg ($function, $arg{init}, $serialiser)
311 ->run ("$module\::run", sub { 583 ->run ("$module\::run", sub {
312 $fh = shift; 584 $fh = shift;
585
586 my ($id, $len);
313 $rw = AE::io $fh, 0, sub { 587 $rw = AE::io $fh, 0, sub {
314 $rlen = $rlen * 2 + 16 if $rlen - 128 < length $rbuf; 588 $rlen = $rlen * 2 + 16 if $rlen - 128 < length $rbuf;
315 my $len = sysread $fh, $rbuf, $rlen - length $rbuf, length $rbuf; 589 $len = sysread $fh, $rbuf, $rlen - length $rbuf, length $rbuf;
316 590
317 if ($len) { 591 if ($len) {
318 while (4 <= length $rbuf) { 592 while (8 <= length $rbuf) {
319 $len = unpack "L", $rbuf; 593 ($id, $len) = unpack "NN", $rbuf;
320 4 + $len <= length $rbuf 594 8 + $len <= length $rbuf
321 or last; 595 or last;
322 596
323 my @r = $t->(substr $rbuf, 4, $len); 597 my @r = $t->(substr $rbuf, 8, $len);
324 substr $rbuf, 0, $len + 4, ""; 598 substr $rbuf, 0, 8 + $len, "";
599
600 if ($id) {
601 if (@rcb) {
602 (shift @rcb)->(@r);
603 } elsif (my $cb = delete $rcb{$id}) {
604 $cb->(@r);
605 } else {
606 undef $rw; undef $ww;
607 $on_error->("unexpected data from child");
325 608 }
326 if (pop @r) { 609 } else {
327 $on_event->(@r); 610 $on_event->(@r);
328 } elsif (@rcb) {
329 (shift @rcb)->(@r);
330 } else {
331 undef $rw; undef $ww;
332 $on_error->("unexpected data from child");
333 } 611 }
334 } 612 }
335 } elsif (defined $len) { 613 } elsif (defined $len) {
336 undef $rw; undef $ww; # it ends here 614 undef $rw; undef $ww; # it ends here
337 615
338 if (@rcb) { 616 if (@rcb || %rcb) {
339 $on_error->("unexpected eof"); 617 $on_error->("unexpected eof");
340 } else { 618 } else {
341 $on_destroy->(); 619 $on_destroy->()
620 if $on_destroy;
342 } 621 }
343 } elsif ($! != Errno::EAGAIN && $! != Errno::EWOULDBLOCK) { 622 } elsif ($! != Errno::EAGAIN && $! != Errno::EWOULDBLOCK) {
344 undef $rw; undef $ww; # it ends here 623 undef $rw; undef $ww; # it ends here
345 $on_error->("read: $!"); 624 $on_error->("read: $!");
346 } 625 }
349 $ww ||= AE::io $fh, 1, $wcb; 628 $ww ||= AE::io $fh, 1, $wcb;
350 }); 629 });
351 630
352 my $guard = Guard::guard { 631 my $guard = Guard::guard {
353 $shutdown = 1; 632 $shutdown = 1;
354 $ww ||= $fh && AE::io $fh, 1, $wcb; 633
634 shutdown $fh, 1 if $fh && !$ww;
355 }; 635 };
356 636
637 my $id;
638
639 $arg{async}
357 sub { 640 ? sub {
358 push @rcb, pop; 641 $id = ($id == 0xffffffff ? 0 : $id) + 1;
642 $id = ($id == 0xffffffff ? 0 : $id) + 1 while exists $rcb{$id}; # rarely loops
359 643
644 $rcb{$id} = pop;
645
360 $guard; # keep it alive 646 $guard if 0; # keep it alive
361 647
362 $wbuf .= pack "L/a*", &$f; 648 $wbuf .= pack "NN/a*", $id, &$f;
363 $ww ||= $fh && AE::io $fh, 1, $wcb; 649 $ww ||= $fh && AE::io $fh, 1, $wcb;
364 } 650 }
651 : sub {
652 push @rcb, pop;
653
654 $guard; # keep it alive
655
656 $wbuf .= pack "N/a*", &$f;
657 $ww ||= $fh && AE::io $fh, 1, $wcb;
658 }
365} 659}
366 660
367=item $rpc->(..., $cb->(...)) 661=item $rpc->(..., $cb->(...))
368 662
369The RPC object returned by C<AnyEvent::Fork::RPC::run> is actually a code 663The RPC object returned by C<AnyEvent::Fork::RPC::run> is actually a code
409See the examples section earlier in this document for some actual 703See the examples section earlier in this document for some actual
410examples. 704examples.
411 705
412=back 706=back
413 707
708=head1 ADVANCED TOPICS
709
710=head2 Choosing a backend
711
712So how do you decide which backend to use? Well, that's your problem to
713solve, but here are some thoughts on the matter:
714
715=over 4
716
717=item Synchronous
718
719The synchronous backend does not rely on any external modules (well,
720except L<common::sense>, which works around a bug in how perl's warning
721system works). This keeps the process very small, for example, on my
722system, an empty perl interpreter uses 1492kB RSS, which becomes 2020kB
723after C<use warnings; use strict> (for people who grew up with C64s around
724them this is probably shocking every single time they see it). The worker
725process in the first example in this document uses 1792kB.
726
727Since the calls are done synchronously, slow jobs will keep newer jobs
728from executing.
729
730The synchronous backend also has no overhead due to running an event loop
731- reading requests is therefore very efficient, while writing responses is
732less so, as every response results in a write syscall.
733
734If the parent process is busy and a bit slow reading responses, the child
735waits instead of processing further requests. This also limits the amount
736of memory needed for buffering, as never more than one response has to be
737buffered.
738
739The API in the child is simple - you just have to define a function that
740does something and returns something.
741
742It's hard to use modules or code that relies on an event loop, as the
743child cannot execute anything while it waits for more input.
744
745=item Asynchronous
746
747The asynchronous backend relies on L<AnyEvent>, which tries to be small,
748but still comes at a price: On my system, the worker from example 1a uses
7493420kB RSS (for L<AnyEvent>, which loads L<EV>, which needs L<XSLoader>
750which in turn loads a lot of other modules such as L<warnings>, L<strict>,
751L<vars>, L<Exporter>...).
752
753It batches requests and responses reasonably efficiently, doing only as
754few reads and writes as needed, but needs to poll for events via the event
755loop.
756
757Responses are queued when the parent process is busy. This means the child
758can continue to execute any queued requests. It also means that a child
759might queue a lot of responses in memory when it generates them and the
760parent process is slow accepting them.
761
762The API is not a straightforward RPC pattern - you have to call a
763"done" callback to pass return values and signal completion. Also, more
764importantly, the API starts jobs as fast as possible - when 1000 jobs
765are queued and the jobs are slow, they will all run concurrently. The
766child must implement some queueing/limiting mechanism if this causes
767problems. Alternatively, the parent could limit the amount of rpc calls
768that are outstanding.
769
770Blocking use of condvars is not supported.
771
772Using event-based modules such as L<IO::AIO>, L<Gtk2>, L<Tk> and so on is
773easy.
774
775=back
776
777=head2 Passing file descriptors
778
779Unlike L<AnyEvent::Fork>, this module has no in-built file handle or file
780descriptor passing abilities.
781
782The reason is that passing file descriptors is extraordinary tricky
783business, and conflicts with efficient batching of messages.
784
785There still is a method you can use: Create a
786C<AnyEvent::Util::portable_socketpair> and C<send_fh> one half of it to
787the process before you pass control to C<AnyEvent::Fork::RPC::run>.
788
789Whenever you want to pass a file descriptor, send an rpc request to the
790child process (so it expects the descriptor), then send it over the other
791half of the socketpair. The child should fetch the descriptor from the
792half it has passed earlier.
793
794Here is some (untested) pseudocode to that effect:
795
796 use AnyEvent::Util;
797 use AnyEvent::Fork;
798 use AnyEvent::Fork::RPC;
799 use IO::FDPass;
800
801 my ($s1, $s2) = AnyEvent::Util::portable_socketpair;
802
803 my $rpc = AnyEvent::Fork
804 ->new
805 ->send_fh ($s2)
806 ->require ("MyWorker")
807 ->AnyEvent::Fork::RPC::run ("MyWorker::run"
808 init => "MyWorker::init",
809 );
810
811 undef $s2; # no need to keep it around
812
813 # pass an fd
814 $rpc->("i'll send some fd now, please expect it!", my $cv = AE::cv);
815
816 IO::FDPass fileno $s1, fileno $handle_to_pass;
817
818 $cv->recv;
819
820The MyWorker module could look like this:
821
822 package MyWorker;
823
824 use IO::FDPass;
825
826 my $s2;
827
828 sub init {
829 $s2 = $_[0];
830 }
831
832 sub run {
833 if ($_[0] eq "i'll send some fd now, please expect it!") {
834 my $fd = IO::FDPass::recv fileno $s2;
835 ...
836 }
837 }
838
839Of course, this might be blocking if you pass a lot of file descriptors,
840so you might want to look into L<AnyEvent::FDpasser> which can handle the
841gory details.
842
843=head1 EXCEPTIONS
844
845There are no provisions whatsoever for catching exceptions at this time -
846in the child, exeptions might kill the process, causing calls to be lost
847and the parent encountering a fatal error. In the parent, exceptions in
848the result callback will not be caught and cause undefined behaviour.
849
414=head1 SEE ALSO 850=head1 SEE ALSO
415 851
416L<AnyEvent::Fork> (to create the processes in the first place), 852L<AnyEvent::Fork>, to create the processes in the first place.
853
854L<AnyEvent::Fork::Remote>, likewise, but helpful for remote processes.
855
417L<AnyEvent::Fork::Pool> (to manage whole pools of processes). 856L<AnyEvent::Fork::Pool>, to manage whole pools of processes.
418 857
419=head1 AUTHOR AND CONTACT INFORMATION 858=head1 AUTHOR AND CONTACT INFORMATION
420 859
421 Marc Lehmann <schmorp@schmorp.de> 860 Marc Lehmann <schmorp@schmorp.de>
422 http://software.schmorp.de/pkg/AnyEvent-Fork-RPC 861 http://software.schmorp.de/pkg/AnyEvent-Fork-RPC

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines