ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent-Fork-RPC/RPC.pm
(Generate patch)

Comparing AnyEvent-Fork-RPC/RPC.pm (file contents):
Revision 1.4 by root, Wed Apr 17 19:38:25 2013 UTC vs.
Revision 1.34 by root, Wed Nov 20 15:26:56 2013 UTC

12 ->require ("MyModule") 12 ->require ("MyModule")
13 ->AnyEvent::Fork::RPC::run ( 13 ->AnyEvent::Fork::RPC::run (
14 "MyModule::server", 14 "MyModule::server",
15 ); 15 );
16 16
17 use AnyEvent;
18
17 my $cv = AE::cv; 19 my $cv = AE::cv;
18 20
19 $rpc->(1, 2, 3, sub { 21 $rpc->(1, 2, 3, sub {
20 print "MyModule::server returned @_\n"; 22 print "MyModule::server returned @_\n";
21 $cv->send; 23 $cv->send;
24 $cv->recv; 26 $cv->recv;
25 27
26=head1 DESCRIPTION 28=head1 DESCRIPTION
27 29
28This module implements a simple RPC protocol and backend for processes 30This module implements a simple RPC protocol and backend for processes
29created via L<AnyEvent::Fork>, allowing you to call a function in the 31created via L<AnyEvent::Fork> or L<AnyEvent::Fork::Remote>, allowing you
30child process and receive its return values (up to 4GB serialised). 32to call a function in the child process and receive its return values (up
33to 4GB serialised).
31 34
32It implements two different backends: a synchronous one that works like a 35It implements two different backends: a synchronous one that works like a
33normal function call, and an asynchronous one that can run multiple jobs 36normal function call, and an asynchronous one that can run multiple jobs
34concurrently in the child, using AnyEvent. 37concurrently in the child, using AnyEvent.
35 38
36It also implements an asynchronous event mechanism from the child to the 39It also implements an asynchronous event mechanism from the child to the
37parent, that could be used for progress indications or other information. 40parent, that could be used for progress indications or other information.
38 41
39=head1 EXAMPLES 42=head1 EXAMPLES
40 43
41=head2 Synchronous Backend 44=head2 Example 1: Synchronous Backend
42 45
43Here is a simple example that implements a backend that executes C<unlink> 46Here is a simple example that implements a backend that executes C<unlink>
44and C<rmdir> calls, and reports their status back. It also reports the 47and C<rmdir> calls, and reports their status back. It also reports the
45number of requests it has processed every three requests, which is clearly 48number of requests it has processed every three requests, which is clearly
46silly, but illustrates the use of events. 49silly, but illustrates the use of events.
55 58
56 my $rpc = AnyEvent::Fork 59 my $rpc = AnyEvent::Fork
57 ->new 60 ->new
58 ->require ("MyWorker") 61 ->require ("MyWorker")
59 ->AnyEvent::Fork::RPC::run ("MyWorker::run", 62 ->AnyEvent::Fork::RPC::run ("MyWorker::run",
63 on_error => sub { warn "ERROR: $_[0]"; exit 1 },
60 on_event => sub { warn "$_[0] requests handled\n" }, 64 on_event => sub { warn "$_[0] requests handled\n" },
61 on_destroy => $done, 65 on_destroy => $done,
62 ); 66 );
63 67
64 for my $id (1..6) { 68 for my $id (1..6) {
102dies with a fatal error - obviously, you must never let this happen :). 106dies with a fatal error - obviously, you must never let this happen :).
103 107
104Eventually it returns the status value true if the command was successful, 108Eventually it returns the status value true if the command was successful,
105or the status value 0 and the stringified error message. 109or the status value 0 and the stringified error message.
106 110
107On my system, running the first cdoe fragment with the given 111On my system, running the first code fragment with the given
108F<MyWorker.pm> in the current directory yields: 112F<MyWorker.pm> in the current directory yields:
109 113
110 /tmp/somepath/1: No such file or directory 114 /tmp/somepath/1: No such file or directory
111 /tmp/somepath/2: No such file or directory 115 /tmp/somepath/2: No such file or directory
112 3 requests handled 116 3 requests handled
133 137
134And as a final remark, there is a fine module on CPAN that can 138And as a final remark, there is a fine module on CPAN that can
135asynchronously C<rmdir> and C<unlink> and a lot more, and more efficiently 139asynchronously C<rmdir> and C<unlink> and a lot more, and more efficiently
136than this example, namely L<IO::AIO>. 140than this example, namely L<IO::AIO>.
137 141
142=head3 Example 1a: the same with the asynchronous backend
143
144This example only shows what needs to be changed to use the async backend
145instead. Doing this is not very useful, the purpose of this example is
146to show the minimum amount of change that is required to go from the
147synchronous to the asynchronous backend.
148
149To use the async backend in the previous example, you need to add the
150C<async> parameter to the C<AnyEvent::Fork::RPC::run> call:
151
152 ->AnyEvent::Fork::RPC::run ("MyWorker::run",
153 async => 1,
154 ...
155
156And since the function call protocol is now changed, you need to adopt
157C<MyWorker::run> to the async API.
158
159First, you need to accept the extra initial C<$done> callback:
160
161 sub run {
162 my ($done, $cmd, $path) = @_;
163
164And since a response is now generated when C<$done> is called, as opposed
165to when the function returns, we need to call the C<$done> function with
166the status:
167
168 $done->($status or (0, "$!"));
169
170A few remarks are in order. First, it's quite pointless to use the async
171backend for this example - but it I<is> possible. Second, you can call
172C<$done> before or after returning from the function. Third, having both
173returned from the function and having called the C<$done> callback, the
174child process may exit at any time, so you should call C<$done> only when
175you really I<are> done.
176
177=head2 Example 2: Asynchronous Backend
178
179This example implements multiple count-downs in the child, using
180L<AnyEvent> timers. While this is a bit silly (one could use timers in the
181parent just as well), it illustrates the ability to use AnyEvent in the
182child and the fact that responses can arrive in a different order then the
183requests.
184
185It also shows how to embed the actual child code into a C<__DATA__>
186section, so it doesn't need any external files at all.
187
188And when your parent process is often busy, and you have stricter timing
189requirements, then running timers in a child process suddenly doesn't look
190so silly anymore.
191
192Without further ado, here is the code:
193
194 use AnyEvent;
195 use AnyEvent::Fork;
196 use AnyEvent::Fork::RPC;
197
198 my $done = AE::cv;
199
200 my $rpc = AnyEvent::Fork
201 ->new
202 ->require ("AnyEvent::Fork::RPC::Async")
203 ->eval (do { local $/; <DATA> })
204 ->AnyEvent::Fork::RPC::run ("run",
205 async => 1,
206 on_error => sub { warn "ERROR: $_[0]"; exit 1 },
207 on_event => sub { print $_[0] },
208 on_destroy => $done,
209 );
210
211 for my $count (3, 2, 1) {
212 $rpc->($count, sub {
213 warn "job $count finished\n";
214 });
215 }
216
217 undef $rpc;
218
219 $done->recv;
220
221 __DATA__
222
223 # this ends up in main, as we don't use a package declaration
224
225 use AnyEvent;
226
227 sub run {
228 my ($done, $count) = @_;
229
230 my $n;
231
232 AnyEvent::Fork::RPC::event "starting to count up to $count\n";
233
234 my $w; $w = AE::timer 1, 1, sub {
235 ++$n;
236
237 AnyEvent::Fork::RPC::event "count $n of $count\n";
238
239 if ($n == $count) {
240 undef $w;
241 $done->();
242 }
243 };
244 }
245
246The parent part (the one before the C<__DATA__> section) isn't very
247different from the earlier examples. It sets async mode, preloads
248the backend module (so the C<AnyEvent::Fork::RPC::event> function is
249declared), uses a slightly different C<on_event> handler (which we use
250simply for logging purposes) and then, instead of loading a module with
251the actual worker code, it C<eval>'s the code from the data section in the
252child process.
253
254It then starts three countdowns, from 3 to 1 seconds downwards, destroys
255the rpc object so the example finishes eventually, and then just waits for
256the stuff to trickle in.
257
258The worker code uses the event function to log some progress messages, but
259mostly just creates a recurring one-second timer.
260
261The timer callback increments a counter, logs a message, and eventually,
262when the count has been reached, calls the finish callback.
263
264On my system, this results in the following output. Since all timers fire
265at roughly the same time, the actual order isn't guaranteed, but the order
266shown is very likely what you would get, too.
267
268 starting to count up to 3
269 starting to count up to 2
270 starting to count up to 1
271 count 1 of 3
272 count 1 of 2
273 count 1 of 1
274 job 1 finished
275 count 2 of 2
276 job 2 finished
277 count 2 of 3
278 count 3 of 3
279 job 3 finished
280
281While the overall ordering isn't guaranteed, the async backend still
282guarantees that events and responses are delivered to the parent process
283in the exact same ordering as they were generated in the child process.
284
285And unless your system is I<very> busy, it should clearly show that the
286job started last will finish first, as it has the lowest count.
287
288This concludes the async example. Since L<AnyEvent::Fork> does not
289actually fork, you are free to use about any module in the child, not just
290L<AnyEvent>, but also L<IO::AIO>, or L<Tk> for example.
291
292=head2 Example 3: Asynchronous backend with Coro
293
294With L<Coro> you can create a nice asynchronous backend implementation by
295defining an rpc server function that creates a new Coro thread for every
296request that calls a function "normally", i.e. the parameters from the
297parent process are passed to it, and any return values are returned to the
298parent process, e.g.:
299
300 package My::Arith;
301
302 sub add {
303 return $_[0] + $_[1];
304 }
305
306 sub mul {
307 return $_[0] * $_[1];
308 }
309
310 sub run {
311 my ($done, $func, @arg) = @_;
312
313 Coro::async_pool {
314 $done->($func->(@arg));
315 };
316 }
317
318The C<run> function creates a new thread for every invocation, using the
319first argument as function name, and calls the C<$done> callback on it's
320return values. This makes it quite natural to define the C<add> and C<mul>
321functions to add or multiply two numbers and return the result.
322
323Since this is the asynchronous backend, it's quite possible to define RPC
324function that do I/O or wait for external events - their execution will
325overlap as needed.
326
327The above could be used like this:
328
329 my $rpc = AnyEvent::Fork
330 ->new
331 ->require ("MyWorker")
332 ->AnyEvent::Fork::RPC::run ("My::Arith::run",
333 on_error => ..., on_event => ..., on_destroy => ...,
334 );
335
336 $rpc->(add => 1, 3, Coro::rouse_cb); say Coro::rouse_wait;
337 $rpc->(mul => 3, 2, Coro::rouse_cb); say Coro::rouse_wait;
338
339The C<say>'s will print C<4> and C<6>.
340
341=head2 Example 4: Forward AnyEvent::Log messages using C<on_event>
342
343This partial example shows how to use the C<event> function to forward
344L<AnyEvent::Log> messages to the parent.
345
346For this, the parent needs to provide a suitable C<on_event>:
347
348 ->AnyEvent::Fork::RPC::run (
349 on_event => sub {
350 if ($_[0] eq "ae_log") {
351 my (undef, $level, $message) = @_;
352 AE::log $level, $message;
353 } else {
354 # other event types
355 }
356 },
357 )
358
359In the child, as early as possible, the following code should reconfigure
360L<AnyEvent::Log> to log via C<AnyEvent::Fork::RPC::event>:
361
362 $AnyEvent::Log::LOG->log_cb (sub {
363 my ($timestamp, $orig_ctx, $level, $message) = @{+shift};
364
365 if (defined &AnyEvent::Fork::RPC::event) {
366 AnyEvent::Fork::RPC::event (ae_log => $level, $message);
367 } else {
368 warn "[$$ before init] $message\n";
369 }
370 });
371
372There is an important twist - the C<AnyEvent::Fork::RPC::event> function
373is only defined when the child is fully initialised. If you redirect the
374log messages in your C<init> function for example, then the C<event>
375function might not yet be available. This is why the log callback checks
376whether the fucntion is there using C<defined>, and only then uses it to
377log the message.
378
138=head1 PARENT PROCESS USAGE 379=head1 PARENT PROCESS USAGE
139 380
140This module exports nothing, and only implements a single function: 381This module exports nothing, and only implements a single function:
141 382
142=over 4 383=over 4
149 390
150use Errno (); 391use Errno ();
151use Guard (); 392use Guard ();
152 393
153use AnyEvent; 394use AnyEvent;
154#use AnyEvent::Fork;
155 395
156our $VERSION = 0.1; 396our $VERSION = 1.21;
157 397
158=item my $rpc = AnyEvent::Fork::RPC::run $fork, $function, [key => value...] 398=item my $rpc = AnyEvent::Fork::RPC::run $fork, $function, [key => value...]
159 399
160The traditional way to call it. But it is way cooler to call it in the 400The traditional way to call it. But it is way cooler to call it in the
161following way: 401following way:
181Called on (fatal) errors, with a descriptive (hopefully) message. If 421Called on (fatal) errors, with a descriptive (hopefully) message. If
182this callback is not provided, but C<on_event> is, then the C<on_event> 422this callback is not provided, but C<on_event> is, then the C<on_event>
183callback is called with the first argument being the string C<error>, 423callback is called with the first argument being the string C<error>,
184followed by the error message. 424followed by the error message.
185 425
186If neither handler is provided it prints the error to STDERR and will 426If neither handler is provided, then the error is reported with loglevel
187start failing badly. 427C<error> via C<AE::log>.
188 428
189=item on_event => $cb->(...) 429=item on_event => $cb->(...)
190 430
191Called for every call to the C<AnyEvent::Fork::RPC::event> function in the 431Called for every call to the C<AnyEvent::Fork::RPC::event> function in the
192child, with the arguments of that function passed to the callback. 432child, with the arguments of that function passed to the callback.
197 437
198Called when the C<$rpc> object has been destroyed and all requests have 438Called when the C<$rpc> object has been destroyed and all requests have
199been successfully handled. This is useful when you queue some requests and 439been successfully handled. This is useful when you queue some requests and
200want the child to go away after it has handled them. The problem is that 440want the child to go away after it has handled them. The problem is that
201the parent must not exit either until all requests have been handled, and 441the parent must not exit either until all requests have been handled, and
202this cna be accomplished by waiting for this callback. 442this can be accomplished by waiting for this callback.
203 443
204=item init => $function (default none) 444=item init => $function (default none)
205 445
206When specified (by name), this function is called in the child as the very 446When specified (by name), this function is called in the child as the very
207first thing when taking over the process, with all the arguments normally 447first thing when taking over the process, with all the arguments normally
214It is called very early - before the serialisers are created or the 454It is called very early - before the serialisers are created or the
215C<$function> name is resolved into a function reference, so it could be 455C<$function> name is resolved into a function reference, so it could be
216used to load any modules that provide the serialiser or function. It can 456used to load any modules that provide the serialiser or function. It can
217not, however, create events. 457not, however, create events.
218 458
459=item done => $function (default C<CORE::exit>)
460
461The function to call when the asynchronous backend detects an end of file
462condition when reading from the communications socket I<and> there are no
463outstanding requests. It's ignored by the synchronous backend.
464
465By overriding this you can prolong the life of a RPC process after e.g.
466the parent has exited by running the event loop in the provided function
467(or simply calling it, for example, when your child process uses L<EV> you
468could provide L<EV::loop> as C<done> function).
469
470Of course, in that case you are responsible for exiting at the appropriate
471time and not returning from
472
219=item async => $boolean (default: 0) 473=item async => $boolean (default: 0)
220 474
221The default server used in the child does all I/O blockingly, and only 475The default server used in the child does all I/O blockingly, and only
222allows a single RPC call to execute concurrently. 476allows a single RPC call to execute concurrently.
223 477
224Setting C<async> to a true value switches to another implementation that 478Setting C<async> to a true value switches to another implementation that
225uses L<AnyEvent> in the child and allows multiple concurrent RPC calls. 479uses L<AnyEvent> in the child and allows multiple concurrent RPC calls (it
480does not support recursion in the event loop however, blocking condvar
481calls will fail).
226 482
227The actual API in the child is documented in the section that describes 483The actual API in the child is documented in the section that describes
228the calling semantics of the returned C<$rpc> function. 484the calling semantics of the returned C<$rpc> function.
229 485
230If you want to pre-load the actual back-end modules to enable memory 486If you want to pre-load the actual back-end modules to enable memory
231sharing, then you should load C<AnyEvent::Fork::RPC::Sync> for 487sharing, then you should load C<AnyEvent::Fork::RPC::Sync> for
232synchronous, and C<AnyEvent::Fork::RPC::Async> for asynchronous mode. 488synchronous, and C<AnyEvent::Fork::RPC::Async> for asynchronous mode.
233 489
234If you use a template process and want to fork both sync and async 490If you use a template process and want to fork both sync and async
235children, then it is permissible to laod both modules. 491children, then it is permissible to load both modules.
236 492
237=item serialiser => $string (default: '(sub { pack "(w/a*)*", @_ }, sub { unpack "(w/a*)*", shift })') 493=item serialiser => $string (default: $AnyEvent::Fork::RPC::STRING_SERIALISER)
238 494
239All arguments, result data and event data have to be serialised to be 495All arguments, result data and event data have to be serialised to be
240transferred between the processes. For this, they have to be frozen and 496transferred between the processes. For this, they have to be frozen and
241thawed in both parent and child processes. 497thawed in both parent and child processes.
242 498
243By default, only octet strings can be passed between the processes, which 499By default, only octet strings can be passed between the processes, which
244is reasonably fast and efficient. 500is reasonably fast and efficient and requires no extra modules.
245 501
246For more complicated use cases, you can provide your own freeze and thaw 502For more complicated use cases, you can provide your own freeze and thaw
247functions, by specifying a string with perl source code. It's supposed to 503functions, by specifying a string with perl source code. It's supposed to
248return two code references when evaluated: the first receives a list of 504return two code references when evaluated: the first receives a list of
249perl values and must return an octet string. The second receives the octet 505perl values and must return an octet string. The second receives the octet
251 507
252If you need an external module for serialisation, then you can either 508If you need an external module for serialisation, then you can either
253pre-load it into your L<AnyEvent::Fork> process, or you can add a C<use> 509pre-load it into your L<AnyEvent::Fork> process, or you can add a C<use>
254or C<require> statement into the serialiser string. Or both. 510or C<require> statement into the serialiser string. Or both.
255 511
512Here are some examples - some of them are also available as global
513variables that make them easier to use.
514
515=over 4
516
517=item octet strings - C<$AnyEvent::Fork::RPC::STRING_SERIALISER>
518
519This serialiser concatenates length-prefixes octet strings, and is the
520default. That means you can only pass (and return) strings containing
521character codes 0-255.
522
523Implementation:
524
525 (
526 sub { pack "(w/a*)*", @_ },
527 sub { unpack "(w/a*)*", shift }
528 )
529
530=item json - C<$AnyEvent::Fork::RPC::JSON_SERIALISER>
531
532This serialiser creates JSON arrays - you have to make sure the L<JSON>
533module is installed for this serialiser to work. It can be beneficial for
534sharing when you preload the L<JSON> module in a template process.
535
536L<JSON> (with L<JSON::XS> installed) is slower than the octet string
537serialiser, but usually much faster than L<Storable>, unless big chunks of
538binary data need to be transferred.
539
540Implementation:
541
542 use JSON ();
543 (
544 sub { JSON::encode_json \@_ },
545 sub { @{ JSON::decode_json shift } }
546 )
547
548=item storable - C<$AnyEvent::Fork::RPC::STORABLE_SERIALISER>
549
550This serialiser uses L<Storable>, which means it has high chance of
551serialising just about anything you throw at it, at the cost of having
552very high overhead per operation. It also comes with perl. It should be
553used when you need to serialise complex data structures.
554
555Implementation:
556
557 use Storable ();
558 (
559 sub { Storable::freeze \@_ },
560 sub { @{ Storable::thaw shift } }
561 )
562
563=item portable storable - C<$AnyEvent::Fork::RPC::NSTORABLE_SERIALISER>
564
565This serialiser also uses L<Storable>, but uses it's "network" format
566to serialise data, which makes it possible to talk to different
567perl binaries (for example, when talking to a process created with
568L<AnyEvent::Fork::Remote>).
569
570Implementation:
571
572 use Storable ();
573 (
574 sub { Storable::nfreeze \@_ },
575 sub { @{ Storable::thaw shift } }
576 )
577
256=back 578=back
257 579
580=back
581
582See the examples section earlier in this document for some actual
583examples.
584
258=cut 585=cut
259 586
260our $STRING_SERIALISER = '(sub { pack "(w/a*)*", @_ }, sub { unpack "(w/a*)*", shift })'; 587our $STRING_SERIALISER = '(sub { pack "(w/a*)*", @_ }, sub { unpack "(w/a*)*", shift })';
588our $JSON_SERIALISER = 'use JSON (); (sub { JSON::encode_json \@_ }, sub { @{ JSON::decode_json shift } })';
589our $STORABLE_SERIALISER = 'use Storable (); (sub { Storable::freeze \@_ }, sub { @{ Storable::thaw shift } })';
590our $NSTORABLE_SERIALISER = 'use Storable (); (sub { Storable::nfreeze \@_ }, sub { @{ Storable::thaw shift } })';
261 591
262sub run { 592sub run {
263 my ($self, $function, %arg) = @_; 593 my ($self, $function, %arg) = @_;
264 594
265 my $serialiser = delete $arg{serialiser} || $STRING_SERIALISER; 595 my $serialiser = delete $arg{serialiser} || $STRING_SERIALISER;
268 my $on_destroy = delete $arg{on_destroy}; 598 my $on_destroy = delete $arg{on_destroy};
269 599
270 # default for on_error is to on_event, if specified 600 # default for on_error is to on_event, if specified
271 $on_error ||= $on_event 601 $on_error ||= $on_event
272 ? sub { $on_event->(error => shift) } 602 ? sub { $on_event->(error => shift) }
273 : sub { die "AnyEvent::Fork::RPC: uncaught error: $_[0].\n" }; 603 : sub { AE::log die => "AnyEvent::Fork::RPC: uncaught error: $_[0]." };
274 604
275 # default for on_event is to raise an error 605 # default for on_event is to raise an error
276 $on_event ||= sub { $on_error->("event received, but no on_event handler") }; 606 $on_event ||= sub { $on_error->("event received, but no on_event handler") };
277 607
278 my ($f, $t) = eval $serialiser; die $@ if $@; 608 my ($f, $t) = eval $serialiser; die $@ if $@;
279 609
280 my (@rcb, $fh, $shutdown, $wbuf, $ww, $rw); 610 my (@rcb, %rcb, $fh, $shutdown, $wbuf, $ww);
281 my ($rlen, $rbuf) = 512 - 16; 611 my ($rlen, $rbuf, $rw) = 512 - 16;
282 612
283 my $wcb = sub { 613 my $wcb = sub {
284 my $len = syswrite $fh, $wbuf; 614 my $len = syswrite $fh, $wbuf;
285 615
286 if (!defined $len) { 616 unless (defined $len) {
287 if ($! != Errno::EAGAIN && $! != Errno::EWOULDBLOCK) { 617 if ($! != Errno::EAGAIN && $! != Errno::EWOULDBLOCK) {
288 undef $rw; undef $ww; # it ends here 618 undef $rw; undef $ww; # it ends here
289 $on_error->("$!"); 619 $on_error->("$!");
290 } 620 }
291 } 621 }
299 }; 629 };
300 630
301 my $module = "AnyEvent::Fork::RPC::" . ($arg{async} ? "Async" : "Sync"); 631 my $module = "AnyEvent::Fork::RPC::" . ($arg{async} ? "Async" : "Sync");
302 632
303 $self->require ($module) 633 $self->require ($module)
304 ->send_arg ($function, $arg{init}, $serialiser) 634 ->send_arg ($function, $arg{init}, $serialiser, $arg{done} || "$module\::do_exit")
305 ->run ("$module\::run", sub { 635 ->run ("$module\::run", sub {
306 $fh = shift; 636 $fh = shift;
637
638 my ($id, $len);
307 $rw = AE::io $fh, 0, sub { 639 $rw = AE::io $fh, 0, sub {
308 $rlen = $rlen * 2 + 16 if $rlen - 128 < length $rbuf; 640 $rlen = $rlen * 2 + 16 if $rlen - 128 < length $rbuf;
309 my $len = sysread $fh, $rbuf, $rlen - length $rbuf, length $rbuf; 641 $len = sysread $fh, $rbuf, $rlen - length $rbuf, length $rbuf;
310 642
311 if ($len) { 643 if ($len) {
312 while (5 <= length $rbuf) { 644 while (8 <= length $rbuf) {
313 $len = unpack "L", $rbuf; 645 ($id, $len) = unpack "NN", $rbuf;
314 4 + $len <= length $rbuf 646 8 + $len <= length $rbuf
315 or last; 647 or last;
316 648
317 my @r = $t->(substr $rbuf, 4, $len); 649 my @r = $t->(substr $rbuf, 8, $len);
318 substr $rbuf, 0, $len + 4, ""; 650 substr $rbuf, 0, 8 + $len, "";
651
652 if ($id) {
653 if (@rcb) {
654 (shift @rcb)->(@r);
655 } elsif (my $cb = delete $rcb{$id}) {
656 $cb->(@r);
657 } else {
658 undef $rw; undef $ww;
659 $on_error->("unexpected data from child");
319 660 }
320 if (pop @r) { 661 } else {
321 $on_event->(@r); 662 $on_event->(@r);
322 } elsif (@rcb) {
323 (shift @rcb)->(@r);
324 } else {
325 undef $rw; undef $ww;
326 $on_error->("unexpected data from child");
327 } 663 }
328 } 664 }
329 } elsif (defined $len) { 665 } elsif (defined $len) {
330 undef $rw; undef $ww; # it ends here 666 undef $rw; undef $ww; # it ends here
331 667
332 if (@rcb) { 668 if (@rcb || %rcb) {
333 $on_error->("unexpected eof"); 669 $on_error->("unexpected eof");
334 } else { 670 } else {
335 $on_destroy->(); 671 $on_destroy->()
672 if $on_destroy;
336 } 673 }
337 } elsif ($! != Errno::EAGAIN && $! != Errno::EWOULDBLOCK) { 674 } elsif ($! != Errno::EAGAIN && $! != Errno::EWOULDBLOCK) {
338 undef $rw; undef $ww; # it ends here 675 undef $rw; undef $ww; # it ends here
339 $on_error->("read: $!"); 676 $on_error->("read: $!");
340 } 677 }
343 $ww ||= AE::io $fh, 1, $wcb; 680 $ww ||= AE::io $fh, 1, $wcb;
344 }); 681 });
345 682
346 my $guard = Guard::guard { 683 my $guard = Guard::guard {
347 $shutdown = 1; 684 $shutdown = 1;
348 $ww ||= $fh && AE::io $fh, 1, $wcb; 685
686 shutdown $fh, 1 if $fh && !$ww;
349 }; 687 };
350 688
689 my $id;
690
691 $arg{async}
351 sub { 692 ? sub {
352 push @rcb, pop; 693 $id = ($id == 0xffffffff ? 0 : $id) + 1;
694 $id = ($id == 0xffffffff ? 0 : $id) + 1 while exists $rcb{$id}; # rarely loops
353 695
696 $rcb{$id} = pop;
697
354 $guard; # keep it alive 698 $guard if 0; # keep it alive
355 699
356 $wbuf .= pack "L/a*", &$f; 700 $wbuf .= pack "NN/a*", $id, &$f;
357 $ww ||= $fh && AE::io $fh, 1, $wcb; 701 $ww ||= $fh && AE::io $fh, 1, $wcb;
358 } 702 }
703 : sub {
704 push @rcb, pop;
705
706 $guard; # keep it alive
707
708 $wbuf .= pack "N/a*", &$f;
709 $ww ||= $fh && AE::io $fh, 1, $wcb;
710 }
359} 711}
360 712
361=item $rpc->(..., $cb->(...)) 713=item $rpc->(..., $cb->(...))
362 714
363The RPC object returned by C<AnyEvent::Fork::RPC::run> is actually a code 715The RPC object returned by C<AnyEvent::Fork::RPC::run> is actually a code
378 730
379The other thing that can be done with the RPC object is to destroy it. In 731The other thing that can be done with the RPC object is to destroy it. In
380this case, the child process will execute all remaining RPC calls, report 732this case, the child process will execute all remaining RPC calls, report
381their results, and then exit. 733their results, and then exit.
382 734
735See the examples section earlier in this document for some actual
736examples.
737
383=back 738=back
384 739
385=head1 CHILD PROCESS USAGE 740=head1 CHILD PROCESS USAGE
386 741
387The following function is not available in this module. They are only 742The following function is not available in this module. They are only
395 750
396Send an event to the parent. Events are a bit like RPC calls made by the 751Send an event to the parent. Events are a bit like RPC calls made by the
397child process to the parent, except that there is no notion of return 752child process to the parent, except that there is no notion of return
398values. 753values.
399 754
755See the examples section earlier in this document for some actual
756examples.
757
400=back 758=back
401 759
760=head2 PROCESS EXIT
761
762If and when the child process exits depends on the backend and
763configuration. Apart from explicit exits (e.g. by calling C<exit>) or
764runtime conditions (uncaught exceptions, signals etc.), the backends exit
765under these conditions:
766
767=over 4
768
769=item Synchronous Backend
770
771The synchronous backend is very simple: when the process waits for another
772request to arrive and the writing side (usually in the parent) is closed,
773it will exit normally, i.e. as if your main program reached the end of the
774file.
775
776That means that if your parent process exits, the RPC process will usually
777exit as well, either because it is idle anyway, or because it executes a
778request. In the latter case, you will likely get an error when the RPc
779process tries to send the results to the parent (because agruably, you
780shouldn't exit your parent while there are still outstanding requests).
781
782The process is usually quiescent when it happens, so it should rarely be a
783problem, and C<END> handlers can be used to clean up.
784
785=item Asynchronous Backend
786
787For the asynchronous backend, things are more complicated: Whenever it
788listens for another request by the parent, it might detect that the socket
789was closed (e.g. because the parent exited). It will sotp listening for
790new requests and instead try to write out any remaining data (if any) or
791simply check whether the socket can be written to. After this, the RPC
792process is effectively done - no new requests are incoming, no outstanding
793request data can be written back.
794
795Since chances are high that there are event watchers that the RPC server
796knows nothing about (why else would one use the async backend if not for
797the ability to register watchers?), the event loop would often happily
798continue.
799
800This is why the asynchronous backend explicitly calls C<CORE::exit> when
801it is done (under other circumstances, such as when there is an I/O error
802and there is outstanding data to write, it will log a fatal message via
803L<AnyEvent::Log>, also causing the program to exit).
804
805You can override this by specifying a function name to call via the C<done>
806parameter instead.
807
808=back
809
810=head1 ADVANCED TOPICS
811
812=head2 Choosing a backend
813
814So how do you decide which backend to use? Well, that's your problem to
815solve, but here are some thoughts on the matter:
816
817=over 4
818
819=item Synchronous
820
821The synchronous backend does not rely on any external modules (well,
822except L<common::sense>, which works around a bug in how perl's warning
823system works). This keeps the process very small, for example, on my
824system, an empty perl interpreter uses 1492kB RSS, which becomes 2020kB
825after C<use warnings; use strict> (for people who grew up with C64s around
826them this is probably shocking every single time they see it). The worker
827process in the first example in this document uses 1792kB.
828
829Since the calls are done synchronously, slow jobs will keep newer jobs
830from executing.
831
832The synchronous backend also has no overhead due to running an event loop
833- reading requests is therefore very efficient, while writing responses is
834less so, as every response results in a write syscall.
835
836If the parent process is busy and a bit slow reading responses, the child
837waits instead of processing further requests. This also limits the amount
838of memory needed for buffering, as never more than one response has to be
839buffered.
840
841The API in the child is simple - you just have to define a function that
842does something and returns something.
843
844It's hard to use modules or code that relies on an event loop, as the
845child cannot execute anything while it waits for more input.
846
847=item Asynchronous
848
849The asynchronous backend relies on L<AnyEvent>, which tries to be small,
850but still comes at a price: On my system, the worker from example 1a uses
8513420kB RSS (for L<AnyEvent>, which loads L<EV>, which needs L<XSLoader>
852which in turn loads a lot of other modules such as L<warnings>, L<strict>,
853L<vars>, L<Exporter>...).
854
855It batches requests and responses reasonably efficiently, doing only as
856few reads and writes as needed, but needs to poll for events via the event
857loop.
858
859Responses are queued when the parent process is busy. This means the child
860can continue to execute any queued requests. It also means that a child
861might queue a lot of responses in memory when it generates them and the
862parent process is slow accepting them.
863
864The API is not a straightforward RPC pattern - you have to call a
865"done" callback to pass return values and signal completion. Also, more
866importantly, the API starts jobs as fast as possible - when 1000 jobs
867are queued and the jobs are slow, they will all run concurrently. The
868child must implement some queueing/limiting mechanism if this causes
869problems. Alternatively, the parent could limit the amount of rpc calls
870that are outstanding.
871
872Blocking use of condvars is not supported.
873
874Using event-based modules such as L<IO::AIO>, L<Gtk2>, L<Tk> and so on is
875easy.
876
877=back
878
879=head2 Passing file descriptors
880
881Unlike L<AnyEvent::Fork>, this module has no in-built file handle or file
882descriptor passing abilities.
883
884The reason is that passing file descriptors is extraordinary tricky
885business, and conflicts with efficient batching of messages.
886
887There still is a method you can use: Create a
888C<AnyEvent::Util::portable_socketpair> and C<send_fh> one half of it to
889the process before you pass control to C<AnyEvent::Fork::RPC::run>.
890
891Whenever you want to pass a file descriptor, send an rpc request to the
892child process (so it expects the descriptor), then send it over the other
893half of the socketpair. The child should fetch the descriptor from the
894half it has passed earlier.
895
896Here is some (untested) pseudocode to that effect:
897
898 use AnyEvent::Util;
899 use AnyEvent::Fork;
900 use AnyEvent::Fork::RPC;
901 use IO::FDPass;
902
903 my ($s1, $s2) = AnyEvent::Util::portable_socketpair;
904
905 my $rpc = AnyEvent::Fork
906 ->new
907 ->send_fh ($s2)
908 ->require ("MyWorker")
909 ->AnyEvent::Fork::RPC::run ("MyWorker::run"
910 init => "MyWorker::init",
911 );
912
913 undef $s2; # no need to keep it around
914
915 # pass an fd
916 $rpc->("i'll send some fd now, please expect it!", my $cv = AE::cv);
917
918 IO::FDPass fileno $s1, fileno $handle_to_pass;
919
920 $cv->recv;
921
922The MyWorker module could look like this:
923
924 package MyWorker;
925
926 use IO::FDPass;
927
928 my $s2;
929
930 sub init {
931 $s2 = $_[0];
932 }
933
934 sub run {
935 if ($_[0] eq "i'll send some fd now, please expect it!") {
936 my $fd = IO::FDPass::recv fileno $s2;
937 ...
938 }
939 }
940
941Of course, this might be blocking if you pass a lot of file descriptors,
942so you might want to look into L<AnyEvent::FDpasser> which can handle the
943gory details.
944
945=head1 EXCEPTIONS
946
947There are no provisions whatsoever for catching exceptions at this time -
948in the child, exeptions might kill the process, causing calls to be lost
949and the parent encountering a fatal error. In the parent, exceptions in
950the result callback will not be caught and cause undefined behaviour.
951
402=head1 SEE ALSO 952=head1 SEE ALSO
403 953
404L<AnyEvent::Fork> (to create the processes in the first place), 954L<AnyEvent::Fork>, to create the processes in the first place.
955
956L<AnyEvent::Fork::Remote>, likewise, but helpful for remote processes.
957
405L<AnyEvent::Fork::Pool> (to manage whole pools of processes). 958L<AnyEvent::Fork::Pool>, to manage whole pools of processes.
406 959
407=head1 AUTHOR AND CONTACT INFORMATION 960=head1 AUTHOR AND CONTACT INFORMATION
408 961
409 Marc Lehmann <schmorp@schmorp.de> 962 Marc Lehmann <schmorp@schmorp.de>
410 http://software.schmorp.de/pkg/AnyEvent-Fork-RPC 963 http://software.schmorp.de/pkg/AnyEvent-Fork-RPC

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines