ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent-Fork-RPC/RPC.pm
(Generate patch)

Comparing AnyEvent-Fork-RPC/RPC.pm (file contents):
Revision 1.9 by root, Wed Apr 17 21:48:35 2013 UTC vs.
Revision 1.12 by root, Thu Apr 18 10:40:34 2013 UTC

39Loading this module also always loads L<AnyEvent::Fork>, so you can make a 39Loading this module also always loads L<AnyEvent::Fork>, so you can make a
40separate C<use AnyEvent::Fork> if you wish, but you don't have to. 40separate C<use AnyEvent::Fork> if you wish, but you don't have to.
41 41
42=head1 EXAMPLES 42=head1 EXAMPLES
43 43
44=head2 Synchronous Backend 44=head2 Example 1: Synchronous Backend
45 45
46Here is a simple example that implements a backend that executes C<unlink> 46Here is a simple example that implements a backend that executes C<unlink>
47and C<rmdir> calls, and reports their status back. It also reports the 47and C<rmdir> calls, and reports their status back. It also reports the
48number of requests it has processed every three requests, which is clearly 48number of requests it has processed every three requests, which is clearly
49silly, but illustrates the use of events. 49silly, but illustrates the use of events.
50 50
51First the parent process: 51First the parent process:
52 52
53 use AnyEvent; 53 use AnyEvent;
54 use AnyEvent::Fork;
55 use AnyEvent::Fork::RPC; 54 use AnyEvent::Fork::RPC;
56 55
57 my $done = AE::cv; 56 my $done = AE::cv;
58 57
59 my $rpc = AnyEvent::Fork 58 my $rpc = AnyEvent::Fork
136ensures that perl will correctly interpret calls to it. 135ensures that perl will correctly interpret calls to it.
137 136
138And as a final remark, there is a fine module on CPAN that can 137And as a final remark, there is a fine module on CPAN that can
139asynchronously C<rmdir> and C<unlink> and a lot more, and more efficiently 138asynchronously C<rmdir> and C<unlink> and a lot more, and more efficiently
140than this example, namely L<IO::AIO>. 139than this example, namely L<IO::AIO>.
140
141=head3 Example 1a: the same with the asynchronous backend
142
143This example only shows what needs to be changed to use the async backend
144instead. Doing this is not very useful, the purpose of this example is
145to show the minimum amount of change that is required to go from the
146synchronous to the asynchronous backend.
147
148To use the async backend in the previous example, you need to add the
149C<async> parameter to the C<AnyEvent::Fork::RPC::run> call:
150
151 ->AnyEvent::Fork::RPC::run ("MyWorker::run",
152 async => 1,
153 ...
154
155And since the function call protocol is now changed, you need to adopt
156C<MyWorker::run> to the async API.
157
158First, you need to accept the extra initial C<$done> callback:
159
160 sub run {
161 my ($done, $cmd, $path) = @_;
162
163And since a response is now generated when C<$done> is called, as opposed
164to when the function returns, we need to call the C<$done> function with
165the status:
166
167 $done->($status or (0, "$!"));
168
169A few remarks are in order. First, it's quite pointless to use the async
170backend for this example - but it I<is> possible. Second, you can call
171C<$done> before or after returning from the function. Third, having both
172returned from the function and having called the C<$done> callback, the
173child process may exit at any time, so you should call C<$done> only when
174you really I<are> done.
175
176=head2 Example 2: Asynchronous Backend
177
178This example implements multiple count-downs in the child, using
179L<AnyEvent> timers. While this is a bit silly (one could use timers in te
180parent just as well), it illustrates the ability to use AnyEvent in the
181child and the fact that responses can arrive in a different order then the
182requests.
183
184It also shows how to embed the actual child code into a C<__DATA__>
185section, so it doesn't need any external files at all.
186
187And when your parent process is often busy, and you have stricter timing
188requirements, then running timers in a child process suddenly doesn't look
189so silly anymore.
190
191Without further ado, here is the code:
192
193 use AnyEvent;
194 use AnyEvent::Fork::RPC;
195
196 my $done = AE::cv;
197
198 my $rpc = AnyEvent::Fork
199 ->new
200 ->require ("AnyEvent::Fork::RPC::Async")
201 ->eval (do { local $/; <DATA> })
202 ->AnyEvent::Fork::RPC::run ("run",
203 async => 1,
204 on_error => sub { warn "FATAL: $_[0]"; exit 1 },
205 on_event => sub { print $_[0] },
206 on_destroy => $done,
207 );
208
209 for my $count (3, 2, 1) {
210 $rpc->($count, sub {
211 warn "job $count finished\n";
212 });
213 }
214
215 undef $rpc;
216
217 $done->recv;
218
219 __DATA__
220
221 # this ends up in main, as we don't use a package declaration
222
223 use AnyEvent;
224
225 sub run {
226 my ($done, $count) = @_;
227
228 my $n;
229
230 AnyEvent::Fork::RPC::event "starting to count up to $count\n";
231
232 my $w; $w = AE::timer 1, 1, sub {
233 ++$n;
234
235 AnyEvent::Fork::RPC::event "count $n of $count\n";
236
237 if ($n == $count) {
238 undef $w;
239 $done->();
240 }
241 };
242 }
243
244The parent part (the one before the C<__DATA__> section) isn't very
245different from the earlier examples. It sets async mode, preloads
246the backend module (so the C<AnyEvent::Fork::RPC::event> function is
247declared), uses a slightly different C<on_event> handler (which we use
248simply for logging purposes) and then, instead of loading a module with
249the actual worker code, it C<eval>'s the code from the data section in the
250child process.
251
252It then starts three countdowns, from 3 to 1 seconds downwards, destroys
253the rpc object so the example finishes eventually, and then just waits for
254the stuff to trickle in.
255
256The worker code uses the event function to log some progress messages, but
257mostly just creates a recurring one-second timer.
258
259The timer callback increments a counter, logs a message, and eventually,
260when the count has been reached, calls the finish callback.
261
262On my system, this results in the following output. Since all timers fire
263at roughly the same time, the actual order isn't guaranteed, but the order
264shown is very likely what you would get, too.
265
266 starting to count up to 3
267 starting to count up to 2
268 starting to count up to 1
269 count 1 of 3
270 count 1 of 2
271 count 1 of 1
272 job 1 finished
273 count 2 of 2
274 job 2 finished
275 count 2 of 3
276 count 3 of 3
277 job 3 finished
278
279While the overall ordering isn't guaranteed, the async backend still
280guarantees that events and responses are delivered to the parent process
281in the exact same ordering as they were generated in the child process.
282
283And unless your system is I<very> busy, it should clearly show that the
284job started last will finish first, as it has the lowest count.
285
286This concludes the async example. Since L<AnyEvent::Fork> does not
287actually fork, you are free to use about any module in the child, not just
288L<AnyEvent>, but also L<IO::AIO>, or L<Tk> for example.
141 289
142=head1 PARENT PROCESS USAGE 290=head1 PARENT PROCESS USAGE
143 291
144This module exports nothing, and only implements a single function: 292This module exports nothing, and only implements a single function:
145 293
431See the examples section earlier in this document for some actual 579See the examples section earlier in this document for some actual
432examples. 580examples.
433 581
434=back 582=back
435 583
584=head1 ADVANCED TOPICS
585
586=head2 Choosing a backend
587
588So how do you decide which backend to use? Well, that's your problem to
589solve, but here are some thoughts on the matter:
590
591=over 4
592
593=item Synchronous
594
595The synchronous backend does not rely on any external modules (well,
596except L<common::sense>, which works around a bug in how perl's warning
597system works). This keeps the process very small, for example, on my
598system, an empty perl interpreter uses 1492kB RSS, which becomes 2020kB
599after C<use warnings; use strict> (for people who grew up with C64s around
600them this is probably shocking every single time they see it). The worker
601process in the first example in this document uses 1792kB.
602
603Since the calls are done synchronously, slow jobs will keep newer jobs
604from executing.
605
606The synchronous backend also has no overhead due to running an event loop
607- reading requests is therefore very efficient, while writing responses is
608less so, as every response results in a write syscall.
609
610If the parent process is busy and a bit slow reading responses, the child
611waits instead of processing further requests. This also limits the amount
612of memory needed for buffering, as never more than one response has to be
613buffered.
614
615The API in the child is simple - you just have to define a function that
616does something and returns something.
617
618It's hard to use modules or code that relies on an event loop, as the
619child cannot execute anything while it waits for more input.
620
621=item Asynchronous
622
623The asynchronous backend relies on L<AnyEvent>, which tries to be small,
624but still comes at a price: On my system, the worker from example 1a uses
6253420kB RSS (for L<AnyEvent>, which loads L<EV>, which needs L<XSLoader>
626which in turn loads a lot of other modules such as L<warnings>, L<strict>,
627L<vars>, L<Exporter>...).
628
629It batches requests and responses reasonably efficiently, doing only as
630few reads and writes as needed, but needs to poll for events via the event
631loop.
632
633Responses are queued when the parent process is busy. This means the child
634can continue to execute any queued requests. It also means that a child
635might queue a lot of responses in memory when it generates them and the
636parent process is slow accepting them.
637
638The API is not a straightforward RPC pattern - you have to call a
639"done" callback to pass return values and signal completion. Also, more
640importantly, the API starts jobs as fast as possible - when 1000 jobs
641are queued and the jobs are slow, they will all run concurrently. The
642child must implement some queueing/limiting mechanism if this causes
643problems. Alternatively, the parent could limit the amount of rpc calls
644that are outstanding.
645
646Using event-based modules such as L<IO::AIO>, L<Gtk2>, L<Tk> and so on is
647easy.
648
649=back
650
651=head2 Passing file descriptors
652
653Unlike L<AnyEvent::Fork>, this module has no in-built file handle or file
654descriptor passing abilities.
655
656The reason is that passing file descriptors is extraordinary tricky
657business, and conflicts with efficient batching of messages.
658
659There still is a method you can use: Create a
660C<AnyEvent::Util::portable_socketpair> and C<send_fh> one half of it to
661the process before you pass control to C<AnyEvent::Fork::RPC::run>.
662
663Whenever you want to pass a file descriptor, send an rpc request to the
664child process (so it expects the descriptor), then send it over the other
665half of the socketpair. The child should fetch the descriptor from the
666half it has passed earlier.
667
668Here is some (untested) pseudocode to that effect:
669
670 use AnyEvent::Util;
671 use AnyEvent::Fork::RPC;
672 use IO::FDPass;
673
674 my ($s1, $s2) = AnyEvent::Util::portable_socketpair;
675
676 my $rpc = AnyEvent::Fork
677 ->new
678 ->send_fh ($s2)
679 ->require ("MyWorker")
680 ->AnyEvent::Fork::RPC::run ("MyWorker::run"
681 init => "MyWorker::init",
682 );
683
684 undef $s2; # no need to keep it around
685
686 # pass an fd
687 $rpc->("i'll send some fd now, please expect it!", my $cv = AE::cv);
688
689 IO::FDPass fileno $s1, fileno $handle_to_pass;
690
691 $cv->recv;
692
693The MyWorker module could look like this:
694
695 package MyWorker;
696
697 use IO::FDPass;
698
699 my $s2;
700
701 sub init {
702 $s2 = $_[0];
703 }
704
705 sub run {
706 if ($_[0] eq "i'll send some fd now, please expect it!") {
707 my $fd = IO::FDPass::recv fileno $s2;
708 ...
709 }
710 }
711
712Of course, this might be blocking if you pass a lot of file descriptors,
713so you might want to look into L<AnyEvent::FDpasser> which can handle the
714gory details.
715
436=head1 SEE ALSO 716=head1 SEE ALSO
437 717
438L<AnyEvent::Fork> (to create the processes in the first place), 718L<AnyEvent::Fork> (to create the processes in the first place),
439L<AnyEvent::Fork::Pool> (to manage whole pools of processes). 719L<AnyEvent::Fork::Pool> (to manage whole pools of processes).
440 720

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines