1 | =head1 NAME |
1 | =head1 NAME |
2 | |
2 | |
3 | AnyEvent::Fork::Pool - simple process pool manager on top of AnyEvent::Fork |
3 | AnyEvent::Fork::Pool - simple process pool manager on top of AnyEvent::Fork |
4 | |
|
|
5 | THE API IS NOT FINISHED, CONSIDER THIS AN ALPHA RELEASE |
|
|
6 | |
4 | |
7 | =head1 SYNOPSIS |
5 | =head1 SYNOPSIS |
8 | |
6 | |
9 | use AnyEvent; |
7 | use AnyEvent; |
10 | use AnyEvent::Fork::Pool; |
8 | use AnyEvent::Fork::Pool; |
… | |
… | |
43 | |
41 | |
44 | $finish->recv; |
42 | $finish->recv; |
45 | |
43 | |
46 | =head1 DESCRIPTION |
44 | =head1 DESCRIPTION |
47 | |
45 | |
48 | This module uses processes created via L<AnyEvent::Fork> and the RPC |
46 | This module uses processes created via L<AnyEvent::Fork> (or |
49 | protocol implement in L<AnyEvent::Fork::RPC> to create a load-balanced |
47 | L<AnyEvent::Fork::Remote>) and the RPC protocol implement in |
50 | pool of processes that handles jobs. |
48 | L<AnyEvent::Fork::RPC> to create a load-balanced pool of processes that |
|
|
49 | handles jobs. |
51 | |
50 | |
52 | Understanding of L<AnyEvent::Fork> is helpful but not critical to be able |
51 | Understanding of L<AnyEvent::Fork> is helpful but not critical to be able |
53 | to use this module, but a thorough understanding of L<AnyEvent::Fork::RPC> |
52 | to use this module, but a thorough understanding of L<AnyEvent::Fork::RPC> |
54 | is, as it defines the actual API that needs to be implemented in the |
53 | is, as it defines the actual API that needs to be implemented in the |
55 | worker processes. |
54 | worker processes. |
56 | |
|
|
57 | =head1 EXAMPLES |
|
|
58 | |
55 | |
59 | =head1 PARENT USAGE |
56 | =head1 PARENT USAGE |
60 | |
57 | |
61 | To create a pool, you first have to create a L<AnyEvent::Fork> object - |
58 | To create a pool, you first have to create a L<AnyEvent::Fork> object - |
62 | this object becomes your template process. Whenever a new worker process |
59 | this object becomes your template process. Whenever a new worker process |
… | |
… | |
89 | |
86 | |
90 | use Guard (); |
87 | use Guard (); |
91 | use Array::Heap (); |
88 | use Array::Heap (); |
92 | |
89 | |
93 | use AnyEvent; |
90 | use AnyEvent; |
|
|
91 | # explicit version on next line, as some cpan-testers test with the 0.1 version, |
|
|
92 | # ignoring dependencies, and this line will at least give a clear indication of that. |
94 | use AnyEvent::Fork; # we don't actually depend on it, this is for convenience |
93 | use AnyEvent::Fork 0.6; # we don't actually depend on it, this is for convenience |
95 | use AnyEvent::Fork::RPC; |
94 | use AnyEvent::Fork::RPC; |
96 | |
95 | |
97 | # these are used for the first and last argument of events |
96 | # these are used for the first and last argument of events |
98 | # in the hope of not colliding. yes, I don't like it either, |
97 | # in the hope of not colliding. yes, I don't like it either, |
99 | # but didn't come up with an obviously better alternative. |
98 | # but didn't come up with an obviously better alternative. |
100 | my $magic0 = ':t6Z@HK1N%Dx@_7?=~-7NQgWDdAs6a,jFN=wLO0*jD*1%P'; |
99 | my $magic0 = ':t6Z@HK1N%Dx@_7?=~-7NQgWDdAs6a,jFN=wLO0*jD*1%P'; |
101 | my $magic1 = '<~53rexz.U`!]X[A235^"fyEoiTF\T~oH1l/N6+Djep9b~bI9`\1x%B~vWO1q*'; |
100 | my $magic1 = '<~53rexz.U`!]X[A235^"fyEoiTF\T~oH1l/N6+Djep9b~bI9`\1x%B~vWO1q*'; |
102 | |
101 | |
103 | our $VERSION = 0.1; |
102 | our $VERSION = 1.1; |
104 | |
103 | |
105 | =item my $pool = AnyEvent::Fork::Pool::run $fork, $function, [key => value...] |
104 | =item my $pool = AnyEvent::Fork::Pool::run $fork, $function, [key => value...] |
106 | |
105 | |
107 | The traditional way to call the pool creation function. But it is way |
106 | The traditional way to call the pool creation function. But it is way |
108 | cooler to call it in the following way: |
107 | cooler to call it in the following way: |
… | |
… | |
446 | to this function are effectively read-only - modifying them after the call |
445 | to this function are effectively read-only - modifying them after the call |
447 | and before the callback is invoked causes undefined behaviour. |
446 | and before the callback is invoked causes undefined behaviour. |
448 | |
447 | |
449 | =cut |
448 | =cut |
450 | |
449 | |
|
|
450 | =item $cpus = AnyEvent::Fork::Pool::ncpu [$default_cpus] |
|
|
451 | |
|
|
452 | =item ($cpus, $eus) = AnyEvent::Fork::Pool::ncpu [$default_cpus] |
|
|
453 | |
|
|
454 | Tries to detect the number of CPUs (C<$cpus> often called cpu cores |
|
|
455 | nowadays) and execution units (C<$eus>) which include e.g. extra |
|
|
456 | hyperthreaded units). When C<$cpus> cannot be determined reliably, |
|
|
457 | C<$default_cpus> is returned for both values, or C<1> if it is missing. |
|
|
458 | |
|
|
459 | For normal CPU bound uses, it is wise to have as many worker processes |
|
|
460 | as CPUs in the system (C<$cpus>), if nothing else uses the CPU. Using |
|
|
461 | hyperthreading is usually detrimental to performance, but in those rare |
|
|
462 | cases where that really helps it might be beneficial to use more workers |
|
|
463 | (C<$eus>). |
|
|
464 | |
|
|
465 | Currently, F</proc/cpuinfo> is parsed on GNU/Linux systems for both |
|
|
466 | C<$cpus> and C<$eu>, and on {Free,Net,Open}BSD, F<sysctl -n hw.ncpu> is |
|
|
467 | used for C<$cpus>. |
|
|
468 | |
|
|
469 | Example: create a worker pool with as many workers as cpu cores, or C<2>, |
|
|
470 | if the actual number could not be determined. |
|
|
471 | |
|
|
472 | $fork->AnyEvent::Fork::Pool::run ("myworker::function", |
|
|
473 | max => (scalar AnyEvent::Fork::Pool::ncpu 2), |
|
|
474 | ); |
|
|
475 | |
|
|
476 | =cut |
|
|
477 | |
|
|
478 | BEGIN { |
|
|
479 | if ($^O eq "linux") { |
|
|
480 | *ncpu = sub(;$) { |
|
|
481 | my ($cpus, $eus); |
|
|
482 | |
|
|
483 | if (open my $fh, "<", "/proc/cpuinfo") { |
|
|
484 | my %id; |
|
|
485 | |
|
|
486 | while (<$fh>) { |
|
|
487 | if (/^core id\s*:\s*(\d+)/) { |
|
|
488 | ++$eus; |
|
|
489 | undef $id{$1}; |
|
|
490 | } |
|
|
491 | } |
|
|
492 | |
|
|
493 | $cpus = scalar keys %id; |
|
|
494 | } else { |
|
|
495 | $cpus = $eus = @_ ? shift : 1; |
|
|
496 | } |
|
|
497 | wantarray ? ($cpus, $eus) : $cpus |
|
|
498 | }; |
|
|
499 | } elsif ($^O eq "freebsd" || $^O eq "netbsd" || $^O eq "openbsd") { |
|
|
500 | *ncpu = sub(;$) { |
|
|
501 | my $cpus = qx<sysctl -n hw.ncpu> * 1 |
|
|
502 | || (@_ ? shift : 1); |
|
|
503 | wantarray ? ($cpus, $cpus) : $cpus |
|
|
504 | }; |
|
|
505 | } else { |
|
|
506 | *ncpu = sub(;$) { |
|
|
507 | my $cpus = @_ ? shift : 1; |
|
|
508 | wantarray ? ($cpus, $cpus) : $cpus |
|
|
509 | }; |
|
|
510 | } |
|
|
511 | } |
|
|
512 | |
451 | =back |
513 | =back |
452 | |
514 | |
453 | =head1 CHILD USAGE |
515 | =head1 CHILD USAGE |
454 | |
516 | |
455 | In addition to the L<AnyEvent::Fork::RPC> API, this module implements one |
517 | In addition to the L<AnyEvent::Fork::RPC> API, this module implements one |
… | |
… | |
472 | deems this useful. For example, after executing a job, one could check |
534 | deems this useful. For example, after executing a job, one could check |
473 | the process size or the number of jobs handled so far, and if either is |
535 | the process size or the number of jobs handled so far, and if either is |
474 | too high, the worker could ask to get retired, to avoid memory leaks to |
536 | too high, the worker could ask to get retired, to avoid memory leaks to |
475 | accumulate. |
537 | accumulate. |
476 | |
538 | |
|
|
539 | Example: retire a worker after it has handled roughly 100 requests. |
|
|
540 | |
|
|
541 | my $count = 0; |
|
|
542 | |
|
|
543 | sub my::worker { |
|
|
544 | |
|
|
545 | ++$count == 100 |
|
|
546 | and AnyEvent::Fork::Pool::retire (); |
|
|
547 | |
|
|
548 | ... normal code goes here |
|
|
549 | } |
|
|
550 | |
477 | =back |
551 | =back |
478 | |
552 | |
479 | =head1 POOL PARAMETERS RECIPES |
553 | =head1 POOL PARAMETERS RECIPES |
480 | |
554 | |
481 | This section describes some recipes for pool paramaters. These are mostly |
555 | This section describes some recipes for pool paramaters. These are mostly |
… | |
… | |
529 | |
603 | |
530 | =head1 SEE ALSO |
604 | =head1 SEE ALSO |
531 | |
605 | |
532 | L<AnyEvent::Fork>, to create the processes in the first place. |
606 | L<AnyEvent::Fork>, to create the processes in the first place. |
533 | |
607 | |
|
|
608 | L<AnyEvent::Fork::Remote>, likewise, but helpful for remote processes. |
|
|
609 | |
534 | L<AnyEvent::Fork::RPC>, which implements the RPC protocol and API. |
610 | L<AnyEvent::Fork::RPC>, which implements the RPC protocol and API. |
535 | |
611 | |
536 | =head1 AUTHOR AND CONTACT INFORMATION |
612 | =head1 AUTHOR AND CONTACT INFORMATION |
537 | |
613 | |
538 | Marc Lehmann <schmorp@schmorp.de> |
614 | Marc Lehmann <schmorp@schmorp.de> |