ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent/lib/AnyEvent.pm
(Generate patch)

Comparing AnyEvent/lib/AnyEvent.pm (file contents):
Revision 1.70 by root, Fri Apr 25 07:25:44 2008 UTC vs.
Revision 1.75 by root, Fri Apr 25 07:49:39 2008 UTC

864 864
865 865
866=head1 BENCHMARK 866=head1 BENCHMARK
867 867
868To give you an idea of the performance and overheads that AnyEvent adds 868To give you an idea of the performance and overheads that AnyEvent adds
869over the event loops themselves (and to give you an impression of the
869over the event loops directly, here is a benchmark of various supported 870speed of various event loops), here is a benchmark of various supported
870event models natively and with anyevent. The benchmark creates a lot of 871event models natively and with anyevent. The benchmark creates a lot of
871timers (with a zero timeout) and io watchers (watching STDOUT, a pty, to 872timers (with a zero timeout) and io watchers (watching STDOUT, a pty, to
872become writable, which it is), lets them fire exactly once and destroys 873become writable, which it is), lets them fire exactly once and destroys
873them again. 874them again.
874 875
893I<invoke> is the time, in microseconds, used to invoke a simple 894I<invoke> is the time, in microseconds, used to invoke a simple
894callback. The callback simply counts down a Perl variable and after it was 895callback. The callback simply counts down a Perl variable and after it was
895invoked "watcher" times, it would C<< ->broadcast >> a condvar once to 896invoked "watcher" times, it would C<< ->broadcast >> a condvar once to
896signal the end of this phase. 897signal the end of this phase.
897 898
898I<destroy> is the time, in microseconds, that it takes destroy a single 899I<destroy> is the time, in microseconds, that it takes to destroy a single
899watcher. 900watcher.
900 901
901=head2 Results 902=head2 Results
902 903
903 name watcher bytes create invoke destroy comment 904 name watchers bytes create invoke destroy comment
904 EV/EV 400000 244 0.56 0.46 0.31 EV native interface 905 EV/EV 400000 244 0.56 0.46 0.31 EV native interface
905 EV/Any 100000 610 3.52 0.91 0.75 EV + AnyEvent watchers 906 EV/Any 100000 610 3.52 0.91 0.75 EV + AnyEvent watchers
906 CoroEV/Any 100000 610 3.49 0.92 0.75 coroutines + Coro::Signal 907 CoroEV/Any 100000 610 3.49 0.92 0.75 coroutines + Coro::Signal
907 Perl/Any 16000 654 4.64 1.22 0.77 pure perl implementation 908 Perl/Any 16000 654 4.64 1.22 0.77 pure perl implementation
908 Event/Event 16000 523 28.05 21.38 0.86 Event native interface 909 Event/Event 16000 523 28.05 21.38 0.86 Event native interface
909 Event/Any 16000 943 34.43 20.48 1.39 Event + AnyEvent watchers 910 Event/Any 16000 943 34.43 20.48 1.39 Event + AnyEvent watchers
910 Glib/Any 16000 1357 96.99 12.55 55.51 quadratic behaviour 911 Glib/Any 16000 1357 96.99 12.55 55.51 quadratic behaviour
911 Tk/Any 2000 1855 27.01 66.61 14.03 SEGV with >> 2000 watchers 912 Tk/Any 2000 1855 27.01 66.61 14.03 SEGV with >> 2000 watchers
912 POE/Event 2000 6644 108.15 768.19 14.33 via POE::Loop::Event 913 POE/Event 2000 6644 108.15 768.19 14.33 via POE::Loop::Event
913 POE/Select 2000 6343 94.69 807.65 562.69 via POE::Loop::Select 914 POE/Select 2000 6343 94.69 807.65 562.69 via POE::Loop::Select
914 915
915=head2 Discussion 916=head2 Discussion
916 917
917The benchmark does I<not> measure scalability of the event loop very 918The benchmark does I<not> measure scalability of the event loop very
918well. For example, a select-based event loop (such as the pure perl one) 919well. For example, a select-based event loop (such as the pure perl one)
934but this was not subjetc of this benchmark. 935but this was not subjetc of this benchmark.
935 936
936The C<Event> module has a relatively high setup and callback invocation cost, 937The C<Event> module has a relatively high setup and callback invocation cost,
937but overall scores on the third place. 938but overall scores on the third place.
938 939
939C<Glib>'s memory usage is quite a bit bit higher, features a faster 940C<Glib>'s memory usage is quite a bit bit higher, but it features a
940callback invocation and overall lands in the same class as C<Event>. 941faster callback invocation and overall ends up in the same class as
942C<Event>. However, Glib scales extremely badly, doubling the number of
943watchers increases the processing time by more than a factor of four,
944making it completely unusable when using larger numbers of watchers
945(note that only a single file descriptor was used in the benchmark, so
946inefficiencies of C<poll> do not account for this).
941 947
942The C<Tk> adaptor works relatively well, the fact that it crashes with 948The C<Tk> adaptor works relatively well. The fact that it crashes with
943more than 2000 watchers is a big setback, however, as correctness takes 949more than 2000 watchers is a big setback, however, as correctness takes
944precedence over speed. Nevertheless, its performance is surprising, as the 950precedence over speed. Nevertheless, its performance is surprising, as the
945file descriptor is dup()ed for each watcher. This shows that the dup() 951file descriptor is dup()ed for each watcher. This shows that the dup()
946employed by some adaptors is not a big performance issue (it does incur a 952employed by some adaptors is not a big performance issue (it does incur a
947hidden memory cost inside the kernel, though). 953hidden memory cost inside the kernel, though, that is not reflected in the
954figures above).
948 955
949C<POE>, regardless of backend (wether using its pure perl select-based 956C<POE>, regardless of underlying event loop (wether using its pure perl
950backend or the Event backend) shows abysmal performance and memory 957select-based backend or the Event module) shows abysmal performance and
951usage: Watchers use almost 30 times as much memory as EV watchers, and 10 958memory usage: Watchers use almost 30 times as much memory as EV watchers,
952times as much memory as both Event or EV via AnyEvent. Watcher invocation 959and 10 times as much memory as both Event or EV via AnyEvent. Watcher
953is almost 700 times slower as with AnyEvent's pure perl implementation. 960invocation is almost 700 times slower than with AnyEvent's pure perl
961implementation. The design of the POE adaptor class in AnyEvent can not
962really account for this, as session creation overhead is small compared
963to execution of the state machine, which is coded pretty optimally within
964L<AnyEvent::Impl::POE>. POE simply seems to be abysmally slow.
954 965
966=head2 Summary
967
955Summary: using EV through AnyEvent is faster than any other event 968Using EV through AnyEvent is faster than any other event loop, but most
956loop. The overhead AnyEvent adds can be very small, and you should avoid 969event loops have acceptable performance with or without AnyEvent.
957POE like the plague if you want performance or reasonable memory usage. 970
971The overhead AnyEvent adds is usually much smaller than the overhead of
972the actual event loop, only with extremely fast event loops such as the EV
973adds AnyEvent significant overhead.
974
975And you should simply avoid POE like the plague if you want performance or
976reasonable memory usage.
958 977
959 978
960=head1 FORK 979=head1 FORK
961 980
962Most event libraries are not fork-safe. The ones who are usually are 981Most event libraries are not fork-safe. The ones who are usually are

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines