… | |
… | |
964 | maximal/minimal, respectively. Even when going through AnyEvent, it uses |
964 | maximal/minimal, respectively. Even when going through AnyEvent, it uses |
965 | far less memory than any other event loop and is still faster than Event |
965 | far less memory than any other event loop and is still faster than Event |
966 | natively. |
966 | natively. |
967 | |
967 | |
968 | The pure perl implementation is hit in a few sweet spots (both the |
968 | The pure perl implementation is hit in a few sweet spots (both the |
969 | zero timeout and the use of a single fd hit optimisations in the perl |
969 | constant timeout and the use of a single fd hit optimisations in the perl |
970 | interpreter and the backend itself, and all watchers become ready at the |
970 | interpreter and the backend itself). Nevertheless this shows that it |
971 | same time). Nevertheless this shows that it adds very little overhead in |
971 | adds very little overhead in itself. Like any select-based backend its |
972 | itself. Like any select-based backend its performance becomes really bad |
972 | performance becomes really bad with lots of file descriptors (and few of |
973 | with lots of file descriptors (and few of them active), of course, but |
973 | them active), of course, but this was not subject of this benchmark. |
974 | this was not subject of this benchmark. |
|
|
975 | |
974 | |
976 | The C<Event> module has a relatively high setup and callback invocation cost, |
975 | The C<Event> module has a relatively high setup and callback invocation cost, |
977 | but overall scores on the third place. |
976 | but overall scores on the third place. |
978 | |
977 | |
979 | C<Glib>'s memory usage is quite a bit bit higher, but it features a |
978 | C<Glib>'s memory usage is quite a bit bit higher, but it features a |
… | |
… | |
987 | The C<Tk> adaptor works relatively well. The fact that it crashes with |
986 | The C<Tk> adaptor works relatively well. The fact that it crashes with |
988 | more than 2000 watchers is a big setback, however, as correctness takes |
987 | more than 2000 watchers is a big setback, however, as correctness takes |
989 | precedence over speed. Nevertheless, its performance is surprising, as the |
988 | precedence over speed. Nevertheless, its performance is surprising, as the |
990 | file descriptor is dup()ed for each watcher. This shows that the dup() |
989 | file descriptor is dup()ed for each watcher. This shows that the dup() |
991 | employed by some adaptors is not a big performance issue (it does incur a |
990 | employed by some adaptors is not a big performance issue (it does incur a |
992 | hidden memory cost inside the kernel, though, that is not reflected in the |
991 | hidden memory cost inside the kernel which is not reflected in the figures |
993 | figures above). |
992 | above). |
994 | |
993 | |
995 | C<POE>, regardless of underlying event loop (wether using its pure perl |
994 | C<POE>, regardless of underlying event loop (whether using its pure |
996 | select-based backend or the Event module) shows abysmal performance and |
995 | perl select-based backend or the Event module, the POE-EV backend |
|
|
996 | couldn't be tested because it wasn't working) shows abysmal performance |
997 | memory usage: Watchers use almost 30 times as much memory as EV watchers, |
997 | and memory usage: Watchers use almost 30 times as much memory as |
998 | and 10 times as much memory as both Event or EV via AnyEvent. Watcher |
998 | EV watchers, and 10 times as much memory as Event (the high memory |
|
|
999 | requirements are caused by requiring a session for each watcher). Watcher |
999 | invocation is almost 900 times slower than with AnyEvent's pure perl |
1000 | invocation speed is almost 900 times slower than with AnyEvent's pure perl |
1000 | implementation. The design of the POE adaptor class in AnyEvent can not |
1001 | implementation. The design of the POE adaptor class in AnyEvent can not |
1001 | really account for this, as session creation overhead is small compared |
1002 | really account for this, as session creation overhead is small compared |
1002 | to execution of the state machine, which is coded pretty optimally within |
1003 | to execution of the state machine, which is coded pretty optimally within |
1003 | L<AnyEvent::Impl::POE>. POE simply seems to be abysmally slow. |
1004 | L<AnyEvent::Impl::POE>. POE simply seems to be abysmally slow. |
1004 | |
1005 | |
1005 | =head2 Summary |
1006 | =head2 Summary |
1006 | |
1007 | |
|
|
1008 | =over 4 |
|
|
1009 | |
1007 | Using EV through AnyEvent is faster than any other event loop, but most |
1010 | =item * Using EV through AnyEvent is faster than any other event loop, but most |
1008 | event loops have acceptable performance with or without AnyEvent. |
1011 | event loops have acceptable performance with or without AnyEvent. |
1009 | |
1012 | |
1010 | The overhead AnyEvent adds is usually much smaller than the overhead of |
1013 | =item * The overhead AnyEvent adds is usually much smaller than the overhead of |
1011 | the actual event loop, only with extremely fast event loops such as the EV |
1014 | the actual event loop, only with extremely fast event loops such as the EV |
1012 | adds AnyEvent significant overhead. |
1015 | adds AnyEvent significant overhead. |
1013 | |
1016 | |
1014 | And you should simply avoid POE like the plague if you want performance or |
1017 | =item * You should simply avoid POE like the plague if you want performance or |
1015 | reasonable memory usage. |
1018 | reasonable memory usage. |
|
|
1019 | |
|
|
1020 | =back |
1016 | |
1021 | |
1017 | |
1022 | |
1018 | =head1 FORK |
1023 | =head1 FORK |
1019 | |
1024 | |
1020 | Most event libraries are not fork-safe. The ones who are usually are |
1025 | Most event libraries are not fork-safe. The ones who are usually are |