ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent/lib/AnyEvent/Handle.pm
(Generate patch)

Comparing AnyEvent/lib/AnyEvent/Handle.pm (file contents):
Revision 1.202 by root, Sat Oct 16 02:01:54 2010 UTC vs.
Revision 1.218 by root, Thu Feb 24 12:04:20 2011 UTC

75 } 75 }
76 76
77 \&$func 77 \&$func
78} 78}
79 79
80sub MAX_READ_SIZE() { 131072 }
81
80=head1 METHODS 82=head1 METHODS
81 83
82=over 4 84=over 4
83 85
84=item $handle = B<new> AnyEvent::Handle fh => $filehandle, key => value... 86=item $handle = B<new> AnyEvent::Handle fh => $filehandle, key => value...
112=over 4 114=over 4
113 115
114=item on_prepare => $cb->($handle) 116=item on_prepare => $cb->($handle)
115 117
116This (rarely used) callback is called before a new connection is 118This (rarely used) callback is called before a new connection is
117attempted, but after the file handle has been created. It could be used to 119attempted, but after the file handle has been created (you can access that
120file handle via C<< $handle->{fh} >>). It could be used to prepare the
118prepare the file handle with parameters required for the actual connect 121file handle with parameters required for the actual connect (as opposed to
119(as opposed to settings that can be changed when the connection is already 122settings that can be changed when the connection is already established).
120established).
121 123
122The return value of this callback should be the connect timeout value in 124The return value of this callback should be the connect timeout value in
123seconds (or C<0>, or C<undef>, or the empty list, to indicate that the 125seconds (or C<0>, or C<undef>, or the empty list, to indicate that the
124default timeout is to be used). 126default timeout is to be used).
125 127
157 159
158Some errors are fatal (which is indicated by C<$fatal> being true). On 160Some errors are fatal (which is indicated by C<$fatal> being true). On
159fatal errors the handle object will be destroyed (by a call to C<< -> 161fatal errors the handle object will be destroyed (by a call to C<< ->
160destroy >>) after invoking the error callback (which means you are free to 162destroy >>) after invoking the error callback (which means you are free to
161examine the handle object). Examples of fatal errors are an EOF condition 163examine the handle object). Examples of fatal errors are an EOF condition
162with active (but unsatisifable) read watchers (C<EPIPE>) or I/O errors. In 164with active (but unsatisfiable) read watchers (C<EPIPE>) or I/O errors. In
163cases where the other side can close the connection at will, it is 165cases where the other side can close the connection at will, it is
164often easiest to not report C<EPIPE> errors in this callback. 166often easiest to not report C<EPIPE> errors in this callback.
165 167
166AnyEvent::Handle tries to find an appropriate error code for you to check 168AnyEvent::Handle tries to find an appropriate error code for you to check
167against, but in some cases (TLS errors), this does not work well. It is 169against, but in some cases (TLS errors), this does not work well. It is
245many seconds pass without a successful read or write on the underlying 247many seconds pass without a successful read or write on the underlying
246file handle (or a call to C<timeout_reset>), the C<on_timeout> callback 248file handle (or a call to C<timeout_reset>), the C<on_timeout> callback
247will be invoked (and if that one is missing, a non-fatal C<ETIMEDOUT> 249will be invoked (and if that one is missing, a non-fatal C<ETIMEDOUT>
248error will be raised). 250error will be raised).
249 251
250There are three variants of the timeouts that work independently 252There are three variants of the timeouts that work independently of each
251of each other, for both read and write, just read, and just write: 253other, for both read and write (triggered when nothing was read I<OR>
254written), just read (triggered when nothing was read), and just write:
252C<timeout>, C<rtimeout> and C<wtimeout>, with corresponding callbacks 255C<timeout>, C<rtimeout> and C<wtimeout>, with corresponding callbacks
253C<on_timeout>, C<on_rtimeout> and C<on_wtimeout>, and reset functions 256C<on_timeout>, C<on_rtimeout> and C<on_wtimeout>, and reset functions
254C<timeout_reset>, C<rtimeout_reset>, and C<wtimeout_reset>. 257C<timeout_reset>, C<rtimeout_reset>, and C<wtimeout_reset>.
255 258
256Note that timeout processing is active even when you do not have 259Note that timeout processing is active even when you do not have any
257any outstanding read or write requests: If you plan to keep the connection 260outstanding read or write requests: If you plan to keep the connection
258idle then you should disable the timeout temporarily or ignore the timeout 261idle then you should disable the timeout temporarily or ignore the
259in the C<on_timeout> callback, in which case AnyEvent::Handle will simply 262timeout in the corresponding C<on_timeout> callback, in which case
260restart the timeout. 263AnyEvent::Handle will simply restart the timeout.
261 264
262Zero (the default) disables this timeout. 265Zero (the default) disables the corresponding timeout.
263 266
264=item on_timeout => $cb->($handle) 267=item on_timeout => $cb->($handle)
268
269=item on_rtimeout => $cb->($handle)
270
271=item on_wtimeout => $cb->($handle)
265 272
266Called whenever the inactivity timeout passes. If you return from this 273Called whenever the inactivity timeout passes. If you return from this
267callback, then the timeout will be reset as if some activity had happened, 274callback, then the timeout will be reset as if some activity had happened,
268so this condition is not fatal in any way. 275so this condition is not fatal in any way.
269 276
276For example, a server accepting connections from untrusted sources should 283For example, a server accepting connections from untrusted sources should
277be configured to accept only so-and-so much data that it cannot act on 284be configured to accept only so-and-so much data that it cannot act on
278(for example, when expecting a line, an attacker could send an unlimited 285(for example, when expecting a line, an attacker could send an unlimited
279amount of data without a callback ever being called as long as the line 286amount of data without a callback ever being called as long as the line
280isn't finished). 287isn't finished).
288
289=item wbuf_max => <bytes>
290
291If defined, then a fatal error will be raised (with C<$!> set to C<ENOSPC>)
292when the write buffer ever (strictly) exceeds this size. This is useful to
293avoid some forms of denial-of-service attacks.
294
295Although the units of this parameter is bytes, this is the I<raw> number
296of bytes not yet accepted by the kernel. This can make a difference when
297you e.g. use TLS, as TLS typically makes your write data larger (but it
298can also make it smaller due to compression).
299
300As an example of when this limit is useful, take a chat server that sends
301chat messages to a client. If the client does not read those in a timely
302manner then the send buffer in the server would grow unbounded.
281 303
282=item autocork => <boolean> 304=item autocork => <boolean>
283 305
284When disabled (the default), C<push_write> will try to immediately 306When disabled (the default), C<push_write> will try to immediately
285write the data to the handle if possible. This avoids having to register 307write the data to the handle if possible. This avoids having to register
337already have occured on BSD systems), but at least it will protect you 359already have occured on BSD systems), but at least it will protect you
338from most attacks. 360from most attacks.
339 361
340=item read_size => <bytes> 362=item read_size => <bytes>
341 363
342The default read block size (the number of bytes this module will 364The initial read block size, the number of bytes this module will try to
343try to read during each loop iteration, which affects memory 365read during each loop iteration. Each handle object will consume at least
344requirements). Default: C<8192>. 366this amount of memory for the read buffer as well, so when handling many
367connections requirements). See also C<max_read_size>. Default: C<2048>.
368
369=item max_read_size => <bytes>
370
371The maximum read buffer size used by the dynamic adjustment
372algorithm: Each time AnyEvent::Handle can read C<read_size> bytes in
373one go it will double C<read_size> up to the maximum given by this
374option. Default: C<131072> or C<read_size>, whichever is higher.
345 375
346=item low_water_mark => <bytes> 376=item low_water_mark => <bytes>
347 377
348Sets the number of bytes (default: C<0>) that make up an "empty" write 378Sets the number of bytes (default: C<0>) that make up an "empty" write
349buffer: If the buffer reaches this size or gets even samller it is 379buffer: If the buffer reaches this size or gets even samller it is
412Use the C<< ->starttls >> method if you need to start TLS negotiation later. 442Use the C<< ->starttls >> method if you need to start TLS negotiation later.
413 443
414=item tls_ctx => $anyevent_tls 444=item tls_ctx => $anyevent_tls
415 445
416Use the given C<AnyEvent::TLS> object to create the new TLS connection 446Use the given C<AnyEvent::TLS> object to create the new TLS connection
417(unless a connection object was specified directly). If this parameter is 447(unless a connection object was specified directly). If this
418missing, then AnyEvent::Handle will use C<AnyEvent::Handle::TLS_CTX>. 448parameter is missing (or C<undef>), then AnyEvent::Handle will use
449C<AnyEvent::Handle::TLS_CTX>.
419 450
420Instead of an object, you can also specify a hash reference with C<< key 451Instead of an object, you can also specify a hash reference with C<< key
421=> value >> pairs. Those will be passed to L<AnyEvent::TLS> to create a 452=> value >> pairs. Those will be passed to L<AnyEvent::TLS> to create a
422new TLS context object. 453new TLS context object.
423 454
492 $self->{connect}[0], 523 $self->{connect}[0],
493 $self->{connect}[1], 524 $self->{connect}[1],
494 sub { 525 sub {
495 my ($fh, $host, $port, $retry) = @_; 526 my ($fh, $host, $port, $retry) = @_;
496 527
528 delete $self->{_connect}; # no longer needed
529
497 if ($fh) { 530 if ($fh) {
498 $self->{fh} = $fh; 531 $self->{fh} = $fh;
499 532
500 delete $self->{_skip_drain_rbuf}; 533 delete $self->{_skip_drain_rbuf};
501 $self->_start; 534 $self->_start;
508 }); 541 });
509 542
510 } else { 543 } else {
511 if ($self->{on_connect_error}) { 544 if ($self->{on_connect_error}) {
512 $self->{on_connect_error}($self, "$!"); 545 $self->{on_connect_error}($self, "$!");
513 $self->destroy; 546 $self->destroy if $self;
514 } else { 547 } else {
515 $self->_error ($!, 1); 548 $self->_error ($!, 1);
516 } 549 }
517 } 550 }
518 }, 551 },
519 sub { 552 sub {
520 local $self->{fh} = $_[0]; 553 local $self->{fh} = $_[0];
521 554
522 $self->{on_prepare} 555 $self->{on_prepare}
523 ? $self->{on_prepare}->($self) 556 ? $self->{on_prepare}->($self)
524 : () 557 : ()
525 } 558 }
526 ); 559 );
527 } 560 }
528 561
545 AnyEvent::Util::fh_nonblocking $self->{fh}, 1; 578 AnyEvent::Util::fh_nonblocking $self->{fh}, 1;
546 579
547 $self->{_activity} = 580 $self->{_activity} =
548 $self->{_ractivity} = 581 $self->{_ractivity} =
549 $self->{_wactivity} = AE::now; 582 $self->{_wactivity} = AE::now;
583
584 $self->{read_size} ||= 2048;
585 $self->{max_read_size} = $self->{read_size}
586 if $self->{read_size} > ($self->{max_read_size} || MAX_READ_SIZE);
550 587
551 $self->timeout (delete $self->{timeout} ) if $self->{timeout}; 588 $self->timeout (delete $self->{timeout} ) if $self->{timeout};
552 $self->rtimeout (delete $self->{rtimeout} ) if $self->{rtimeout}; 589 $self->rtimeout (delete $self->{rtimeout} ) if $self->{rtimeout};
553 $self->wtimeout (delete $self->{wtimeout} ) if $self->{wtimeout}; 590 $self->wtimeout (delete $self->{wtimeout} ) if $self->{wtimeout};
554 591
723 760
724=item $handle->rbuf_max ($max_octets) 761=item $handle->rbuf_max ($max_octets)
725 762
726Configures the C<rbuf_max> setting (C<undef> disables it). 763Configures the C<rbuf_max> setting (C<undef> disables it).
727 764
765=item $handle->wbuf_max ($max_octets)
766
767Configures the C<wbuf_max> setting (C<undef> disables it).
768
728=cut 769=cut
729 770
730sub rbuf_max { 771sub rbuf_max {
731 $_[0]{rbuf_max} = $_[1]; 772 $_[0]{rbuf_max} = $_[1];
732} 773}
733 774
775sub wbuf_max {
776 $_[0]{wbuf_max} = $_[1];
777}
778
734############################################################################# 779#############################################################################
735 780
736=item $handle->timeout ($seconds) 781=item $handle->timeout ($seconds)
737 782
738=item $handle->rtimeout ($seconds) 783=item $handle->rtimeout ($seconds)
739 784
740=item $handle->wtimeout ($seconds) 785=item $handle->wtimeout ($seconds)
741 786
742Configures (or disables) the inactivity timeout. 787Configures (or disables) the inactivity timeout.
788
789The timeout will be checked instantly, so this method might destroy the
790handle before it returns.
743 791
744=item $handle->timeout_reset 792=item $handle->timeout_reset
745 793
746=item $handle->rtimeout_reset 794=item $handle->rtimeout_reset
747 795
856 if $cb && $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf}); 904 if $cb && $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf});
857} 905}
858 906
859=item $handle->push_write ($data) 907=item $handle->push_write ($data)
860 908
861Queues the given scalar to be written. You can push as much data as you 909Queues the given scalar to be written. You can push as much data as
862want (only limited by the available memory), as C<AnyEvent::Handle> 910you want (only limited by the available memory and C<wbuf_max>), as
863buffers it independently of the kernel. 911C<AnyEvent::Handle> buffers it independently of the kernel.
864 912
865This method may invoke callbacks (and therefore the handle might be 913This method may invoke callbacks (and therefore the handle might be
866destroyed after it returns). 914destroyed after it returns).
867 915
868=cut 916=cut
896 $cb->() unless $self->{autocork}; 944 $cb->() unless $self->{autocork};
897 945
898 # if still data left in wbuf, we need to poll 946 # if still data left in wbuf, we need to poll
899 $self->{_ww} = AE::io $self->{fh}, 1, $cb 947 $self->{_ww} = AE::io $self->{fh}, 1, $cb
900 if length $self->{wbuf}; 948 if length $self->{wbuf};
949
950 if (
951 defined $self->{wbuf_max}
952 && $self->{wbuf_max} < length $self->{wbuf}
953 ) {
954 $self->_error (Errno::ENOSPC, 1), return;
955 }
901 }; 956 };
902} 957}
903 958
904our %WH; 959our %WH;
905 960
1040before it was actually written. One way to do that is to replace your 1095before it was actually written. One way to do that is to replace your
1041C<on_drain> handler by a callback that shuts down the socket (and set 1096C<on_drain> handler by a callback that shuts down the socket (and set
1042C<low_water_mark> to C<0>). This method is a shorthand for just that, and 1097C<low_water_mark> to C<0>). This method is a shorthand for just that, and
1043replaces the C<on_drain> callback with: 1098replaces the C<on_drain> callback with:
1044 1099
1045 sub { shutdown $_[0]{fh}, 1 } # for push_shutdown 1100 sub { shutdown $_[0]{fh}, 1 }
1046 1101
1047This simply shuts down the write side and signals an EOF condition to the 1102This simply shuts down the write side and signals an EOF condition to the
1048the peer. 1103the peer.
1049 1104
1050You can rely on the normal read queue and C<on_eof> handling 1105You can rely on the normal read queue and C<on_eof> handling
1726Note that AnyEvent::Handle will automatically C<start_read> for you when 1781Note that AnyEvent::Handle will automatically C<start_read> for you when
1727you change the C<on_read> callback or push/unshift a read callback, and it 1782you change the C<on_read> callback or push/unshift a read callback, and it
1728will automatically C<stop_read> for you when neither C<on_read> is set nor 1783will automatically C<stop_read> for you when neither C<on_read> is set nor
1729there are any read requests in the queue. 1784there are any read requests in the queue.
1730 1785
1731These methods will have no effect when in TLS mode (as TLS doesn't support 1786In older versions of this module (<= 5.3), these methods had no effect,
1732half-duplex connections). 1787as TLS does not support half-duplex connections. In current versions they
1788work as expected, as this behaviour is required to avoid certain resource
1789attacks, where the program would be forced to read (and buffer) arbitrary
1790amounts of data before being able to send some data. The drawback is that
1791some readings of the the SSL/TLS specifications basically require this
1792attack to be working, as SSL/TLS implementations might stall sending data
1793during a rehandshake.
1794
1795As a guideline, during the initial handshake, you should not stop reading,
1796and as a client, it might cause problems, depending on your applciation.
1733 1797
1734=cut 1798=cut
1735 1799
1736sub stop_read { 1800sub stop_read {
1737 my ($self) = @_; 1801 my ($self) = @_;
1738 1802
1739 delete $self->{_rw} unless $self->{tls}; 1803 delete $self->{_rw};
1740} 1804}
1741 1805
1742sub start_read { 1806sub start_read {
1743 my ($self) = @_; 1807 my ($self) = @_;
1744 1808
1745 unless ($self->{_rw} || $self->{_eof} || !$self->{fh}) { 1809 unless ($self->{_rw} || $self->{_eof} || !$self->{fh}) {
1746 Scalar::Util::weaken $self; 1810 Scalar::Util::weaken $self;
1747 1811
1748 $self->{_rw} = AE::io $self->{fh}, 0, sub { 1812 $self->{_rw} = AE::io $self->{fh}, 0, sub {
1749 my $rbuf = \($self->{tls} ? my $buf : $self->{rbuf}); 1813 my $rbuf = \($self->{tls} ? my $buf : $self->{rbuf});
1750 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size} || 8192, length $$rbuf; 1814 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size}, length $$rbuf;
1751 1815
1752 if ($len > 0) { 1816 if ($len > 0) {
1753 $self->{_activity} = $self->{_ractivity} = AE::now; 1817 $self->{_activity} = $self->{_ractivity} = AE::now;
1754 1818
1755 if ($self->{tls}) { 1819 if ($self->{tls}) {
1756 Net::SSLeay::BIO_write ($self->{_rbio}, $$rbuf); 1820 Net::SSLeay::BIO_write ($self->{_rbio}, $$rbuf);
1757 1821
1758 &_dotls ($self); 1822 &_dotls ($self);
1759 } else { 1823 } else {
1760 $self->_drain_rbuf; 1824 $self->_drain_rbuf;
1825 }
1826
1827 if ($len == $self->{read_size}) {
1828 $self->{read_size} *= 2;
1829 $self->{read_size} = $self->{max_read_size} || MAX_READ_SIZE
1830 if $self->{read_size} > ($self->{max_read_size} || MAX_READ_SIZE);
1761 } 1831 }
1762 1832
1763 } elsif (defined $len) { 1833 } elsif (defined $len) {
1764 delete $self->{_rw}; 1834 delete $self->{_rw};
1765 $self->{_eof} = 1; 1835 $self->{_eof} = 1;
1988 if $self->{tls} > 0; 2058 if $self->{tls} > 0;
1989 2059
1990 delete @$self{qw(_rbio _wbio _tls_wbuf _on_starttls)}; 2060 delete @$self{qw(_rbio _wbio _tls_wbuf _on_starttls)};
1991} 2061}
1992 2062
2063=item $handle->resettls
2064
2065This rarely-used method simply resets and TLS state on the handle, usually
2066causing data loss.
2067
2068One case where it may be useful is when you want to skip over the data in
2069the stream but you are not interested in interpreting it, so data loss is
2070no concern.
2071
2072=cut
2073
2074*resettls = \&_freetls;
2075
1993sub DESTROY { 2076sub DESTROY {
1994 my ($self) = @_; 2077 my ($self) = @_;
1995 2078
1996 &_freetls; 2079 &_freetls;
1997 2080
2112 2195
2113It is only safe to "forget" the reference inside EOF or error callbacks, 2196It is only safe to "forget" the reference inside EOF or error callbacks,
2114from within all other callbacks, you need to explicitly call the C<< 2197from within all other callbacks, you need to explicitly call the C<<
2115->destroy >> method. 2198->destroy >> method.
2116 2199
2200=item Why is my C<on_eof> callback never called?
2201
2202Probably because your C<on_error> callback is being called instead: When
2203you have outstanding requests in your read queue, then an EOF is
2204considered an error as you clearly expected some data.
2205
2206To avoid this, make sure you have an empty read queue whenever your handle
2207is supposed to be "idle" (i.e. connection closes are O.K.). You cna set
2208an C<on_read> handler that simply pushes the first read requests in the
2209queue.
2210
2211See also the next question, which explains this in a bit more detail.
2212
2213=item How can I serve requests in a loop?
2214
2215Most protocols consist of some setup phase (authentication for example)
2216followed by a request handling phase, where the server waits for requests
2217and handles them, in a loop.
2218
2219There are two important variants: The first (traditional, better) variant
2220handles requests until the server gets some QUIT command, causing it to
2221close the connection first (highly desirable for a busy TCP server). A
2222client dropping the connection is an error, which means this variant can
2223detect an unexpected detection close.
2224
2225To handle this case, always make sure you have a on-empty read queue, by
2226pushing the "read request start" handler on it:
2227
2228 # we assume a request starts with a single line
2229 my @start_request; @start_request = (line => sub {
2230 my ($hdl, $line) = @_;
2231
2232 ... handle request
2233
2234 # push next request read, possibly from a nested callback
2235 $hdl->push_read (@start_request);
2236 });
2237
2238 # auth done, now go into request handling loop
2239 # now push the first @start_request
2240 $hdl->push_read (@start_request);
2241
2242By always having an outstanding C<push_read>, the handle always expects
2243some data and raises the C<EPIPE> error when the connction is dropped
2244unexpectedly.
2245
2246The second variant is a protocol where the client can drop the connection
2247at any time. For TCP, this means that the server machine may run out of
2248sockets easier, and in general, it means you cnanot distinguish a protocl
2249failure/client crash from a normal connection close. Nevertheless, these
2250kinds of protocols are common (and sometimes even the best solution to the
2251problem).
2252
2253Having an outstanding read request at all times is possible if you ignore
2254C<EPIPE> errors, but this doesn't help with when the client drops the
2255connection during a request, which would still be an error.
2256
2257A better solution is to push the initial request read in an C<on_read>
2258callback. This avoids an error, as when the server doesn't expect data
2259(i.e. is idly waiting for the next request, an EOF will not raise an
2260error, but simply result in an C<on_eof> callback. It is also a bit slower
2261and simpler:
2262
2263 # auth done, now go into request handling loop
2264 $hdl->on_read (sub {
2265 my ($hdl) = @_;
2266
2267 # called each time we receive data but the read queue is empty
2268 # simply start read the request
2269
2270 $hdl->push_read (line => sub {
2271 my ($hdl, $line) = @_;
2272
2273 ... handle request
2274
2275 # do nothing special when the request has been handled, just
2276 # let the request queue go empty.
2277 });
2278 });
2279
2117=item I get different callback invocations in TLS mode/Why can't I pause 2280=item I get different callback invocations in TLS mode/Why can't I pause
2118reading? 2281reading?
2119 2282
2120Unlike, say, TCP, TLS connections do not consist of two independent 2283Unlike, say, TCP, TLS connections do not consist of two independent
2121communication channels, one for each direction. Or put differently, the 2284communication channels, one for each direction. Or put differently, the

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines