ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent/lib/AnyEvent/Handle.pm
(Generate patch)

Comparing AnyEvent/lib/AnyEvent/Handle.pm (file contents):
Revision 1.201 by root, Wed Oct 13 01:15:57 2010 UTC vs.
Revision 1.216 by root, Sun Jan 23 11:15:09 2011 UTC

75 } 75 }
76 76
77 \&$func 77 \&$func
78} 78}
79 79
80sub MAX_READ_SIZE() { 131072 }
81
80=head1 METHODS 82=head1 METHODS
81 83
82=over 4 84=over 4
83 85
84=item $handle = B<new> AnyEvent::Handle fh => $filehandle, key => value... 86=item $handle = B<new> AnyEvent::Handle fh => $filehandle, key => value...
112=over 4 114=over 4
113 115
114=item on_prepare => $cb->($handle) 116=item on_prepare => $cb->($handle)
115 117
116This (rarely used) callback is called before a new connection is 118This (rarely used) callback is called before a new connection is
117attempted, but after the file handle has been created. It could be used to 119attempted, but after the file handle has been created (you can access that
120file handle via C<< $handle->{fh} >>). It could be used to prepare the
118prepare the file handle with parameters required for the actual connect 121file handle with parameters required for the actual connect (as opposed to
119(as opposed to settings that can be changed when the connection is already 122settings that can be changed when the connection is already established).
120established).
121 123
122The return value of this callback should be the connect timeout value in 124The return value of this callback should be the connect timeout value in
123seconds (or C<0>, or C<undef>, or the empty list, to indicate that the 125seconds (or C<0>, or C<undef>, or the empty list, to indicate that the
124default timeout is to be used). 126default timeout is to be used).
125 127
157 159
158Some errors are fatal (which is indicated by C<$fatal> being true). On 160Some errors are fatal (which is indicated by C<$fatal> being true). On
159fatal errors the handle object will be destroyed (by a call to C<< -> 161fatal errors the handle object will be destroyed (by a call to C<< ->
160destroy >>) after invoking the error callback (which means you are free to 162destroy >>) after invoking the error callback (which means you are free to
161examine the handle object). Examples of fatal errors are an EOF condition 163examine the handle object). Examples of fatal errors are an EOF condition
162with active (but unsatisifable) read watchers (C<EPIPE>) or I/O errors. In 164with active (but unsatisfiable) read watchers (C<EPIPE>) or I/O errors. In
163cases where the other side can close the connection at will, it is 165cases where the other side can close the connection at will, it is
164often easiest to not report C<EPIPE> errors in this callback. 166often easiest to not report C<EPIPE> errors in this callback.
165 167
166AnyEvent::Handle tries to find an appropriate error code for you to check 168AnyEvent::Handle tries to find an appropriate error code for you to check
167against, but in some cases (TLS errors), this does not work well. It is 169against, but in some cases (TLS errors), this does not work well. It is
276For example, a server accepting connections from untrusted sources should 278For example, a server accepting connections from untrusted sources should
277be configured to accept only so-and-so much data that it cannot act on 279be configured to accept only so-and-so much data that it cannot act on
278(for example, when expecting a line, an attacker could send an unlimited 280(for example, when expecting a line, an attacker could send an unlimited
279amount of data without a callback ever being called as long as the line 281amount of data without a callback ever being called as long as the line
280isn't finished). 282isn't finished).
283
284=item wbuf_max => <bytes>
285
286If defined, then a fatal error will be raised (with C<$!> set to C<ENOSPC>)
287when the write buffer ever (strictly) exceeds this size. This is useful to
288avoid some forms of denial-of-service attacks.
289
290Although the units of this parameter is bytes, this is the I<raw> number
291of bytes not yet accepted by the kernel. This can make a difference when
292you e.g. use TLS, as TLS typically makes your write data larger (but it
293can also make it smaller due to compression).
294
295As an example of when this limit is useful, take a chat server that sends
296chat messages to a client. If the client does not read those in a timely
297manner then the send buffer in the server would grow unbounded.
281 298
282=item autocork => <boolean> 299=item autocork => <boolean>
283 300
284When disabled (the default), C<push_write> will try to immediately 301When disabled (the default), C<push_write> will try to immediately
285write the data to the handle if possible. This avoids having to register 302write the data to the handle if possible. This avoids having to register
337already have occured on BSD systems), but at least it will protect you 354already have occured on BSD systems), but at least it will protect you
338from most attacks. 355from most attacks.
339 356
340=item read_size => <bytes> 357=item read_size => <bytes>
341 358
342The default read block size (the number of bytes this module will 359The initial read block size, the number of bytes this module will try to
343try to read during each loop iteration, which affects memory 360read during each loop iteration. Each handle object will consume at least
344requirements). Default: C<8192>. 361this amount of memory for the read buffer as well, so when handling many
362connections requirements). See also C<max_read_size>. Default: C<2048>.
363
364=item max_read_size => <bytes>
365
366The maximum read buffer size used by the dynamic adjustment
367algorithm: Each time AnyEvent::Handle can read C<read_size> bytes in
368one go it will double C<read_size> up to the maximum given by this
369option. Default: C<131072> or C<read_size>, whichever is higher.
345 370
346=item low_water_mark => <bytes> 371=item low_water_mark => <bytes>
347 372
348Sets the number of bytes (default: C<0>) that make up an "empty" write 373Sets the number of bytes (default: C<0>) that make up an "empty" write
349buffer: If the buffer reaches this size or gets even samller it is 374buffer: If the buffer reaches this size or gets even samller it is
412Use the C<< ->starttls >> method if you need to start TLS negotiation later. 437Use the C<< ->starttls >> method if you need to start TLS negotiation later.
413 438
414=item tls_ctx => $anyevent_tls 439=item tls_ctx => $anyevent_tls
415 440
416Use the given C<AnyEvent::TLS> object to create the new TLS connection 441Use the given C<AnyEvent::TLS> object to create the new TLS connection
417(unless a connection object was specified directly). If this parameter is 442(unless a connection object was specified directly). If this
418missing, then AnyEvent::Handle will use C<AnyEvent::Handle::TLS_CTX>. 443parameter is missing (or C<undef>), then AnyEvent::Handle will use
444C<AnyEvent::Handle::TLS_CTX>.
419 445
420Instead of an object, you can also specify a hash reference with C<< key 446Instead of an object, you can also specify a hash reference with C<< key
421=> value >> pairs. Those will be passed to L<AnyEvent::TLS> to create a 447=> value >> pairs. Those will be passed to L<AnyEvent::TLS> to create a
422new TLS context object. 448new TLS context object.
423 449
491 AnyEvent::Socket::tcp_connect ( 517 AnyEvent::Socket::tcp_connect (
492 $self->{connect}[0], 518 $self->{connect}[0],
493 $self->{connect}[1], 519 $self->{connect}[1],
494 sub { 520 sub {
495 my ($fh, $host, $port, $retry) = @_; 521 my ($fh, $host, $port, $retry) = @_;
522
523 delete $self->{_connect}; # no longer needed
496 524
497 if ($fh) { 525 if ($fh) {
498 $self->{fh} = $fh; 526 $self->{fh} = $fh;
499 527
500 delete $self->{_skip_drain_rbuf}; 528 delete $self->{_skip_drain_rbuf};
518 }, 546 },
519 sub { 547 sub {
520 local $self->{fh} = $_[0]; 548 local $self->{fh} = $_[0];
521 549
522 $self->{on_prepare} 550 $self->{on_prepare}
523 ? $self->{on_prepare}->($self) 551 ? $self->{on_prepare}->($self)
524 : () 552 : ()
525 } 553 }
526 ); 554 );
527 } 555 }
528 556
545 AnyEvent::Util::fh_nonblocking $self->{fh}, 1; 573 AnyEvent::Util::fh_nonblocking $self->{fh}, 1;
546 574
547 $self->{_activity} = 575 $self->{_activity} =
548 $self->{_ractivity} = 576 $self->{_ractivity} =
549 $self->{_wactivity} = AE::now; 577 $self->{_wactivity} = AE::now;
578
579 $self->{read_size} ||= 2048;
580 $self->{max_read_size} = $self->{read_size}
581 if $self->{read_size} > ($self->{max_read_size} || MAX_READ_SIZE);
550 582
551 $self->timeout (delete $self->{timeout} ) if $self->{timeout}; 583 $self->timeout (delete $self->{timeout} ) if $self->{timeout};
552 $self->rtimeout (delete $self->{rtimeout} ) if $self->{rtimeout}; 584 $self->rtimeout (delete $self->{rtimeout} ) if $self->{rtimeout};
553 $self->wtimeout (delete $self->{wtimeout} ) if $self->{wtimeout}; 585 $self->wtimeout (delete $self->{wtimeout} ) if $self->{wtimeout};
554 586
723 755
724=item $handle->rbuf_max ($max_octets) 756=item $handle->rbuf_max ($max_octets)
725 757
726Configures the C<rbuf_max> setting (C<undef> disables it). 758Configures the C<rbuf_max> setting (C<undef> disables it).
727 759
760=item $handle->wbuf_max ($max_octets)
761
762Configures the C<wbuf_max> setting (C<undef> disables it).
763
728=cut 764=cut
729 765
730sub rbuf_max { 766sub rbuf_max {
731 $_[0]{rbuf_max} = $_[1]; 767 $_[0]{rbuf_max} = $_[1];
768}
769
770sub wbuf_max {
771 $_[0]{wbuf_max} = $_[1];
732} 772}
733 773
734############################################################################# 774#############################################################################
735 775
736=item $handle->timeout ($seconds) 776=item $handle->timeout ($seconds)
856 if $cb && $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf}); 896 if $cb && $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf});
857} 897}
858 898
859=item $handle->push_write ($data) 899=item $handle->push_write ($data)
860 900
861Queues the given scalar to be written. You can push as much data as you 901Queues the given scalar to be written. You can push as much data as
862want (only limited by the available memory), as C<AnyEvent::Handle> 902you want (only limited by the available memory and C<wbuf_max>), as
863buffers it independently of the kernel. 903C<AnyEvent::Handle> buffers it independently of the kernel.
864 904
865This method may invoke callbacks (and therefore the handle might be 905This method may invoke callbacks (and therefore the handle might be
866destroyed after it returns). 906destroyed after it returns).
867 907
868=cut 908=cut
896 $cb->() unless $self->{autocork}; 936 $cb->() unless $self->{autocork};
897 937
898 # if still data left in wbuf, we need to poll 938 # if still data left in wbuf, we need to poll
899 $self->{_ww} = AE::io $self->{fh}, 1, $cb 939 $self->{_ww} = AE::io $self->{fh}, 1, $cb
900 if length $self->{wbuf}; 940 if length $self->{wbuf};
941
942 if (
943 defined $self->{wbuf_max}
944 && $self->{wbuf_max} < length $self->{wbuf}
945 ) {
946 $self->_error (Errno::ENOSPC, 1), return;
947 }
901 }; 948 };
902} 949}
903 950
904our %WH; 951our %WH;
905 952
1040before it was actually written. One way to do that is to replace your 1087before it was actually written. One way to do that is to replace your
1041C<on_drain> handler by a callback that shuts down the socket (and set 1088C<on_drain> handler by a callback that shuts down the socket (and set
1042C<low_water_mark> to C<0>). This method is a shorthand for just that, and 1089C<low_water_mark> to C<0>). This method is a shorthand for just that, and
1043replaces the C<on_drain> callback with: 1090replaces the C<on_drain> callback with:
1044 1091
1045 sub { shutdown $_[0]{fh}, 1 } # for push_shutdown 1092 sub { shutdown $_[0]{fh}, 1 }
1046 1093
1047This simply shuts down the write side and signals an EOF condition to the 1094This simply shuts down the write side and signals an EOF condition to the
1048the peer. 1095the peer.
1049 1096
1050You can rely on the normal read queue and C<on_eof> handling 1097You can rely on the normal read queue and C<on_eof> handling
1726Note that AnyEvent::Handle will automatically C<start_read> for you when 1773Note that AnyEvent::Handle will automatically C<start_read> for you when
1727you change the C<on_read> callback or push/unshift a read callback, and it 1774you change the C<on_read> callback or push/unshift a read callback, and it
1728will automatically C<stop_read> for you when neither C<on_read> is set nor 1775will automatically C<stop_read> for you when neither C<on_read> is set nor
1729there are any read requests in the queue. 1776there are any read requests in the queue.
1730 1777
1731These methods will have no effect when in TLS mode (as TLS doesn't support 1778In older versions of this module (<= 5.3), these methods had no effect,
1732half-duplex connections). 1779as TLS does not support half-duplex connections. In current versions they
1780work as expected, as this behaviour is required to avoid certain resource
1781attacks, where the program would be forced to read (and buffer) arbitrary
1782amounts of data before being able to send some data. The drawback is that
1783some readings of the the SSL/TLS specifications basically require this
1784attack to be working, as SSL/TLS implementations might stall sending data
1785during a rehandshake.
1786
1787As a guideline, during the initial handshake, you should not stop reading,
1788and as a client, it might cause problems, depending on your applciation.
1733 1789
1734=cut 1790=cut
1735 1791
1736sub stop_read { 1792sub stop_read {
1737 my ($self) = @_; 1793 my ($self) = @_;
1738 1794
1739 delete $self->{_rw} unless $self->{tls}; 1795 delete $self->{_rw};
1740} 1796}
1741 1797
1742sub start_read { 1798sub start_read {
1743 my ($self) = @_; 1799 my ($self) = @_;
1744 1800
1745 unless ($self->{_rw} || $self->{_eof} || !$self->{fh}) { 1801 unless ($self->{_rw} || $self->{_eof} || !$self->{fh}) {
1746 Scalar::Util::weaken $self; 1802 Scalar::Util::weaken $self;
1747 1803
1748 $self->{_rw} = AE::io $self->{fh}, 0, sub { 1804 $self->{_rw} = AE::io $self->{fh}, 0, sub {
1749 my $rbuf = \($self->{tls} ? my $buf : $self->{rbuf}); 1805 my $rbuf = \($self->{tls} ? my $buf : $self->{rbuf});
1750 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size} || 8192, length $$rbuf; 1806 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size}, length $$rbuf;
1751 1807
1752 if ($len > 0) { 1808 if ($len > 0) {
1753 $self->{_activity} = $self->{_ractivity} = AE::now; 1809 $self->{_activity} = $self->{_ractivity} = AE::now;
1754 1810
1755 if ($self->{tls}) { 1811 if ($self->{tls}) {
1756 Net::SSLeay::BIO_write ($self->{_rbio}, $$rbuf); 1812 Net::SSLeay::BIO_write ($self->{_rbio}, $$rbuf);
1757 1813
1758 &_dotls ($self); 1814 &_dotls ($self);
1759 } else { 1815 } else {
1760 $self->_drain_rbuf; 1816 $self->_drain_rbuf;
1817 }
1818
1819 if ($len == $self->{read_size}) {
1820 $self->{read_size} *= 2;
1821 $self->{read_size} = $self->{max_read_size} || MAX_READ_SIZE
1822 if $self->{read_size} > ($self->{max_read_size} || MAX_READ_SIZE);
1761 } 1823 }
1762 1824
1763 } elsif (defined $len) { 1825 } elsif (defined $len) {
1764 delete $self->{_rw}; 1826 delete $self->{_rw};
1765 $self->{_eof} = 1; 1827 $self->{_eof} = 1;
1988 if $self->{tls} > 0; 2050 if $self->{tls} > 0;
1989 2051
1990 delete @$self{qw(_rbio _wbio _tls_wbuf _on_starttls)}; 2052 delete @$self{qw(_rbio _wbio _tls_wbuf _on_starttls)};
1991} 2053}
1992 2054
2055=item $handle->resettls
2056
2057This rarely-used method simply resets and TLS state on the handle, usually
2058causing data loss.
2059
2060One case where it may be useful is when you want to skip over the data in
2061the stream but you are not interested in interpreting it, so data loss is
2062no concern.
2063
2064=cut
2065
2066*resettls = \&_freetls;
2067
1993sub DESTROY { 2068sub DESTROY {
1994 my ($self) = @_; 2069 my ($self) = @_;
1995 2070
1996 &_freetls; 2071 &_freetls;
1997 2072
2006 push @linger, AE::io $fh, 1, sub { 2081 push @linger, AE::io $fh, 1, sub {
2007 my $len = syswrite $fh, $wbuf, length $wbuf; 2082 my $len = syswrite $fh, $wbuf, length $wbuf;
2008 2083
2009 if ($len > 0) { 2084 if ($len > 0) {
2010 substr $wbuf, 0, $len, ""; 2085 substr $wbuf, 0, $len, "";
2011 } else { 2086 } elsif (defined $len || ($! != EAGAIN && $! != EINTR && $! != WSAEWOULDBLOCK)) {
2012 @linger = (); # end 2087 @linger = (); # end
2013 } 2088 }
2014 }; 2089 };
2015 push @linger, AE::timer $linger, 0, sub { 2090 push @linger, AE::timer $linger, 0, sub {
2016 @linger = (); 2091 @linger = ();
2112 2187
2113It is only safe to "forget" the reference inside EOF or error callbacks, 2188It is only safe to "forget" the reference inside EOF or error callbacks,
2114from within all other callbacks, you need to explicitly call the C<< 2189from within all other callbacks, you need to explicitly call the C<<
2115->destroy >> method. 2190->destroy >> method.
2116 2191
2192=item Why is my C<on_eof> callback never called?
2193
2194Probably because your C<on_error> callback is being called instead: When
2195you have outstanding requests in your read queue, then an EOF is
2196considered an error as you clearly expected some data.
2197
2198To avoid this, make sure you have an empty read queue whenever your handle
2199is supposed to be "idle" (i.e. connection closes are O.K.). You cna set
2200an C<on_read> handler that simply pushes the first read requests in the
2201queue.
2202
2203See also the next question, which explains this in a bit more detail.
2204
2205=item How can I serve requests in a loop?
2206
2207Most protocols consist of some setup phase (authentication for example)
2208followed by a request handling phase, where the server waits for requests
2209and handles them, in a loop.
2210
2211There are two important variants: The first (traditional, better) variant
2212handles requests until the server gets some QUIT command, causing it to
2213close the connection first (highly desirable for a busy TCP server). A
2214client dropping the connection is an error, which means this variant can
2215detect an unexpected detection close.
2216
2217To handle this case, always make sure you have a on-empty read queue, by
2218pushing the "read request start" handler on it:
2219
2220 # we assume a request starts with a single line
2221 my @start_request; @start_request = (line => sub {
2222 my ($hdl, $line) = @_;
2223
2224 ... handle request
2225
2226 # push next request read, possibly from a nested callback
2227 $hdl->push_read (@start_request);
2228 });
2229
2230 # auth done, now go into request handling loop
2231 # now push the first @start_request
2232 $hdl->push_read (@start_request);
2233
2234By always having an outstanding C<push_read>, the handle always expects
2235some data and raises the C<EPIPE> error when the connction is dropped
2236unexpectedly.
2237
2238The second variant is a protocol where the client can drop the connection
2239at any time. For TCP, this means that the server machine may run out of
2240sockets easier, and in general, it means you cnanot distinguish a protocl
2241failure/client crash from a normal connection close. Nevertheless, these
2242kinds of protocols are common (and sometimes even the best solution to the
2243problem).
2244
2245Having an outstanding read request at all times is possible if you ignore
2246C<EPIPE> errors, but this doesn't help with when the client drops the
2247connection during a request, which would still be an error.
2248
2249A better solution is to push the initial request read in an C<on_read>
2250callback. This avoids an error, as when the server doesn't expect data
2251(i.e. is idly waiting for the next request, an EOF will not raise an
2252error, but simply result in an C<on_eof> callback. It is also a bit slower
2253and simpler:
2254
2255 # auth done, now go into request handling loop
2256 $hdl->on_read (sub {
2257 my ($hdl) = @_;
2258
2259 # called each time we receive data but the read queue is empty
2260 # simply start read the request
2261
2262 $hdl->push_read (line => sub {
2263 my ($hdl, $line) = @_;
2264
2265 ... handle request
2266
2267 # do nothing special when the request has been handled, just
2268 # let the request queue go empty.
2269 });
2270 });
2271
2117=item I get different callback invocations in TLS mode/Why can't I pause 2272=item I get different callback invocations in TLS mode/Why can't I pause
2118reading? 2273reading?
2119 2274
2120Unlike, say, TCP, TLS connections do not consist of two independent 2275Unlike, say, TCP, TLS connections do not consist of two independent
2121communication channels, one for each direction. Or put differently, the 2276communication channels, one for each direction. Or put differently, the

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines