ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent/lib/AnyEvent/Handle.pm
(Generate patch)

Comparing AnyEvent/lib/AnyEvent/Handle.pm (file contents):
Revision 1.202 by root, Sat Oct 16 02:01:54 2010 UTC vs.
Revision 1.211 by root, Fri Dec 31 04:47:41 2010 UTC

75 } 75 }
76 76
77 \&$func 77 \&$func
78} 78}
79 79
80sub MAX_READ_SIZE() { 131072 }
81
80=head1 METHODS 82=head1 METHODS
81 83
82=over 4 84=over 4
83 85
84=item $handle = B<new> AnyEvent::Handle fh => $filehandle, key => value... 86=item $handle = B<new> AnyEvent::Handle fh => $filehandle, key => value...
112=over 4 114=over 4
113 115
114=item on_prepare => $cb->($handle) 116=item on_prepare => $cb->($handle)
115 117
116This (rarely used) callback is called before a new connection is 118This (rarely used) callback is called before a new connection is
117attempted, but after the file handle has been created. It could be used to 119attempted, but after the file handle has been created (you can access that
120file handle via C<< $handle->{fh} >>). It could be used to prepare the
118prepare the file handle with parameters required for the actual connect 121file handle with parameters required for the actual connect (as opposed to
119(as opposed to settings that can be changed when the connection is already 122settings that can be changed when the connection is already established).
120established).
121 123
122The return value of this callback should be the connect timeout value in 124The return value of this callback should be the connect timeout value in
123seconds (or C<0>, or C<undef>, or the empty list, to indicate that the 125seconds (or C<0>, or C<undef>, or the empty list, to indicate that the
124default timeout is to be used). 126default timeout is to be used).
125 127
157 159
158Some errors are fatal (which is indicated by C<$fatal> being true). On 160Some errors are fatal (which is indicated by C<$fatal> being true). On
159fatal errors the handle object will be destroyed (by a call to C<< -> 161fatal errors the handle object will be destroyed (by a call to C<< ->
160destroy >>) after invoking the error callback (which means you are free to 162destroy >>) after invoking the error callback (which means you are free to
161examine the handle object). Examples of fatal errors are an EOF condition 163examine the handle object). Examples of fatal errors are an EOF condition
162with active (but unsatisifable) read watchers (C<EPIPE>) or I/O errors. In 164with active (but unsatisfiable) read watchers (C<EPIPE>) or I/O errors. In
163cases where the other side can close the connection at will, it is 165cases where the other side can close the connection at will, it is
164often easiest to not report C<EPIPE> errors in this callback. 166often easiest to not report C<EPIPE> errors in this callback.
165 167
166AnyEvent::Handle tries to find an appropriate error code for you to check 168AnyEvent::Handle tries to find an appropriate error code for you to check
167against, but in some cases (TLS errors), this does not work well. It is 169against, but in some cases (TLS errors), this does not work well. It is
276For example, a server accepting connections from untrusted sources should 278For example, a server accepting connections from untrusted sources should
277be configured to accept only so-and-so much data that it cannot act on 279be configured to accept only so-and-so much data that it cannot act on
278(for example, when expecting a line, an attacker could send an unlimited 280(for example, when expecting a line, an attacker could send an unlimited
279amount of data without a callback ever being called as long as the line 281amount of data without a callback ever being called as long as the line
280isn't finished). 282isn't finished).
283
284=item wbuf_max => <bytes>
285
286If defined, then a fatal error will be raised (with C<$!> set to C<ENOSPC>)
287when the write buffer ever (strictly) exceeds this size. This is useful to
288avoid some forms of denial-of-service attacks.
289
290Although the units of this parameter is bytes, this is the I<raw> number
291of bytes not yet accepted by the kernel. This can make a difference when
292you e.g. use TLS, as TLS typically makes your write data larger (but it
293can also make it smaller due to compression).
294
295As an example of when this limit is useful, take a chat server that sends
296chat messages to a client. If the client does not read those in a timely
297manner then the send buffer in the server would grow unbounded.
281 298
282=item autocork => <boolean> 299=item autocork => <boolean>
283 300
284When disabled (the default), C<push_write> will try to immediately 301When disabled (the default), C<push_write> will try to immediately
285write the data to the handle if possible. This avoids having to register 302write the data to the handle if possible. This avoids having to register
337already have occured on BSD systems), but at least it will protect you 354already have occured on BSD systems), but at least it will protect you
338from most attacks. 355from most attacks.
339 356
340=item read_size => <bytes> 357=item read_size => <bytes>
341 358
342The default read block size (the number of bytes this module will 359The initial read block size, the number of bytes this module will try to
343try to read during each loop iteration, which affects memory 360read during each loop iteration. Each handle object will consume at least
344requirements). Default: C<8192>. 361this amount of memory for the read buffer as well, so when handling many
362connections requirements). See also C<max_read_size>. Default: C<2048>.
363
364=item max_read_size => <bytes>
365
366The maximum read buffer size used by the dynamic adjustment
367algorithm: Each time AnyEvent::Handle can read C<read_size> bytes in
368one go it will double C<read_size> up to the maximum given by this
369option. Default: C<131072> or C<read_size>, whichever is higher.
345 370
346=item low_water_mark => <bytes> 371=item low_water_mark => <bytes>
347 372
348Sets the number of bytes (default: C<0>) that make up an "empty" write 373Sets the number of bytes (default: C<0>) that make up an "empty" write
349buffer: If the buffer reaches this size or gets even samller it is 374buffer: If the buffer reaches this size or gets even samller it is
412Use the C<< ->starttls >> method if you need to start TLS negotiation later. 437Use the C<< ->starttls >> method if you need to start TLS negotiation later.
413 438
414=item tls_ctx => $anyevent_tls 439=item tls_ctx => $anyevent_tls
415 440
416Use the given C<AnyEvent::TLS> object to create the new TLS connection 441Use the given C<AnyEvent::TLS> object to create the new TLS connection
417(unless a connection object was specified directly). If this parameter is 442(unless a connection object was specified directly). If this
418missing, then AnyEvent::Handle will use C<AnyEvent::Handle::TLS_CTX>. 443parameter is missing (or C<undef>), then AnyEvent::Handle will use
444C<AnyEvent::Handle::TLS_CTX>.
419 445
420Instead of an object, you can also specify a hash reference with C<< key 446Instead of an object, you can also specify a hash reference with C<< key
421=> value >> pairs. Those will be passed to L<AnyEvent::TLS> to create a 447=> value >> pairs. Those will be passed to L<AnyEvent::TLS> to create a
422new TLS context object. 448new TLS context object.
423 449
491 AnyEvent::Socket::tcp_connect ( 517 AnyEvent::Socket::tcp_connect (
492 $self->{connect}[0], 518 $self->{connect}[0],
493 $self->{connect}[1], 519 $self->{connect}[1],
494 sub { 520 sub {
495 my ($fh, $host, $port, $retry) = @_; 521 my ($fh, $host, $port, $retry) = @_;
522
523 delete $self->{_connect}; # no longer needed
496 524
497 if ($fh) { 525 if ($fh) {
498 $self->{fh} = $fh; 526 $self->{fh} = $fh;
499 527
500 delete $self->{_skip_drain_rbuf}; 528 delete $self->{_skip_drain_rbuf};
518 }, 546 },
519 sub { 547 sub {
520 local $self->{fh} = $_[0]; 548 local $self->{fh} = $_[0];
521 549
522 $self->{on_prepare} 550 $self->{on_prepare}
523 ? $self->{on_prepare}->($self) 551 ? $self->{on_prepare}->($self)
524 : () 552 : ()
525 } 553 }
526 ); 554 );
527 } 555 }
528 556
545 AnyEvent::Util::fh_nonblocking $self->{fh}, 1; 573 AnyEvent::Util::fh_nonblocking $self->{fh}, 1;
546 574
547 $self->{_activity} = 575 $self->{_activity} =
548 $self->{_ractivity} = 576 $self->{_ractivity} =
549 $self->{_wactivity} = AE::now; 577 $self->{_wactivity} = AE::now;
578
579 $self->{read_size} ||= 2048;
580 $self->{max_read_size} = $self->{read_size}
581 if $self->{read_size} > ($self->{max_read_size} || MAX_READ_SIZE);
550 582
551 $self->timeout (delete $self->{timeout} ) if $self->{timeout}; 583 $self->timeout (delete $self->{timeout} ) if $self->{timeout};
552 $self->rtimeout (delete $self->{rtimeout} ) if $self->{rtimeout}; 584 $self->rtimeout (delete $self->{rtimeout} ) if $self->{rtimeout};
553 $self->wtimeout (delete $self->{wtimeout} ) if $self->{wtimeout}; 585 $self->wtimeout (delete $self->{wtimeout} ) if $self->{wtimeout};
554 586
723 755
724=item $handle->rbuf_max ($max_octets) 756=item $handle->rbuf_max ($max_octets)
725 757
726Configures the C<rbuf_max> setting (C<undef> disables it). 758Configures the C<rbuf_max> setting (C<undef> disables it).
727 759
760=item $handle->wbuf_max ($max_octets)
761
762Configures the C<wbuf_max> setting (C<undef> disables it).
763
728=cut 764=cut
729 765
730sub rbuf_max { 766sub rbuf_max {
731 $_[0]{rbuf_max} = $_[1]; 767 $_[0]{rbuf_max} = $_[1];
768}
769
770sub rbuf_max {
771 $_[0]{wbuf_max} = $_[1];
732} 772}
733 773
734############################################################################# 774#############################################################################
735 775
736=item $handle->timeout ($seconds) 776=item $handle->timeout ($seconds)
856 if $cb && $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf}); 896 if $cb && $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf});
857} 897}
858 898
859=item $handle->push_write ($data) 899=item $handle->push_write ($data)
860 900
861Queues the given scalar to be written. You can push as much data as you 901Queues the given scalar to be written. You can push as much data as
862want (only limited by the available memory), as C<AnyEvent::Handle> 902you want (only limited by the available memory and C<wbuf_max>), as
863buffers it independently of the kernel. 903C<AnyEvent::Handle> buffers it independently of the kernel.
864 904
865This method may invoke callbacks (and therefore the handle might be 905This method may invoke callbacks (and therefore the handle might be
866destroyed after it returns). 906destroyed after it returns).
867 907
868=cut 908=cut
896 $cb->() unless $self->{autocork}; 936 $cb->() unless $self->{autocork};
897 937
898 # if still data left in wbuf, we need to poll 938 # if still data left in wbuf, we need to poll
899 $self->{_ww} = AE::io $self->{fh}, 1, $cb 939 $self->{_ww} = AE::io $self->{fh}, 1, $cb
900 if length $self->{wbuf}; 940 if length $self->{wbuf};
941
942 if (
943 defined $self->{wbuf_max}
944 && $self->{wbuf_max} < length $self->{wbuf}
945 ) {
946 $self->_error (Errno::ENOSPC, 1), return;
947 }
901 }; 948 };
902} 949}
903 950
904our %WH; 951our %WH;
905 952
1569 sub { 1616 sub {
1570 # when we can use 5.10 we can use ".", but for 5.8 we use the re-pack method 1617 # when we can use 5.10 we can use ".", but for 5.8 we use the re-pack method
1571 defined (my $len = eval { unpack $format, $_[0]{rbuf} }) 1618 defined (my $len = eval { unpack $format, $_[0]{rbuf} })
1572 or return; 1619 or return;
1573 1620
1621 warn "len $len\n";#d#
1574 $format = length pack $format, $len; 1622 $format = length pack $format, $len;
1623 warn "len2 $format\n";#d#
1575 1624
1576 # bypass unshift if we already have the remaining chunk 1625 # bypass unshift if we already have the remaining chunk
1577 if ($format + $len <= length $_[0]{rbuf}) { 1626 if ($format + $len <= length $_[0]{rbuf}) {
1578 my $data = substr $_[0]{rbuf}, $format, $len; 1627 my $data = substr $_[0]{rbuf}, $format, $len;
1579 substr $_[0]{rbuf}, 0, $format + $len, ""; 1628 substr $_[0]{rbuf}, 0, $format + $len, "";
1745 unless ($self->{_rw} || $self->{_eof} || !$self->{fh}) { 1794 unless ($self->{_rw} || $self->{_eof} || !$self->{fh}) {
1746 Scalar::Util::weaken $self; 1795 Scalar::Util::weaken $self;
1747 1796
1748 $self->{_rw} = AE::io $self->{fh}, 0, sub { 1797 $self->{_rw} = AE::io $self->{fh}, 0, sub {
1749 my $rbuf = \($self->{tls} ? my $buf : $self->{rbuf}); 1798 my $rbuf = \($self->{tls} ? my $buf : $self->{rbuf});
1750 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size} || 8192, length $$rbuf; 1799 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size}, length $$rbuf;
1751 1800
1752 if ($len > 0) { 1801 if ($len > 0) {
1753 $self->{_activity} = $self->{_ractivity} = AE::now; 1802 $self->{_activity} = $self->{_ractivity} = AE::now;
1754 1803
1755 if ($self->{tls}) { 1804 if ($self->{tls}) {
1756 Net::SSLeay::BIO_write ($self->{_rbio}, $$rbuf); 1805 Net::SSLeay::BIO_write ($self->{_rbio}, $$rbuf);
1757 1806
1758 &_dotls ($self); 1807 &_dotls ($self);
1759 } else { 1808 } else {
1760 $self->_drain_rbuf; 1809 $self->_drain_rbuf;
1810 }
1811
1812 if ($len == $self->{read_size}) {
1813 $self->{read_size} *= 2;
1814 $self->{read_size} = $self->{max_read_size} || MAX_READ_SIZE
1815 if $self->{read_size} > ($self->{max_read_size} || MAX_READ_SIZE);
1761 } 1816 }
1762 1817
1763 } elsif (defined $len) { 1818 } elsif (defined $len) {
1764 delete $self->{_rw}; 1819 delete $self->{_rw};
1765 $self->{_eof} = 1; 1820 $self->{_eof} = 1;
2112 2167
2113It is only safe to "forget" the reference inside EOF or error callbacks, 2168It is only safe to "forget" the reference inside EOF or error callbacks,
2114from within all other callbacks, you need to explicitly call the C<< 2169from within all other callbacks, you need to explicitly call the C<<
2115->destroy >> method. 2170->destroy >> method.
2116 2171
2172=item Why is my C<on_eof> callback never called?
2173
2174Probably because your C<on_error> callback is being called instead: When
2175you have outstanding requests in your read queue, then an EOF is
2176considered an error as you clearly expected some data.
2177
2178To avoid this, make sure you have an empty read queue whenever your handle
2179is supposed to be "idle" (i.e. connection closes are O.K.). You cna set
2180an C<on_read> handler that simply pushes the first read requests in the
2181queue.
2182
2183See also the next question, which explains this in a bit more detail.
2184
2185=item How can I serve requests in a loop?
2186
2187Most protocols consist of some setup phase (authentication for example)
2188followed by a request handling phase, where the server waits for requests
2189and handles them, in a loop.
2190
2191There are two important variants: The first (traditional, better) variant
2192handles requests until the server gets some QUIT command, causing it to
2193close the connection first (highly desirable for a busy TCP server). A
2194client dropping the connection is an error, which means this variant can
2195detect an unexpected detection close.
2196
2197To handle this case, always make sure you have a on-empty read queue, by
2198pushing the "read request start" handler on it:
2199
2200 # we assume a request starts with a single line
2201 my @start_request; @start_request = (line => sub {
2202 my ($hdl, $line) = @_;
2203
2204 ... handle request
2205
2206 # push next request read, possibly from a nested callback
2207 $hdl->push_read (@start_request);
2208 });
2209
2210 # auth done, now go into request handling loop
2211 # now push the first @start_request
2212 $hdl->push_read (@start_request);
2213
2214By always having an outstanding C<push_read>, the handle always expects
2215some data and raises the C<EPIPE> error when the connction is dropped
2216unexpectedly.
2217
2218The second variant is a protocol where the client can drop the connection
2219at any time. For TCP, this means that the server machine may run out of
2220sockets easier, and in general, it means you cnanot distinguish a protocl
2221failure/client crash from a normal connection close. Nevertheless, these
2222kinds of protocols are common (and sometimes even the best solution to the
2223problem).
2224
2225Having an outstanding read request at all times is possible if you ignore
2226C<EPIPE> errors, but this doesn't help with when the client drops the
2227connection during a request, which would still be an error.
2228
2229A better solution is to push the initial request read in an C<on_read>
2230callback. This avoids an error, as when the server doesn't expect data
2231(i.e. is idly waiting for the next request, an EOF will not raise an
2232error, but simply result in an C<on_eof> callback. It is also a bit slower
2233and simpler:
2234
2235 # auth done, now go into request handling loop
2236 $hdl->on_read (sub {
2237 my ($hdl) = @_;
2238
2239 # called each time we receive data but the read queue is empty
2240 # simply start read the request
2241
2242 $hdl->push_read (line => sub {
2243 my ($hdl, $line) = @_;
2244
2245 ... handle request
2246
2247 # do nothing special when the request has been handled, just
2248 # let the request queue go empty.
2249 });
2250 });
2251
2117=item I get different callback invocations in TLS mode/Why can't I pause 2252=item I get different callback invocations in TLS mode/Why can't I pause
2118reading? 2253reading?
2119 2254
2120Unlike, say, TCP, TLS connections do not consist of two independent 2255Unlike, say, TCP, TLS connections do not consist of two independent
2121communication channels, one for each direction. Or put differently, the 2256communication channels, one for each direction. Or put differently, the

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines