ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent/lib/AnyEvent/Handle.pm
(Generate patch)

Comparing AnyEvent/lib/AnyEvent/Handle.pm (file contents):
Revision 1.202 by root, Sat Oct 16 02:01:54 2010 UTC vs.
Revision 1.233 by root, Thu Apr 5 06:14:10 2012 UTC

11 11
12 my $hdl; $hdl = new AnyEvent::Handle 12 my $hdl; $hdl = new AnyEvent::Handle
13 fh => \*STDIN, 13 fh => \*STDIN,
14 on_error => sub { 14 on_error => sub {
15 my ($hdl, $fatal, $msg) = @_; 15 my ($hdl, $fatal, $msg) = @_;
16 warn "got error $msg\n"; 16 AE::log error => $msg;
17 $hdl->destroy; 17 $hdl->destroy;
18 $cv->send; 18 $cv->send;
19 }; 19 };
20 20
21 # send some request line 21 # send some request line
22 $hdl->push_write ("getinfo\015\012"); 22 $hdl->push_write ("getinfo\015\012");
23 23
24 # read the response line 24 # read the response line
25 $hdl->push_read (line => sub { 25 $hdl->push_read (line => sub {
26 my ($hdl, $line) = @_; 26 my ($hdl, $line) = @_;
27 warn "got line <$line>\n"; 27 say "got line <$line>";
28 $cv->send; 28 $cv->send;
29 }); 29 });
30 30
31 $cv->recv; 31 $cv->recv;
32 32
75 } 75 }
76 76
77 \&$func 77 \&$func
78} 78}
79 79
80sub MAX_READ_SIZE() { 131072 }
81
80=head1 METHODS 82=head1 METHODS
81 83
82=over 4 84=over 4
83 85
84=item $handle = B<new> AnyEvent::Handle fh => $filehandle, key => value... 86=item $handle = B<new> AnyEvent::Handle fh => $filehandle, key => value...
112=over 4 114=over 4
113 115
114=item on_prepare => $cb->($handle) 116=item on_prepare => $cb->($handle)
115 117
116This (rarely used) callback is called before a new connection is 118This (rarely used) callback is called before a new connection is
117attempted, but after the file handle has been created. It could be used to 119attempted, but after the file handle has been created (you can access that
120file handle via C<< $handle->{fh} >>). It could be used to prepare the
118prepare the file handle with parameters required for the actual connect 121file handle with parameters required for the actual connect (as opposed to
119(as opposed to settings that can be changed when the connection is already 122settings that can be changed when the connection is already established).
120established).
121 123
122The return value of this callback should be the connect timeout value in 124The return value of this callback should be the connect timeout value in
123seconds (or C<0>, or C<undef>, or the empty list, to indicate that the 125seconds (or C<0>, or C<undef>, or the empty list, to indicate that the
124default timeout is to be used). 126default timeout is to be used).
125 127
126=item on_connect => $cb->($handle, $host, $port, $retry->()) 128=item on_connect => $cb->($handle, $host, $port, $retry->())
127 129
128This callback is called when a connection has been successfully established. 130This callback is called when a connection has been successfully established.
129 131
130The peer's numeric host and port (the socket peername) are passed as 132The peer's numeric host and port (the socket peername) are passed as
131parameters, together with a retry callback. 133parameters, together with a retry callback. At the time it is called the
134read and write queues, EOF status, TLS status and similar properties of
135the handle will have been reset.
132 136
137It is not allowed to use the read or write queues while the handle object
138is connecting.
139
133If, for some reason, the handle is not acceptable, calling C<$retry> 140If, for some reason, the handle is not acceptable, calling C<$retry> will
134will continue with the next connection target (in case of multi-homed 141continue with the next connection target (in case of multi-homed hosts or
135hosts or SRV records there can be multiple connection endpoints). At the 142SRV records there can be multiple connection endpoints). The C<$retry>
136time it is called the read and write queues, eof status, tls status and 143callback can be invoked after the connect callback returns, i.e. one can
137similar properties of the handle will have been reset. 144start a handshake and then decide to retry with the next host if the
145handshake fails.
138 146
139In most cases, you should ignore the C<$retry> parameter. 147In most cases, you should ignore the C<$retry> parameter.
140 148
141=item on_connect_error => $cb->($handle, $message) 149=item on_connect_error => $cb->($handle, $message)
142 150
157 165
158Some errors are fatal (which is indicated by C<$fatal> being true). On 166Some errors are fatal (which is indicated by C<$fatal> being true). On
159fatal errors the handle object will be destroyed (by a call to C<< -> 167fatal errors the handle object will be destroyed (by a call to C<< ->
160destroy >>) after invoking the error callback (which means you are free to 168destroy >>) after invoking the error callback (which means you are free to
161examine the handle object). Examples of fatal errors are an EOF condition 169examine the handle object). Examples of fatal errors are an EOF condition
162with active (but unsatisifable) read watchers (C<EPIPE>) or I/O errors. In 170with active (but unsatisfiable) read watchers (C<EPIPE>) or I/O errors. In
163cases where the other side can close the connection at will, it is 171cases where the other side can close the connection at will, it is
164often easiest to not report C<EPIPE> errors in this callback. 172often easiest to not report C<EPIPE> errors in this callback.
165 173
166AnyEvent::Handle tries to find an appropriate error code for you to check 174AnyEvent::Handle tries to find an appropriate error code for you to check
167against, but in some cases (TLS errors), this does not work well. It is 175against, but in some cases (TLS errors), this does not work well.
168recommended to always output the C<$message> argument in human-readable 176
169error messages (it's usually the same as C<"$!">). 177If you report the error to the user, it is recommended to always output
178the C<$message> argument in human-readable error messages (you don't need
179to report C<"$!"> if you report C<$message>).
180
181If you want to react programmatically to the error, then looking at C<$!>
182and comparing it against some of the documented C<Errno> values is usually
183better than looking at the C<$message>.
170 184
171Non-fatal errors can be retried by returning, but it is recommended 185Non-fatal errors can be retried by returning, but it is recommended
172to simply ignore this parameter and instead abondon the handle object 186to simply ignore this parameter and instead abondon the handle object
173when this callback is invoked. Examples of non-fatal errors are timeouts 187when this callback is invoked. Examples of non-fatal errors are timeouts
174C<ETIMEDOUT>) or badly-formatted data (C<EBADMSG>). 188C<ETIMEDOUT>) or badly-formatted data (C<EBADMSG>).
222If an EOF condition has been detected but no C<on_eof> callback has been 236If an EOF condition has been detected but no C<on_eof> callback has been
223set, then a fatal error will be raised with C<$!> set to <0>. 237set, then a fatal error will be raised with C<$!> set to <0>.
224 238
225=item on_drain => $cb->($handle) 239=item on_drain => $cb->($handle)
226 240
227This sets the callback that is called when the write buffer becomes empty 241This sets the callback that is called once when the write buffer becomes
228(or immediately if the buffer is empty already). 242empty (and immediately when the handle object is created).
229 243
230To append to the write buffer, use the C<< ->push_write >> method. 244To append to the write buffer, use the C<< ->push_write >> method.
231 245
232This callback is useful when you don't want to put all of your write data 246This callback is useful when you don't want to put all of your write data
233into the queue at once, for example, when you want to write the contents 247into the queue at once, for example, when you want to write the contents
245many seconds pass without a successful read or write on the underlying 259many seconds pass without a successful read or write on the underlying
246file handle (or a call to C<timeout_reset>), the C<on_timeout> callback 260file handle (or a call to C<timeout_reset>), the C<on_timeout> callback
247will be invoked (and if that one is missing, a non-fatal C<ETIMEDOUT> 261will be invoked (and if that one is missing, a non-fatal C<ETIMEDOUT>
248error will be raised). 262error will be raised).
249 263
250There are three variants of the timeouts that work independently 264There are three variants of the timeouts that work independently of each
251of each other, for both read and write, just read, and just write: 265other, for both read and write (triggered when nothing was read I<OR>
266written), just read (triggered when nothing was read), and just write:
252C<timeout>, C<rtimeout> and C<wtimeout>, with corresponding callbacks 267C<timeout>, C<rtimeout> and C<wtimeout>, with corresponding callbacks
253C<on_timeout>, C<on_rtimeout> and C<on_wtimeout>, and reset functions 268C<on_timeout>, C<on_rtimeout> and C<on_wtimeout>, and reset functions
254C<timeout_reset>, C<rtimeout_reset>, and C<wtimeout_reset>. 269C<timeout_reset>, C<rtimeout_reset>, and C<wtimeout_reset>.
255 270
256Note that timeout processing is active even when you do not have 271Note that timeout processing is active even when you do not have any
257any outstanding read or write requests: If you plan to keep the connection 272outstanding read or write requests: If you plan to keep the connection
258idle then you should disable the timeout temporarily or ignore the timeout 273idle then you should disable the timeout temporarily or ignore the
259in the C<on_timeout> callback, in which case AnyEvent::Handle will simply 274timeout in the corresponding C<on_timeout> callback, in which case
260restart the timeout. 275AnyEvent::Handle will simply restart the timeout.
261 276
262Zero (the default) disables this timeout. 277Zero (the default) disables the corresponding timeout.
263 278
264=item on_timeout => $cb->($handle) 279=item on_timeout => $cb->($handle)
280
281=item on_rtimeout => $cb->($handle)
282
283=item on_wtimeout => $cb->($handle)
265 284
266Called whenever the inactivity timeout passes. If you return from this 285Called whenever the inactivity timeout passes. If you return from this
267callback, then the timeout will be reset as if some activity had happened, 286callback, then the timeout will be reset as if some activity had happened,
268so this condition is not fatal in any way. 287so this condition is not fatal in any way.
269 288
276For example, a server accepting connections from untrusted sources should 295For example, a server accepting connections from untrusted sources should
277be configured to accept only so-and-so much data that it cannot act on 296be configured to accept only so-and-so much data that it cannot act on
278(for example, when expecting a line, an attacker could send an unlimited 297(for example, when expecting a line, an attacker could send an unlimited
279amount of data without a callback ever being called as long as the line 298amount of data without a callback ever being called as long as the line
280isn't finished). 299isn't finished).
300
301=item wbuf_max => <bytes>
302
303If defined, then a fatal error will be raised (with C<$!> set to C<ENOSPC>)
304when the write buffer ever (strictly) exceeds this size. This is useful to
305avoid some forms of denial-of-service attacks.
306
307Although the units of this parameter is bytes, this is the I<raw> number
308of bytes not yet accepted by the kernel. This can make a difference when
309you e.g. use TLS, as TLS typically makes your write data larger (but it
310can also make it smaller due to compression).
311
312As an example of when this limit is useful, take a chat server that sends
313chat messages to a client. If the client does not read those in a timely
314manner then the send buffer in the server would grow unbounded.
281 315
282=item autocork => <boolean> 316=item autocork => <boolean>
283 317
284When disabled (the default), C<push_write> will try to immediately 318When disabled (the default), C<push_write> will try to immediately
285write the data to the handle if possible. This avoids having to register 319write the data to the handle if possible. This avoids having to register
337already have occured on BSD systems), but at least it will protect you 371already have occured on BSD systems), but at least it will protect you
338from most attacks. 372from most attacks.
339 373
340=item read_size => <bytes> 374=item read_size => <bytes>
341 375
342The default read block size (the number of bytes this module will 376The initial read block size, the number of bytes this module will try
343try to read during each loop iteration, which affects memory 377to read during each loop iteration. Each handle object will consume
344requirements). Default: C<8192>. 378at least this amount of memory for the read buffer as well, so when
379handling many connections watch out for memory requirements). See also
380C<max_read_size>. Default: C<2048>.
381
382=item max_read_size => <bytes>
383
384The maximum read buffer size used by the dynamic adjustment
385algorithm: Each time AnyEvent::Handle can read C<read_size> bytes in
386one go it will double C<read_size> up to the maximum given by this
387option. Default: C<131072> or C<read_size>, whichever is higher.
345 388
346=item low_water_mark => <bytes> 389=item low_water_mark => <bytes>
347 390
348Sets the number of bytes (default: C<0>) that make up an "empty" write 391Sets the number of bytes (default: C<0>) that make up an "empty" write
349buffer: If the buffer reaches this size or gets even samller it is 392buffer: If the buffer reaches this size or gets even samller it is
412Use the C<< ->starttls >> method if you need to start TLS negotiation later. 455Use the C<< ->starttls >> method if you need to start TLS negotiation later.
413 456
414=item tls_ctx => $anyevent_tls 457=item tls_ctx => $anyevent_tls
415 458
416Use the given C<AnyEvent::TLS> object to create the new TLS connection 459Use the given C<AnyEvent::TLS> object to create the new TLS connection
417(unless a connection object was specified directly). If this parameter is 460(unless a connection object was specified directly). If this
418missing, then AnyEvent::Handle will use C<AnyEvent::Handle::TLS_CTX>. 461parameter is missing (or C<undef>), then AnyEvent::Handle will use
462C<AnyEvent::Handle::TLS_CTX>.
419 463
420Instead of an object, you can also specify a hash reference with C<< key 464Instead of an object, you can also specify a hash reference with C<< key
421=> value >> pairs. Those will be passed to L<AnyEvent::TLS> to create a 465=> value >> pairs. Those will be passed to L<AnyEvent::TLS> to create a
422new TLS context object. 466new TLS context object.
423 467
492 $self->{connect}[0], 536 $self->{connect}[0],
493 $self->{connect}[1], 537 $self->{connect}[1],
494 sub { 538 sub {
495 my ($fh, $host, $port, $retry) = @_; 539 my ($fh, $host, $port, $retry) = @_;
496 540
541 delete $self->{_connect}; # no longer needed
542
497 if ($fh) { 543 if ($fh) {
498 $self->{fh} = $fh; 544 $self->{fh} = $fh;
499 545
500 delete $self->{_skip_drain_rbuf}; 546 delete $self->{_skip_drain_rbuf};
501 $self->_start; 547 $self->_start;
508 }); 554 });
509 555
510 } else { 556 } else {
511 if ($self->{on_connect_error}) { 557 if ($self->{on_connect_error}) {
512 $self->{on_connect_error}($self, "$!"); 558 $self->{on_connect_error}($self, "$!");
513 $self->destroy; 559 $self->destroy if $self;
514 } else { 560 } else {
515 $self->_error ($!, 1); 561 $self->_error ($!, 1);
516 } 562 }
517 } 563 }
518 }, 564 },
519 sub { 565 sub {
520 local $self->{fh} = $_[0]; 566 local $self->{fh} = $_[0];
521 567
522 $self->{on_prepare} 568 $self->{on_prepare}
523 ? $self->{on_prepare}->($self) 569 ? $self->{on_prepare}->($self)
524 : () 570 : ()
525 } 571 }
526 ); 572 );
527 } 573 }
528 574
545 AnyEvent::Util::fh_nonblocking $self->{fh}, 1; 591 AnyEvent::Util::fh_nonblocking $self->{fh}, 1;
546 592
547 $self->{_activity} = 593 $self->{_activity} =
548 $self->{_ractivity} = 594 $self->{_ractivity} =
549 $self->{_wactivity} = AE::now; 595 $self->{_wactivity} = AE::now;
596
597 $self->{read_size} ||= 2048;
598 $self->{max_read_size} = $self->{read_size}
599 if $self->{read_size} > ($self->{max_read_size} || MAX_READ_SIZE);
550 600
551 $self->timeout (delete $self->{timeout} ) if $self->{timeout}; 601 $self->timeout (delete $self->{timeout} ) if $self->{timeout};
552 $self->rtimeout (delete $self->{rtimeout} ) if $self->{rtimeout}; 602 $self->rtimeout (delete $self->{rtimeout} ) if $self->{rtimeout};
553 $self->wtimeout (delete $self->{wtimeout} ) if $self->{wtimeout}; 603 $self->wtimeout (delete $self->{wtimeout} ) if $self->{wtimeout};
554 604
723 773
724=item $handle->rbuf_max ($max_octets) 774=item $handle->rbuf_max ($max_octets)
725 775
726Configures the C<rbuf_max> setting (C<undef> disables it). 776Configures the C<rbuf_max> setting (C<undef> disables it).
727 777
778=item $handle->wbuf_max ($max_octets)
779
780Configures the C<wbuf_max> setting (C<undef> disables it).
781
728=cut 782=cut
729 783
730sub rbuf_max { 784sub rbuf_max {
731 $_[0]{rbuf_max} = $_[1]; 785 $_[0]{rbuf_max} = $_[1];
732} 786}
733 787
788sub wbuf_max {
789 $_[0]{wbuf_max} = $_[1];
790}
791
734############################################################################# 792#############################################################################
735 793
736=item $handle->timeout ($seconds) 794=item $handle->timeout ($seconds)
737 795
738=item $handle->rtimeout ($seconds) 796=item $handle->rtimeout ($seconds)
739 797
740=item $handle->wtimeout ($seconds) 798=item $handle->wtimeout ($seconds)
741 799
742Configures (or disables) the inactivity timeout. 800Configures (or disables) the inactivity timeout.
801
802The timeout will be checked instantly, so this method might destroy the
803handle before it returns.
743 804
744=item $handle->timeout_reset 805=item $handle->timeout_reset
745 806
746=item $handle->rtimeout_reset 807=item $handle->rtimeout_reset
747 808
831 892
832The write queue is very simple: you can add data to its end, and 893The write queue is very simple: you can add data to its end, and
833AnyEvent::Handle will automatically try to get rid of it for you. 894AnyEvent::Handle will automatically try to get rid of it for you.
834 895
835When data could be written and the write buffer is shorter then the low 896When data could be written and the write buffer is shorter then the low
836water mark, the C<on_drain> callback will be invoked. 897water mark, the C<on_drain> callback will be invoked once.
837 898
838=over 4 899=over 4
839 900
840=item $handle->on_drain ($cb) 901=item $handle->on_drain ($cb)
841 902
856 if $cb && $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf}); 917 if $cb && $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf});
857} 918}
858 919
859=item $handle->push_write ($data) 920=item $handle->push_write ($data)
860 921
861Queues the given scalar to be written. You can push as much data as you 922Queues the given scalar to be written. You can push as much data as
862want (only limited by the available memory), as C<AnyEvent::Handle> 923you want (only limited by the available memory and C<wbuf_max>), as
863buffers it independently of the kernel. 924C<AnyEvent::Handle> buffers it independently of the kernel.
864 925
865This method may invoke callbacks (and therefore the handle might be 926This method may invoke callbacks (and therefore the handle might be
866destroyed after it returns). 927destroyed after it returns).
867 928
868=cut 929=cut
896 $cb->() unless $self->{autocork}; 957 $cb->() unless $self->{autocork};
897 958
898 # if still data left in wbuf, we need to poll 959 # if still data left in wbuf, we need to poll
899 $self->{_ww} = AE::io $self->{fh}, 1, $cb 960 $self->{_ww} = AE::io $self->{fh}, 1, $cb
900 if length $self->{wbuf}; 961 if length $self->{wbuf};
962
963 if (
964 defined $self->{wbuf_max}
965 && $self->{wbuf_max} < length $self->{wbuf}
966 ) {
967 $self->_error (Errno::ENOSPC, 1), return;
968 }
901 }; 969 };
902} 970}
903 971
904our %WH; 972our %WH;
905 973
1025=cut 1093=cut
1026 1094
1027register_write_type storable => sub { 1095register_write_type storable => sub {
1028 my ($self, $ref) = @_; 1096 my ($self, $ref) = @_;
1029 1097
1030 require Storable; 1098 require Storable unless $Storable::VERSION;
1031 1099
1032 pack "w/a*", Storable::nfreeze ($ref) 1100 pack "w/a*", Storable::nfreeze ($ref)
1033}; 1101};
1034 1102
1035=back 1103=back
1040before it was actually written. One way to do that is to replace your 1108before it was actually written. One way to do that is to replace your
1041C<on_drain> handler by a callback that shuts down the socket (and set 1109C<on_drain> handler by a callback that shuts down the socket (and set
1042C<low_water_mark> to C<0>). This method is a shorthand for just that, and 1110C<low_water_mark> to C<0>). This method is a shorthand for just that, and
1043replaces the C<on_drain> callback with: 1111replaces the C<on_drain> callback with:
1044 1112
1045 sub { shutdown $_[0]{fh}, 1 } # for push_shutdown 1113 sub { shutdown $_[0]{fh}, 1 }
1046 1114
1047This simply shuts down the write side and signals an EOF condition to the 1115This simply shuts down the write side and signals an EOF condition to the
1048the peer. 1116the peer.
1049 1117
1050You can rely on the normal read queue and C<on_eof> handling 1118You can rely on the normal read queue and C<on_eof> handling
1072 1140
1073Whenever the given C<type> is used, C<push_write> will the function with 1141Whenever the given C<type> is used, C<push_write> will the function with
1074the handle object and the remaining arguments. 1142the handle object and the remaining arguments.
1075 1143
1076The function is supposed to return a single octet string that will be 1144The function is supposed to return a single octet string that will be
1077appended to the write buffer, so you cna mentally treat this function as a 1145appended to the write buffer, so you can mentally treat this function as a
1078"arguments to on-the-wire-format" converter. 1146"arguments to on-the-wire-format" converter.
1079 1147
1080Example: implement a custom write type C<join> that joins the remaining 1148Example: implement a custom write type C<join> that joins the remaining
1081arguments using the first one. 1149arguments using the first one.
1082 1150
1376data. 1444data.
1377 1445
1378Example: read 2 bytes. 1446Example: read 2 bytes.
1379 1447
1380 $handle->push_read (chunk => 2, sub { 1448 $handle->push_read (chunk => 2, sub {
1381 warn "yay ", unpack "H*", $_[1]; 1449 say "yay " . unpack "H*", $_[1];
1382 }); 1450 });
1383 1451
1384=cut 1452=cut
1385 1453
1386register_read_type chunk => sub { 1454register_read_type chunk => sub {
1420 if (@_ < 3) { 1488 if (@_ < 3) {
1421 # this is more than twice as fast as the generic code below 1489 # this is more than twice as fast as the generic code below
1422 sub { 1490 sub {
1423 $_[0]{rbuf} =~ s/^([^\015\012]*)(\015?\012)// or return; 1491 $_[0]{rbuf} =~ s/^([^\015\012]*)(\015?\012)// or return;
1424 1492
1425 $cb->($_[0], $1, $2); 1493 $cb->($_[0], "$1", "$2");
1426 1 1494 1
1427 } 1495 }
1428 } else { 1496 } else {
1429 $eol = quotemeta $eol unless ref $eol; 1497 $eol = quotemeta $eol unless ref $eol;
1430 $eol = qr|^(.*?)($eol)|s; 1498 $eol = qr|^(.*?)($eol)|s;
1431 1499
1432 sub { 1500 sub {
1433 $_[0]{rbuf} =~ s/$eol// or return; 1501 $_[0]{rbuf} =~ s/$eol// or return;
1434 1502
1435 $cb->($_[0], $1, $2); 1503 $cb->($_[0], "$1", "$2");
1436 1 1504 1
1437 } 1505 }
1438 } 1506 }
1439}; 1507};
1440 1508
1488 1556
1489 sub { 1557 sub {
1490 # accept 1558 # accept
1491 if ($$rbuf =~ $accept) { 1559 if ($$rbuf =~ $accept) {
1492 $data .= substr $$rbuf, 0, $+[0], ""; 1560 $data .= substr $$rbuf, 0, $+[0], "";
1493 $cb->($self, $data); 1561 $cb->($_[0], $data);
1494 return 1; 1562 return 1;
1495 } 1563 }
1496 1564
1497 # reject 1565 # reject
1498 if ($reject && $$rbuf =~ $reject) { 1566 if ($reject && $$rbuf =~ $reject) {
1499 $self->_error (Errno::EBADMSG); 1567 $_[0]->_error (Errno::EBADMSG);
1500 } 1568 }
1501 1569
1502 # skip 1570 # skip
1503 if ($skip && $$rbuf =~ $skip) { 1571 if ($skip && $$rbuf =~ $skip) {
1504 $data .= substr $$rbuf, 0, $+[0], ""; 1572 $data .= substr $$rbuf, 0, $+[0], "";
1520 my ($self, $cb) = @_; 1588 my ($self, $cb) = @_;
1521 1589
1522 sub { 1590 sub {
1523 unless ($_[0]{rbuf} =~ s/^(0|[1-9][0-9]*)://) { 1591 unless ($_[0]{rbuf} =~ s/^(0|[1-9][0-9]*)://) {
1524 if ($_[0]{rbuf} =~ /[^0-9]/) { 1592 if ($_[0]{rbuf} =~ /[^0-9]/) {
1525 $self->_error (Errno::EBADMSG); 1593 $_[0]->_error (Errno::EBADMSG);
1526 } 1594 }
1527 return; 1595 return;
1528 } 1596 }
1529 1597
1530 my $len = $1; 1598 my $len = $1;
1531 1599
1532 $self->unshift_read (chunk => $len, sub { 1600 $_[0]->unshift_read (chunk => $len, sub {
1533 my $string = $_[1]; 1601 my $string = $_[1];
1534 $_[0]->unshift_read (chunk => 1, sub { 1602 $_[0]->unshift_read (chunk => 1, sub {
1535 if ($_[1] eq ",") { 1603 if ($_[1] eq ",") {
1536 $cb->($_[0], $string); 1604 $cb->($_[0], $string);
1537 } else { 1605 } else {
1538 $self->_error (Errno::EBADMSG); 1606 $_[0]->_error (Errno::EBADMSG);
1539 } 1607 }
1540 }); 1608 });
1541 }); 1609 });
1542 1610
1543 1 1611 1
1616 1684
1617 my $data; 1685 my $data;
1618 my $rbuf = \$self->{rbuf}; 1686 my $rbuf = \$self->{rbuf};
1619 1687
1620 sub { 1688 sub {
1621 my $ref = eval { $json->incr_parse ($self->{rbuf}) }; 1689 my $ref = eval { $json->incr_parse ($_[0]{rbuf}) };
1622 1690
1623 if ($ref) { 1691 if ($ref) {
1624 $self->{rbuf} = $json->incr_text; 1692 $_[0]{rbuf} = $json->incr_text;
1625 $json->incr_text = ""; 1693 $json->incr_text = "";
1626 $cb->($self, $ref); 1694 $cb->($_[0], $ref);
1627 1695
1628 1 1696 1
1629 } elsif ($@) { 1697 } elsif ($@) {
1630 # error case 1698 # error case
1631 $json->incr_skip; 1699 $json->incr_skip;
1632 1700
1633 $self->{rbuf} = $json->incr_text; 1701 $_[0]{rbuf} = $json->incr_text;
1634 $json->incr_text = ""; 1702 $json->incr_text = "";
1635 1703
1636 $self->_error (Errno::EBADMSG); 1704 $_[0]->_error (Errno::EBADMSG);
1637 1705
1638 () 1706 ()
1639 } else { 1707 } else {
1640 $self->{rbuf} = ""; 1708 $_[0]{rbuf} = "";
1641 1709
1642 () 1710 ()
1643 } 1711 }
1644 } 1712 }
1645}; 1713};
1655=cut 1723=cut
1656 1724
1657register_read_type storable => sub { 1725register_read_type storable => sub {
1658 my ($self, $cb) = @_; 1726 my ($self, $cb) = @_;
1659 1727
1660 require Storable; 1728 require Storable unless $Storable::VERSION;
1661 1729
1662 sub { 1730 sub {
1663 # when we can use 5.10 we can use ".", but for 5.8 we use the re-pack method 1731 # when we can use 5.10 we can use ".", but for 5.8 we use the re-pack method
1664 defined (my $len = eval { unpack "w", $_[0]{rbuf} }) 1732 defined (my $len = eval { unpack "w", $_[0]{rbuf} })
1665 or return; 1733 or return;
1668 1736
1669 # bypass unshift if we already have the remaining chunk 1737 # bypass unshift if we already have the remaining chunk
1670 if ($format + $len <= length $_[0]{rbuf}) { 1738 if ($format + $len <= length $_[0]{rbuf}) {
1671 my $data = substr $_[0]{rbuf}, $format, $len; 1739 my $data = substr $_[0]{rbuf}, $format, $len;
1672 substr $_[0]{rbuf}, 0, $format + $len, ""; 1740 substr $_[0]{rbuf}, 0, $format + $len, "";
1741
1673 $cb->($_[0], Storable::thaw ($data)); 1742 eval { $cb->($_[0], Storable::thaw ($data)); 1 }
1743 or return $_[0]->_error (Errno::EBADMSG);
1674 } else { 1744 } else {
1675 # remove prefix 1745 # remove prefix
1676 substr $_[0]{rbuf}, 0, $format, ""; 1746 substr $_[0]{rbuf}, 0, $format, "";
1677 1747
1678 # read remaining chunk 1748 # read remaining chunk
1679 $_[0]->unshift_read (chunk => $len, sub { 1749 $_[0]->unshift_read (chunk => $len, sub {
1680 if (my $ref = eval { Storable::thaw ($_[1]) }) { 1750 eval { $cb->($_[0], Storable::thaw ($_[1])); 1 }
1681 $cb->($_[0], $ref);
1682 } else {
1683 $self->_error (Errno::EBADMSG); 1751 or $_[0]->_error (Errno::EBADMSG);
1684 }
1685 }); 1752 });
1686 } 1753 }
1687 1754
1688 1 1755 1
1689 } 1756 }
1726Note that AnyEvent::Handle will automatically C<start_read> for you when 1793Note that AnyEvent::Handle will automatically C<start_read> for you when
1727you change the C<on_read> callback or push/unshift a read callback, and it 1794you change the C<on_read> callback or push/unshift a read callback, and it
1728will automatically C<stop_read> for you when neither C<on_read> is set nor 1795will automatically C<stop_read> for you when neither C<on_read> is set nor
1729there are any read requests in the queue. 1796there are any read requests in the queue.
1730 1797
1731These methods will have no effect when in TLS mode (as TLS doesn't support 1798In older versions of this module (<= 5.3), these methods had no effect,
1732half-duplex connections). 1799as TLS does not support half-duplex connections. In current versions they
1800work as expected, as this behaviour is required to avoid certain resource
1801attacks, where the program would be forced to read (and buffer) arbitrary
1802amounts of data before being able to send some data. The drawback is that
1803some readings of the the SSL/TLS specifications basically require this
1804attack to be working, as SSL/TLS implementations might stall sending data
1805during a rehandshake.
1806
1807As a guideline, during the initial handshake, you should not stop reading,
1808and as a client, it might cause problems, depending on your application.
1733 1809
1734=cut 1810=cut
1735 1811
1736sub stop_read { 1812sub stop_read {
1737 my ($self) = @_; 1813 my ($self) = @_;
1738 1814
1739 delete $self->{_rw} unless $self->{tls}; 1815 delete $self->{_rw};
1740} 1816}
1741 1817
1742sub start_read { 1818sub start_read {
1743 my ($self) = @_; 1819 my ($self) = @_;
1744 1820
1745 unless ($self->{_rw} || $self->{_eof} || !$self->{fh}) { 1821 unless ($self->{_rw} || $self->{_eof} || !$self->{fh}) {
1746 Scalar::Util::weaken $self; 1822 Scalar::Util::weaken $self;
1747 1823
1748 $self->{_rw} = AE::io $self->{fh}, 0, sub { 1824 $self->{_rw} = AE::io $self->{fh}, 0, sub {
1749 my $rbuf = \($self->{tls} ? my $buf : $self->{rbuf}); 1825 my $rbuf = \($self->{tls} ? my $buf : $self->{rbuf});
1750 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size} || 8192, length $$rbuf; 1826 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size}, length $$rbuf;
1751 1827
1752 if ($len > 0) { 1828 if ($len > 0) {
1753 $self->{_activity} = $self->{_ractivity} = AE::now; 1829 $self->{_activity} = $self->{_ractivity} = AE::now;
1754 1830
1755 if ($self->{tls}) { 1831 if ($self->{tls}) {
1758 &_dotls ($self); 1834 &_dotls ($self);
1759 } else { 1835 } else {
1760 $self->_drain_rbuf; 1836 $self->_drain_rbuf;
1761 } 1837 }
1762 1838
1839 if ($len == $self->{read_size}) {
1840 $self->{read_size} *= 2;
1841 $self->{read_size} = $self->{max_read_size} || MAX_READ_SIZE
1842 if $self->{read_size} > ($self->{max_read_size} || MAX_READ_SIZE);
1843 }
1844
1763 } elsif (defined $len) { 1845 } elsif (defined $len) {
1764 delete $self->{_rw}; 1846 delete $self->{_rw};
1765 $self->{_eof} = 1; 1847 $self->{_eof} = 1;
1766 $self->_drain_rbuf; 1848 $self->_drain_rbuf;
1767 1849
1779 my ($self, $err) = @_; 1861 my ($self, $err) = @_;
1780 1862
1781 return $self->_error ($!, 1) 1863 return $self->_error ($!, 1)
1782 if $err == Net::SSLeay::ERROR_SYSCALL (); 1864 if $err == Net::SSLeay::ERROR_SYSCALL ();
1783 1865
1784 my $err =Net::SSLeay::ERR_error_string (Net::SSLeay::ERR_get_error ()); 1866 my $err = Net::SSLeay::ERR_error_string (Net::SSLeay::ERR_get_error ());
1785 1867
1786 # reduce error string to look less scary 1868 # reduce error string to look less scary
1787 $err =~ s/^error:[0-9a-fA-F]{8}:[^:]+:([^:]+):/\L$1: /; 1869 $err =~ s/^error:[0-9a-fA-F]{8}:[^:]+:([^:]+):/\L$1: /;
1788 1870
1789 if ($self->{_on_starttls}) { 1871 if ($self->{_on_starttls}) {
1940 Net::SSLeay::CTX_set_mode ($tls, 1|2); 2022 Net::SSLeay::CTX_set_mode ($tls, 1|2);
1941 2023
1942 $self->{_rbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ()); 2024 $self->{_rbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ());
1943 $self->{_wbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ()); 2025 $self->{_wbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ());
1944 2026
1945 Net::SSLeay::BIO_write ($self->{_rbio}, delete $self->{rbuf}); 2027 Net::SSLeay::BIO_write ($self->{_rbio}, $self->{rbuf});
2028 $self->{rbuf} = "";
1946 2029
1947 Net::SSLeay::set_bio ($tls, $self->{_rbio}, $self->{_wbio}); 2030 Net::SSLeay::set_bio ($tls, $self->{_rbio}, $self->{_wbio});
1948 2031
1949 $self->{_on_starttls} = sub { $_[0]{on_starttls}(@_) } 2032 $self->{_on_starttls} = sub { $_[0]{on_starttls}(@_) }
1950 if $self->{on_starttls}; 2033 if $self->{on_starttls};
1987 $self->{tls_ctx}->_put_session (delete $self->{tls}) 2070 $self->{tls_ctx}->_put_session (delete $self->{tls})
1988 if $self->{tls} > 0; 2071 if $self->{tls} > 0;
1989 2072
1990 delete @$self{qw(_rbio _wbio _tls_wbuf _on_starttls)}; 2073 delete @$self{qw(_rbio _wbio _tls_wbuf _on_starttls)};
1991} 2074}
2075
2076=item $handle->resettls
2077
2078This rarely-used method simply resets and TLS state on the handle, usually
2079causing data loss.
2080
2081One case where it may be useful is when you want to skip over the data in
2082the stream but you are not interested in interpreting it, so data loss is
2083no concern.
2084
2085=cut
2086
2087*resettls = \&_freetls;
1992 2088
1993sub DESTROY { 2089sub DESTROY {
1994 my ($self) = @_; 2090 my ($self) = @_;
1995 2091
1996 &_freetls; 2092 &_freetls;
2112 2208
2113It is only safe to "forget" the reference inside EOF or error callbacks, 2209It is only safe to "forget" the reference inside EOF or error callbacks,
2114from within all other callbacks, you need to explicitly call the C<< 2210from within all other callbacks, you need to explicitly call the C<<
2115->destroy >> method. 2211->destroy >> method.
2116 2212
2213=item Why is my C<on_eof> callback never called?
2214
2215Probably because your C<on_error> callback is being called instead: When
2216you have outstanding requests in your read queue, then an EOF is
2217considered an error as you clearly expected some data.
2218
2219To avoid this, make sure you have an empty read queue whenever your handle
2220is supposed to be "idle" (i.e. connection closes are O.K.). You can set
2221an C<on_read> handler that simply pushes the first read requests in the
2222queue.
2223
2224See also the next question, which explains this in a bit more detail.
2225
2226=item How can I serve requests in a loop?
2227
2228Most protocols consist of some setup phase (authentication for example)
2229followed by a request handling phase, where the server waits for requests
2230and handles them, in a loop.
2231
2232There are two important variants: The first (traditional, better) variant
2233handles requests until the server gets some QUIT command, causing it to
2234close the connection first (highly desirable for a busy TCP server). A
2235client dropping the connection is an error, which means this variant can
2236detect an unexpected detection close.
2237
2238To handle this case, always make sure you have a on-empty read queue, by
2239pushing the "read request start" handler on it:
2240
2241 # we assume a request starts with a single line
2242 my @start_request; @start_request = (line => sub {
2243 my ($hdl, $line) = @_;
2244
2245 ... handle request
2246
2247 # push next request read, possibly from a nested callback
2248 $hdl->push_read (@start_request);
2249 });
2250
2251 # auth done, now go into request handling loop
2252 # now push the first @start_request
2253 $hdl->push_read (@start_request);
2254
2255By always having an outstanding C<push_read>, the handle always expects
2256some data and raises the C<EPIPE> error when the connction is dropped
2257unexpectedly.
2258
2259The second variant is a protocol where the client can drop the connection
2260at any time. For TCP, this means that the server machine may run out of
2261sockets easier, and in general, it means you cannot distinguish a protocl
2262failure/client crash from a normal connection close. Nevertheless, these
2263kinds of protocols are common (and sometimes even the best solution to the
2264problem).
2265
2266Having an outstanding read request at all times is possible if you ignore
2267C<EPIPE> errors, but this doesn't help with when the client drops the
2268connection during a request, which would still be an error.
2269
2270A better solution is to push the initial request read in an C<on_read>
2271callback. This avoids an error, as when the server doesn't expect data
2272(i.e. is idly waiting for the next request, an EOF will not raise an
2273error, but simply result in an C<on_eof> callback. It is also a bit slower
2274and simpler:
2275
2276 # auth done, now go into request handling loop
2277 $hdl->on_read (sub {
2278 my ($hdl) = @_;
2279
2280 # called each time we receive data but the read queue is empty
2281 # simply start read the request
2282
2283 $hdl->push_read (line => sub {
2284 my ($hdl, $line) = @_;
2285
2286 ... handle request
2287
2288 # do nothing special when the request has been handled, just
2289 # let the request queue go empty.
2290 });
2291 });
2292
2117=item I get different callback invocations in TLS mode/Why can't I pause 2293=item I get different callback invocations in TLS mode/Why can't I pause
2118reading? 2294reading?
2119 2295
2120Unlike, say, TCP, TLS connections do not consist of two independent 2296Unlike, say, TCP, TLS connections do not consist of two independent
2121communication channels, one for each direction. Or put differently, the 2297communication channels, one for each direction. Or put differently, the
2142 $handle->on_eof (undef); 2318 $handle->on_eof (undef);
2143 $handle->on_error (sub { 2319 $handle->on_error (sub {
2144 my $data = delete $_[0]{rbuf}; 2320 my $data = delete $_[0]{rbuf};
2145 }); 2321 });
2146 2322
2323Note that this example removes the C<rbuf> member from the handle object,
2324which is not normally allowed by the API. It is expressly permitted in
2325this case only, as the handle object needs to be destroyed afterwards.
2326
2147The reason to use C<on_error> is that TCP connections, due to latencies 2327The reason to use C<on_error> is that TCP connections, due to latencies
2148and packets loss, might get closed quite violently with an error, when in 2328and packets loss, might get closed quite violently with an error, when in
2149fact all data has been received. 2329fact all data has been received.
2150 2330
2151It is usually better to use acknowledgements when transferring data, 2331It is usually better to use acknowledgements when transferring data,
2161C<low_water_mark> this will be called precisely when all data has been 2341C<low_water_mark> this will be called precisely when all data has been
2162written to the socket: 2342written to the socket:
2163 2343
2164 $handle->push_write (...); 2344 $handle->push_write (...);
2165 $handle->on_drain (sub { 2345 $handle->on_drain (sub {
2166 warn "all data submitted to the kernel\n"; 2346 AE::log debug => "All data submitted to the kernel.";
2167 undef $handle; 2347 undef $handle;
2168 }); 2348 });
2169 2349
2170If you just want to queue some data and then signal EOF to the other side, 2350If you just want to queue some data and then signal EOF to the other side,
2171consider using C<< ->push_shutdown >> instead. 2351consider using C<< ->push_shutdown >> instead.
2255When you have intermediate CA certificates that your clients might not 2435When you have intermediate CA certificates that your clients might not
2256know about, just append them to the C<cert_file>. 2436know about, just append them to the C<cert_file>.
2257 2437
2258=back 2438=back
2259 2439
2260
2261=head1 SUBCLASSING AnyEvent::Handle 2440=head1 SUBCLASSING AnyEvent::Handle
2262 2441
2263In many cases, you might want to subclass AnyEvent::Handle. 2442In many cases, you might want to subclass AnyEvent::Handle.
2264 2443
2265To make this easier, a given version of AnyEvent::Handle uses these 2444To make this easier, a given version of AnyEvent::Handle uses these
2291 2470
2292Robin Redeker C<< <elmex at ta-sa.org> >>, Marc Lehmann <schmorp@schmorp.de>. 2471Robin Redeker C<< <elmex at ta-sa.org> >>, Marc Lehmann <schmorp@schmorp.de>.
2293 2472
2294=cut 2473=cut
2295 2474
22961; # End of AnyEvent::Handle 24751
2476

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines