ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent/lib/AnyEvent/Handle.pm
(Generate patch)

Comparing AnyEvent/lib/AnyEvent/Handle.pm (file contents):
Revision 1.185 by root, Thu Sep 3 19:48:27 2009 UTC vs.
Revision 1.245 by root, Tue Jun 16 03:22:20 2015 UTC

1=head1 NAME 1=head1 NAME
2 2
3AnyEvent::Handle - non-blocking I/O on file handles via AnyEvent 3AnyEvent::Handle - non-blocking I/O on streaming handles via AnyEvent
4 4
5=head1 SYNOPSIS 5=head1 SYNOPSIS
6 6
7 use AnyEvent; 7 use AnyEvent;
8 use AnyEvent::Handle; 8 use AnyEvent::Handle;
11 11
12 my $hdl; $hdl = new AnyEvent::Handle 12 my $hdl; $hdl = new AnyEvent::Handle
13 fh => \*STDIN, 13 fh => \*STDIN,
14 on_error => sub { 14 on_error => sub {
15 my ($hdl, $fatal, $msg) = @_; 15 my ($hdl, $fatal, $msg) = @_;
16 warn "got error $msg\n"; 16 AE::log error => $msg;
17 $hdl->destroy; 17 $hdl->destroy;
18 $cv->send; 18 $cv->send;
19 ); 19 };
20 20
21 # send some request line 21 # send some request line
22 $hdl->push_write ("getinfo\015\012"); 22 $hdl->push_write ("getinfo\015\012");
23 23
24 # read the response line 24 # read the response line
25 $hdl->push_read (line => sub { 25 $hdl->push_read (line => sub {
26 my ($hdl, $line) = @_; 26 my ($hdl, $line) = @_;
27 warn "got line <$line>\n"; 27 say "got line <$line>";
28 $cv->send; 28 $cv->send;
29 }); 29 });
30 30
31 $cv->recv; 31 $cv->recv;
32 32
33=head1 DESCRIPTION 33=head1 DESCRIPTION
34 34
35This module is a helper module to make it easier to do event-based I/O on 35This is a helper module to make it easier to do event-based I/O on
36filehandles. 36stream-based filehandles (sockets, pipes, and other stream things).
37 37
38The L<AnyEvent::Intro> tutorial contains some well-documented 38The L<AnyEvent::Intro> tutorial contains some well-documented
39AnyEvent::Handle examples. 39AnyEvent::Handle examples.
40 40
41In the following, when the documentation refers to of "bytes" then this 41In the following, where the documentation refers to "bytes", it means
42means characters. As sysread and syswrite are used for all I/O, their 42characters. As sysread and syswrite are used for all I/O, their
43treatment of characters applies to this module as well. 43treatment of characters applies to this module as well.
44 44
45At the very minimum, you should specify C<fh> or C<connect>, and the 45At the very minimum, you should specify C<fh> or C<connect>, and the
46C<on_error> callback. 46C<on_error> callback.
47 47
53package AnyEvent::Handle; 53package AnyEvent::Handle;
54 54
55use Scalar::Util (); 55use Scalar::Util ();
56use List::Util (); 56use List::Util ();
57use Carp (); 57use Carp ();
58use Errno qw(EAGAIN EINTR); 58use Errno qw(EAGAIN EWOULDBLOCK EINTR);
59 59
60use AnyEvent (); BEGIN { AnyEvent::common_sense } 60use AnyEvent (); BEGIN { AnyEvent::common_sense }
61use AnyEvent::Util qw(WSAEWOULDBLOCK); 61use AnyEvent::Util qw(WSAEWOULDBLOCK);
62 62
63our $VERSION = $AnyEvent::VERSION; 63our $VERSION = $AnyEvent::VERSION;
75 } 75 }
76 76
77 \&$func 77 \&$func
78} 78}
79 79
80sub MAX_READ_SIZE() { 131072 }
81
80=head1 METHODS 82=head1 METHODS
81 83
82=over 4 84=over 4
83 85
84=item $handle = B<new> AnyEvent::TLS fh => $filehandle, key => value... 86=item $handle = B<new> AnyEvent::Handle fh => $filehandle, key => value...
85 87
86The constructor supports these arguments (all as C<< key => value >> pairs). 88The constructor supports these arguments (all as C<< key => value >> pairs).
87 89
88=over 4 90=over 4
89 91
90=item fh => $filehandle [C<fh> or C<connect> MANDATORY] 92=item fh => $filehandle [C<fh> or C<connect> MANDATORY]
91 93
92The filehandle this L<AnyEvent::Handle> object will operate on. 94The filehandle this L<AnyEvent::Handle> object will operate on.
93NOTE: The filehandle will be set to non-blocking mode (using 95NOTE: The filehandle will be set to non-blocking mode (using
94C<AnyEvent::Util::fh_nonblocking>) by the constructor and needs to stay in 96C<AnyEvent::fh_unblock>) by the constructor and needs to stay in
95that mode. 97that mode.
96 98
97=item connect => [$host, $service] [C<fh> or C<connect> MANDATORY] 99=item connect => [$host, $service] [C<fh> or C<connect> MANDATORY]
98 100
99Try to connect to the specified host and service (port), using 101Try to connect to the specified host and service (port), using
112=over 4 114=over 4
113 115
114=item on_prepare => $cb->($handle) 116=item on_prepare => $cb->($handle)
115 117
116This (rarely used) callback is called before a new connection is 118This (rarely used) callback is called before a new connection is
117attempted, but after the file handle has been created. It could be used to 119attempted, but after the file handle has been created (you can access that
120file handle via C<< $handle->{fh} >>). It could be used to prepare the
118prepare the file handle with parameters required for the actual connect 121file handle with parameters required for the actual connect (as opposed to
119(as opposed to settings that can be changed when the connection is already 122settings that can be changed when the connection is already established).
120established).
121 123
122The return value of this callback should be the connect timeout value in 124The return value of this callback should be the connect timeout value in
123seconds (or C<0>, or C<undef>, or the empty list, to indicate the default 125seconds (or C<0>, or C<undef>, or the empty list, to indicate that the
124timeout is to be used). 126default timeout is to be used).
125 127
126=item on_connect => $cb->($handle, $host, $port, $retry->()) 128=item on_connect => $cb->($handle, $host, $port, $retry->())
127 129
128This callback is called when a connection has been successfully established. 130This callback is called when a connection has been successfully established.
129 131
130The actual numeric host and port (the socket peername) are passed as 132The peer's numeric host and port (the socket peername) are passed as
131parameters, together with a retry callback. 133parameters, together with a retry callback. At the time it is called the
134read and write queues, EOF status, TLS status and similar properties of
135the handle will have been reset.
132 136
133When, for some reason, the handle is not acceptable, then calling 137If, for some reason, the handle is not acceptable, calling C<$retry> will
134C<$retry> will continue with the next conenction target (in case of 138continue with the next connection target (in case of multi-homed hosts or
135multi-homed hosts or SRV records there can be multiple connection 139SRV records there can be multiple connection endpoints). The C<$retry>
136endpoints). When it is called then the read and write queues, eof status, 140callback can be invoked after the connect callback returns, i.e. one can
137tls status and similar properties of the handle are being reset. 141start a handshake and then decide to retry with the next host if the
142handshake fails.
138 143
139In most cases, ignoring the C<$retry> parameter is the way to go. 144In most cases, you should ignore the C<$retry> parameter.
140 145
141=item on_connect_error => $cb->($handle, $message) 146=item on_connect_error => $cb->($handle, $message)
142 147
143This callback is called when the conenction could not be 148This callback is called when the connection could not be
144established. C<$!> will contain the relevant error code, and C<$message> a 149established. C<$!> will contain the relevant error code, and C<$message> a
145message describing it (usually the same as C<"$!">). 150message describing it (usually the same as C<"$!">).
146 151
147If this callback isn't specified, then C<on_error> will be called with a 152If this callback isn't specified, then C<on_error> will be called with a
148fatal error instead. 153fatal error instead.
151 156
152=item on_error => $cb->($handle, $fatal, $message) 157=item on_error => $cb->($handle, $fatal, $message)
153 158
154This is the error callback, which is called when, well, some error 159This is the error callback, which is called when, well, some error
155occured, such as not being able to resolve the hostname, failure to 160occured, such as not being able to resolve the hostname, failure to
156connect or a read error. 161connect, or a read error.
157 162
158Some errors are fatal (which is indicated by C<$fatal> being true). On 163Some errors are fatal (which is indicated by C<$fatal> being true). On
159fatal errors the handle object will be destroyed (by a call to C<< -> 164fatal errors the handle object will be destroyed (by a call to C<< ->
160destroy >>) after invoking the error callback (which means you are free to 165destroy >>) after invoking the error callback (which means you are free to
161examine the handle object). Examples of fatal errors are an EOF condition 166examine the handle object). Examples of fatal errors are an EOF condition
162with active (but unsatisifable) read watchers (C<EPIPE>) or I/O errors. In 167with active (but unsatisfiable) read watchers (C<EPIPE>) or I/O errors. In
163cases where the other side can close the connection at their will it is 168cases where the other side can close the connection at will, it is
164often easiest to not report C<EPIPE> errors in this callback. 169often easiest to not report C<EPIPE> errors in this callback.
165 170
166AnyEvent::Handle tries to find an appropriate error code for you to check 171AnyEvent::Handle tries to find an appropriate error code for you to check
167against, but in some cases (TLS errors), this does not work well. It is 172against, but in some cases (TLS errors), this does not work well.
168recommended to always output the C<$message> argument in human-readable
169error messages (it's usually the same as C<"$!">).
170 173
174If you report the error to the user, it is recommended to always output
175the C<$message> argument in human-readable error messages (you don't need
176to report C<"$!"> if you report C<$message>).
177
178If you want to react programmatically to the error, then looking at C<$!>
179and comparing it against some of the documented C<Errno> values is usually
180better than looking at the C<$message>.
181
171Non-fatal errors can be retried by simply returning, but it is recommended 182Non-fatal errors can be retried by returning, but it is recommended
172to simply ignore this parameter and instead abondon the handle object 183to simply ignore this parameter and instead abondon the handle object
173when this callback is invoked. Examples of non-fatal errors are timeouts 184when this callback is invoked. Examples of non-fatal errors are timeouts
174C<ETIMEDOUT>) or badly-formatted data (C<EBADMSG>). 185C<ETIMEDOUT>) or badly-formatted data (C<EBADMSG>).
175 186
176On callback entrance, the value of C<$!> contains the operating system 187On entry to the callback, the value of C<$!> contains the operating
177error code (or C<ENOSPC>, C<EPIPE>, C<ETIMEDOUT>, C<EBADMSG> or 188system error code (or C<ENOSPC>, C<EPIPE>, C<ETIMEDOUT>, C<EBADMSG> or
178C<EPROTO>). 189C<EPROTO>).
179 190
180While not mandatory, it is I<highly> recommended to set this callback, as 191While not mandatory, it is I<highly> recommended to set this callback, as
181you will not be notified of errors otherwise. The default simply calls 192you will not be notified of errors otherwise. The default just calls
182C<croak>. 193C<croak>.
183 194
184=item on_read => $cb->($handle) 195=item on_read => $cb->($handle)
185 196
186This sets the default read callback, which is called when data arrives 197This sets the default read callback, which is called when data arrives
191To access (and remove data from) the read buffer, use the C<< ->rbuf >> 202To access (and remove data from) the read buffer, use the C<< ->rbuf >>
192method or access the C<< $handle->{rbuf} >> member directly. Note that you 203method or access the C<< $handle->{rbuf} >> member directly. Note that you
193must not enlarge or modify the read buffer, you can only remove data at 204must not enlarge or modify the read buffer, you can only remove data at
194the beginning from it. 205the beginning from it.
195 206
207You can also call C<< ->push_read (...) >> or any other function that
208modifies the read queue. Or do both. Or ...
209
196When an EOF condition is detected then AnyEvent::Handle will first try to 210When an EOF condition is detected, AnyEvent::Handle will first try to
197feed all the remaining data to the queued callbacks and C<on_read> before 211feed all the remaining data to the queued callbacks and C<on_read> before
198calling the C<on_eof> callback. If no progress can be made, then a fatal 212calling the C<on_eof> callback. If no progress can be made, then a fatal
199error will be raised (with C<$!> set to C<EPIPE>). 213error will be raised (with C<$!> set to C<EPIPE>).
200 214
201Note that, unlike requests in the read queue, an C<on_read> callback 215Note that, unlike requests in the read queue, an C<on_read> callback
219If an EOF condition has been detected but no C<on_eof> callback has been 233If an EOF condition has been detected but no C<on_eof> callback has been
220set, then a fatal error will be raised with C<$!> set to <0>. 234set, then a fatal error will be raised with C<$!> set to <0>.
221 235
222=item on_drain => $cb->($handle) 236=item on_drain => $cb->($handle)
223 237
224This sets the callback that is called when the write buffer becomes empty 238This sets the callback that is called once when the write buffer becomes
225(or when the callback is set and the buffer is empty already). 239empty (and immediately when the handle object is created).
226 240
227To append to the write buffer, use the C<< ->push_write >> method. 241To append to the write buffer, use the C<< ->push_write >> method.
228 242
229This callback is useful when you don't want to put all of your write data 243This callback is useful when you don't want to put all of your write data
230into the queue at once, for example, when you want to write the contents 244into the queue at once, for example, when you want to write the contents
242many seconds pass without a successful read or write on the underlying 256many seconds pass without a successful read or write on the underlying
243file handle (or a call to C<timeout_reset>), the C<on_timeout> callback 257file handle (or a call to C<timeout_reset>), the C<on_timeout> callback
244will be invoked (and if that one is missing, a non-fatal C<ETIMEDOUT> 258will be invoked (and if that one is missing, a non-fatal C<ETIMEDOUT>
245error will be raised). 259error will be raised).
246 260
247There are three variants of the timeouts that work fully independent 261There are three variants of the timeouts that work independently of each
248of each other, for both read and write, just read, and just write: 262other, for both read and write (triggered when nothing was read I<OR>
263written), just read (triggered when nothing was read), and just write:
249C<timeout>, C<rtimeout> and C<wtimeout>, with corresponding callbacks 264C<timeout>, C<rtimeout> and C<wtimeout>, with corresponding callbacks
250C<on_timeout>, C<on_rtimeout> and C<on_wtimeout>, and reset functions 265C<on_timeout>, C<on_rtimeout> and C<on_wtimeout>, and reset functions
251C<timeout_reset>, C<rtimeout_reset>, and C<wtimeout_reset>. 266C<timeout_reset>, C<rtimeout_reset>, and C<wtimeout_reset>.
252 267
253Note that timeout processing is also active when you currently do not have 268Note that timeout processing is active even when you do not have any
254any outstanding read or write requests: If you plan to keep the connection 269outstanding read or write requests: If you plan to keep the connection
255idle then you should disable the timout temporarily or ignore the timeout 270idle then you should disable the timeout temporarily or ignore the
256in the C<on_timeout> callback, in which case AnyEvent::Handle will simply 271timeout in the corresponding C<on_timeout> callback, in which case
257restart the timeout. 272AnyEvent::Handle will simply restart the timeout.
258 273
259Zero (the default) disables this timeout. 274Zero (the default) disables the corresponding timeout.
260 275
261=item on_timeout => $cb->($handle) 276=item on_timeout => $cb->($handle)
277
278=item on_rtimeout => $cb->($handle)
279
280=item on_wtimeout => $cb->($handle)
262 281
263Called whenever the inactivity timeout passes. If you return from this 282Called whenever the inactivity timeout passes. If you return from this
264callback, then the timeout will be reset as if some activity had happened, 283callback, then the timeout will be reset as if some activity had happened,
265so this condition is not fatal in any way. 284so this condition is not fatal in any way.
266 285
274be configured to accept only so-and-so much data that it cannot act on 293be configured to accept only so-and-so much data that it cannot act on
275(for example, when expecting a line, an attacker could send an unlimited 294(for example, when expecting a line, an attacker could send an unlimited
276amount of data without a callback ever being called as long as the line 295amount of data without a callback ever being called as long as the line
277isn't finished). 296isn't finished).
278 297
298=item wbuf_max => <bytes>
299
300If defined, then a fatal error will be raised (with C<$!> set to C<ENOSPC>)
301when the write buffer ever (strictly) exceeds this size. This is useful to
302avoid some forms of denial-of-service attacks.
303
304Although the units of this parameter is bytes, this is the I<raw> number
305of bytes not yet accepted by the kernel. This can make a difference when
306you e.g. use TLS, as TLS typically makes your write data larger (but it
307can also make it smaller due to compression).
308
309As an example of when this limit is useful, take a chat server that sends
310chat messages to a client. If the client does not read those in a timely
311manner then the send buffer in the server would grow unbounded.
312
279=item autocork => <boolean> 313=item autocork => <boolean>
280 314
281When disabled (the default), then C<push_write> will try to immediately 315When disabled (the default), C<push_write> will try to immediately
282write the data to the handle, if possible. This avoids having to register 316write the data to the handle if possible. This avoids having to register
283a write watcher and wait for the next event loop iteration, but can 317a write watcher and wait for the next event loop iteration, but can
284be inefficient if you write multiple small chunks (on the wire, this 318be inefficient if you write multiple small chunks (on the wire, this
285disadvantage is usually avoided by your kernel's nagle algorithm, see 319disadvantage is usually avoided by your kernel's nagle algorithm, see
286C<no_delay>, but this option can save costly syscalls). 320C<no_delay>, but this option can save costly syscalls).
287 321
288When enabled, then writes will always be queued till the next event loop 322When enabled, writes will always be queued till the next event loop
289iteration. This is efficient when you do many small writes per iteration, 323iteration. This is efficient when you do many small writes per iteration,
290but less efficient when you do a single write only per iteration (or when 324but less efficient when you do a single write only per iteration (or when
291the write buffer often is full). It also increases write latency. 325the write buffer often is full). It also increases write latency.
292 326
293=item no_delay => <boolean> 327=item no_delay => <boolean>
297the Nagle algorithm, and usually it is beneficial. 331the Nagle algorithm, and usually it is beneficial.
298 332
299In some situations you want as low a delay as possible, which can be 333In some situations you want as low a delay as possible, which can be
300accomplishd by setting this option to a true value. 334accomplishd by setting this option to a true value.
301 335
302The default is your opertaing system's default behaviour (most likely 336The default is your operating system's default behaviour (most likely
303enabled), this option explicitly enables or disables it, if possible. 337enabled). This option explicitly enables or disables it, if possible.
304 338
305=item keepalive => <boolean> 339=item keepalive => <boolean>
306 340
307Enables (default disable) the SO_KEEPALIVE option on the stream socket: 341Enables (default disable) the SO_KEEPALIVE option on the stream socket:
308normally, TCP connections have no time-out once established, so TCP 342normally, TCP connections have no time-out once established, so TCP
309conenctions, once established, can stay alive forever even when the other 343connections, once established, can stay alive forever even when the other
310side has long gone. TCP keepalives are a cheap way to take down long-lived 344side has long gone. TCP keepalives are a cheap way to take down long-lived
311TCP connections whent he other side becomes unreachable. While the default 345TCP connections when the other side becomes unreachable. While the default
312is OS-dependent, TCP keepalives usually kick in after around two hours, 346is OS-dependent, TCP keepalives usually kick in after around two hours,
313and, if the other side doesn't reply, take down the TCP connection some 10 347and, if the other side doesn't reply, take down the TCP connection some 10
314to 15 minutes later. 348to 15 minutes later.
315 349
316It is harmless to specify this option for file handles that do not support 350It is harmless to specify this option for file handles that do not support
334already have occured on BSD systems), but at least it will protect you 368already have occured on BSD systems), but at least it will protect you
335from most attacks. 369from most attacks.
336 370
337=item read_size => <bytes> 371=item read_size => <bytes>
338 372
339The default read block size (the amount of bytes this module will 373The initial read block size, the number of bytes this module will try
340try to read during each loop iteration, which affects memory 374to read during each loop iteration. Each handle object will consume
341requirements). Default: C<8192>. 375at least this amount of memory for the read buffer as well, so when
376handling many connections watch out for memory requirements). See also
377C<max_read_size>. Default: C<2048>.
378
379=item max_read_size => <bytes>
380
381The maximum read buffer size used by the dynamic adjustment
382algorithm: Each time AnyEvent::Handle can read C<read_size> bytes in
383one go it will double C<read_size> up to the maximum given by this
384option. Default: C<131072> or C<read_size>, whichever is higher.
342 385
343=item low_water_mark => <bytes> 386=item low_water_mark => <bytes>
344 387
345Sets the amount of bytes (default: C<0>) that make up an "empty" write 388Sets the number of bytes (default: C<0>) that make up an "empty" write
346buffer: If the write reaches this size or gets even samller it is 389buffer: If the buffer reaches this size or gets even samller it is
347considered empty. 390considered empty.
348 391
349Sometimes it can be beneficial (for performance reasons) to add data to 392Sometimes it can be beneficial (for performance reasons) to add data to
350the write buffer before it is fully drained, but this is a rare case, as 393the write buffer before it is fully drained, but this is a rare case, as
351the operating system kernel usually buffers data as well, so the default 394the operating system kernel usually buffers data as well, so the default
352is good in almost all cases. 395is good in almost all cases.
353 396
354=item linger => <seconds> 397=item linger => <seconds>
355 398
356If non-zero (default: C<3600>), then the destructor of the 399If this is non-zero (default: C<3600>), the destructor of the
357AnyEvent::Handle object will check whether there is still outstanding 400AnyEvent::Handle object will check whether there is still outstanding
358write data and will install a watcher that will write this data to the 401write data and will install a watcher that will write this data to the
359socket. No errors will be reported (this mostly matches how the operating 402socket. No errors will be reported (this mostly matches how the operating
360system treats outstanding data at socket close time). 403system treats outstanding data at socket close time).
361 404
368A string used to identify the remote site - usually the DNS hostname 411A string used to identify the remote site - usually the DNS hostname
369(I<not> IDN!) used to create the connection, rarely the IP address. 412(I<not> IDN!) used to create the connection, rarely the IP address.
370 413
371Apart from being useful in error messages, this string is also used in TLS 414Apart from being useful in error messages, this string is also used in TLS
372peername verification (see C<verify_peername> in L<AnyEvent::TLS>). This 415peername verification (see C<verify_peername> in L<AnyEvent::TLS>). This
373verification will be skipped when C<peername> is not specified or 416verification will be skipped when C<peername> is not specified or is
374C<undef>. 417C<undef>.
375 418
376=item tls => "accept" | "connect" | Net::SSLeay::SSL object 419=item tls => "accept" | "connect" | Net::SSLeay::SSL object
377 420
378When this parameter is given, it enables TLS (SSL) mode, that means 421When this parameter is given, it enables TLS (SSL) mode, that means
379AnyEvent will start a TLS handshake as soon as the conenction has been 422AnyEvent will start a TLS handshake as soon as the connection has been
380established and will transparently encrypt/decrypt data afterwards. 423established and will transparently encrypt/decrypt data afterwards.
381 424
382All TLS protocol errors will be signalled as C<EPROTO>, with an 425All TLS protocol errors will be signalled as C<EPROTO>, with an
383appropriate error message. 426appropriate error message.
384 427
385TLS mode requires Net::SSLeay to be installed (it will be loaded 428TLS mode requires Net::SSLeay to be installed (it will be loaded
386automatically when you try to create a TLS handle): this module doesn't 429automatically when you try to create a TLS handle): this module doesn't
387have a dependency on that module, so if your module requires it, you have 430have a dependency on that module, so if your module requires it, you have
388to add the dependency yourself. 431to add the dependency yourself. If Net::SSLeay cannot be loaded or is too
432old, you get an C<EPROTO> error.
389 433
390Unlike TCP, TLS has a server and client side: for the TLS server side, use 434Unlike TCP, TLS has a server and client side: for the TLS server side, use
391C<accept>, and for the TLS client side of a connection, use C<connect> 435C<accept>, and for the TLS client side of a connection, use C<connect>
392mode. 436mode.
393 437
404B<IMPORTANT:> since Net::SSLeay "objects" are really only integers, 448B<IMPORTANT:> since Net::SSLeay "objects" are really only integers,
405passing in the wrong integer will lead to certain crash. This most often 449passing in the wrong integer will lead to certain crash. This most often
406happens when one uses a stylish C<< tls => 1 >> and is surprised about the 450happens when one uses a stylish C<< tls => 1 >> and is surprised about the
407segmentation fault. 451segmentation fault.
408 452
409See the C<< ->starttls >> method for when need to start TLS negotiation later. 453Use the C<< ->starttls >> method if you need to start TLS negotiation later.
410 454
411=item tls_ctx => $anyevent_tls 455=item tls_ctx => $anyevent_tls
412 456
413Use the given C<AnyEvent::TLS> object to create the new TLS connection 457Use the given C<AnyEvent::TLS> object to create the new TLS connection
414(unless a connection object was specified directly). If this parameter is 458(unless a connection object was specified directly). If this
415missing, then AnyEvent::Handle will use C<AnyEvent::Handle::TLS_CTX>. 459parameter is missing (or C<undef>), then AnyEvent::Handle will use
460C<AnyEvent::Handle::TLS_CTX>.
416 461
417Instead of an object, you can also specify a hash reference with C<< key 462Instead of an object, you can also specify a hash reference with C<< key
418=> value >> pairs. Those will be passed to L<AnyEvent::TLS> to create a 463=> value >> pairs. Those will be passed to L<AnyEvent::TLS> to create a
419new TLS context object. 464new TLS context object.
420 465
429 474
430TLS handshake failures will not cause C<on_error> to be invoked when this 475TLS handshake failures will not cause C<on_error> to be invoked when this
431callback is in effect, instead, the error message will be passed to C<on_starttls>. 476callback is in effect, instead, the error message will be passed to C<on_starttls>.
432 477
433Without this callback, handshake failures lead to C<on_error> being 478Without this callback, handshake failures lead to C<on_error> being
434called, as normal. 479called as usual.
435 480
436Note that you cannot call C<starttls> right again in this callback. If you 481Note that you cannot just call C<starttls> again in this callback. If you
437need to do that, start an zero-second timer instead whose callback can 482need to do that, start an zero-second timer instead whose callback can
438then call C<< ->starttls >> again. 483then call C<< ->starttls >> again.
439 484
440=item on_stoptls => $cb->($handle) 485=item on_stoptls => $cb->($handle)
441 486
448callback. 493callback.
449 494
450This callback will only be called on TLS shutdowns, not when the 495This callback will only be called on TLS shutdowns, not when the
451underlying handle signals EOF. 496underlying handle signals EOF.
452 497
453=item json => JSON or JSON::XS object 498=item json => L<JSON>, L<JSON::PP> or L<JSON::XS> object
454 499
455This is the json coder object used by the C<json> read and write types. 500This is the json coder object used by the C<json> read and write types.
456 501
457If you don't supply it, then AnyEvent::Handle will create and use a 502If you don't supply it, then AnyEvent::Handle will create and use a
458suitable one (on demand), which will write and expect UTF-8 encoded JSON 503suitable one (on demand), which will write and expect UTF-8 encoded
504JSON texts (either using L<JSON::XS> or L<JSON>). The written texts are
505guaranteed not to contain any newline character.
506
507For security reasons, this encoder will likely I<not> handle numbers and
508strings, only arrays and objects/hashes. The reason is that originally
509JSON was self-delimited, but Dougles Crockford thought it was a splendid
510idea to redefine JSON incompatibly, so this is no longer true.
511
512For protocols that used back-to-back JSON texts, this might lead to
513run-ins, where two or more JSON texts will be interpreted as one JSON
459texts. 514text.
460 515
516For this reason, if the default encoder uses L<JSON::XS>, it will default
517to not allowing anything but arrays and objects/hashes, at least for the
518forseeable future (it will change at some point). This might or might not
519be true for the L<JSON> module, so this might cause a security issue.
520
521If you depend on either behaviour, you should create your own json object
522and pass it in explicitly.
523
524=item cbor => L<CBOR::XS> object
525
526This is the cbor coder object used by the C<cbor> read and write types.
527
528If you don't supply it, then AnyEvent::Handle will create and use a
529suitable one (on demand), which will write CBOR without using extensions,
530if possible.
531
461Note that you are responsible to depend on the JSON module if you want to 532Note that you are responsible to depend on the L<CBOR::XS> module if you
462use this functionality, as AnyEvent does not have a dependency itself. 533want to use this functionality, as AnyEvent does not have a dependency on
534it itself.
463 535
464=back 536=back
465 537
466=cut 538=cut
467 539
489 $self->{connect}[0], 561 $self->{connect}[0],
490 $self->{connect}[1], 562 $self->{connect}[1],
491 sub { 563 sub {
492 my ($fh, $host, $port, $retry) = @_; 564 my ($fh, $host, $port, $retry) = @_;
493 565
566 delete $self->{_connect}; # no longer needed
567
494 if ($fh) { 568 if ($fh) {
495 $self->{fh} = $fh; 569 $self->{fh} = $fh;
496 570
497 delete $self->{_skip_drain_rbuf}; 571 delete $self->{_skip_drain_rbuf};
498 $self->_start; 572 $self->_start;
505 }); 579 });
506 580
507 } else { 581 } else {
508 if ($self->{on_connect_error}) { 582 if ($self->{on_connect_error}) {
509 $self->{on_connect_error}($self, "$!"); 583 $self->{on_connect_error}($self, "$!");
510 $self->destroy; 584 $self->destroy if $self;
511 } else { 585 } else {
512 $self->_error ($!, 1); 586 $self->_error ($!, 1);
513 } 587 }
514 } 588 }
515 }, 589 },
516 sub { 590 sub {
517 local $self->{fh} = $_[0]; 591 local $self->{fh} = $_[0];
518 592
519 $self->{on_prepare} 593 $self->{on_prepare}
520 ? $self->{on_prepare}->($self) 594 ? $self->{on_prepare}->($self)
521 : () 595 : ()
522 } 596 }
523 ); 597 );
524 } 598 }
525 599
531} 605}
532 606
533sub _start { 607sub _start {
534 my ($self) = @_; 608 my ($self) = @_;
535 609
610 # too many clueless people try to use udp and similar sockets
611 # with AnyEvent::Handle, do them a favour.
612 my $type = getsockopt $self->{fh}, Socket::SOL_SOCKET (), Socket::SO_TYPE ();
613 Carp::croak "AnyEvent::Handle: only stream sockets supported, anything else will NOT work!"
614 if Socket::SOCK_STREAM () != (unpack "I", $type) && defined $type;
615
536 AnyEvent::Util::fh_nonblocking $self->{fh}, 1; 616 AnyEvent::fh_unblock $self->{fh};
537 617
538 $self->{_activity} = 618 $self->{_activity} =
539 $self->{_ractivity} = 619 $self->{_ractivity} =
540 $self->{_wactivity} = AE::now; 620 $self->{_wactivity} = AE::now;
541 621
622 $self->{read_size} ||= 2048;
623 $self->{max_read_size} = $self->{read_size}
624 if $self->{read_size} > ($self->{max_read_size} || MAX_READ_SIZE);
625
542 $self->timeout (delete $self->{timeout} ) if $self->{timeout}; 626 $self->timeout (delete $self->{timeout} ) if $self->{timeout};
543 $self->rtimeout (delete $self->{rtimeout} ) if $self->{rtimeout}; 627 $self->rtimeout (delete $self->{rtimeout} ) if $self->{rtimeout};
544 $self->wtimeout (delete $self->{wtimeout} ) if $self->{wtimeout}; 628 $self->wtimeout (delete $self->{wtimeout} ) if $self->{wtimeout};
545 629
546 $self->no_delay (delete $self->{no_delay} ) if exists $self->{no_delay} && $self->{no_delay}; 630 $self->no_delay (delete $self->{no_delay} ) if exists $self->{no_delay} && $self->{no_delay};
549 $self->oobinline (exists $self->{oobinline} ? delete $self->{oobinline} : 1); 633 $self->oobinline (exists $self->{oobinline} ? delete $self->{oobinline} : 1);
550 634
551 $self->starttls (delete $self->{tls}, delete $self->{tls_ctx}) 635 $self->starttls (delete $self->{tls}, delete $self->{tls_ctx})
552 if $self->{tls}; 636 if $self->{tls};
553 637
554 $self->on_drain (delete $self->{on_drain}) if $self->{on_drain}; 638 $self->on_drain (delete $self->{on_drain} ) if $self->{on_drain};
555 639
556 $self->start_read 640 $self->start_read
557 if $self->{on_read} || @{ $self->{_queue} }; 641 if $self->{on_read} || @{ $self->{_queue} };
558 642
559 $self->_drain_wbuf; 643 $self->_drain_wbuf;
566 $message ||= "$!"; 650 $message ||= "$!";
567 651
568 if ($self->{on_error}) { 652 if ($self->{on_error}) {
569 $self->{on_error}($self, $fatal, $message); 653 $self->{on_error}($self, $fatal, $message);
570 $self->destroy if $fatal; 654 $self->destroy if $fatal;
571 } elsif ($self->{fh}) { 655 } elsif ($self->{fh} || $self->{connect}) {
572 $self->destroy; 656 $self->destroy;
573 Carp::croak "AnyEvent::Handle uncaught error: $message"; 657 Carp::croak "AnyEvent::Handle uncaught error: $message";
574 } 658 }
575} 659}
576 660
635=cut 719=cut
636 720
637sub no_delay { 721sub no_delay {
638 $_[0]{no_delay} = $_[1]; 722 $_[0]{no_delay} = $_[1];
639 723
640 eval {
641 local $SIG{__DIE__};
642 setsockopt $_[0]{fh}, Socket::IPPROTO_TCP (), Socket::TCP_NODELAY (), int $_[1] 724 setsockopt $_[0]{fh}, Socket::IPPROTO_TCP (), Socket::TCP_NODELAY (), int $_[1]
643 if $_[0]{fh}; 725 if $_[0]{fh};
644 };
645} 726}
646 727
647=item $handle->keepalive ($boolean) 728=item $handle->keepalive ($boolean)
648 729
649Enables or disables the C<keepalive> setting (see constructor argument of 730Enables or disables the C<keepalive> setting (see constructor argument of
709 790
710Replace the current C<on_stoptls> callback (see the C<on_stoptls> constructor argument). 791Replace the current C<on_stoptls> callback (see the C<on_stoptls> constructor argument).
711 792
712=cut 793=cut
713 794
714sub on_starttls { 795sub on_stoptls {
715 $_[0]{on_stoptls} = $_[1]; 796 $_[0]{on_stoptls} = $_[1];
716} 797}
717 798
718=item $handle->rbuf_max ($max_octets) 799=item $handle->rbuf_max ($max_octets)
719 800
720Configures the C<rbuf_max> setting (C<undef> disables it). 801Configures the C<rbuf_max> setting (C<undef> disables it).
802
803=item $handle->wbuf_max ($max_octets)
804
805Configures the C<wbuf_max> setting (C<undef> disables it).
721 806
722=cut 807=cut
723 808
724sub rbuf_max { 809sub rbuf_max {
725 $_[0]{rbuf_max} = $_[1]; 810 $_[0]{rbuf_max} = $_[1];
726} 811}
727 812
813sub wbuf_max {
814 $_[0]{wbuf_max} = $_[1];
815}
816
728############################################################################# 817#############################################################################
729 818
730=item $handle->timeout ($seconds) 819=item $handle->timeout ($seconds)
731 820
732=item $handle->rtimeout ($seconds) 821=item $handle->rtimeout ($seconds)
733 822
734=item $handle->wtimeout ($seconds) 823=item $handle->wtimeout ($seconds)
735 824
736Configures (or disables) the inactivity timeout. 825Configures (or disables) the inactivity timeout.
826
827The timeout will be checked instantly, so this method might destroy the
828handle before it returns.
737 829
738=item $handle->timeout_reset 830=item $handle->timeout_reset
739 831
740=item $handle->rtimeout_reset 832=item $handle->rtimeout_reset
741 833
758 $_[0]{$on_timeout} = $_[1]; 850 $_[0]{$on_timeout} = $_[1];
759 }; 851 };
760 852
761 *$timeout = sub { 853 *$timeout = sub {
762 my ($self, $new_value) = @_; 854 my ($self, $new_value) = @_;
855
856 $new_value >= 0
857 or Carp::croak "AnyEvent::Handle->$timeout called with negative timeout ($new_value), caught";
763 858
764 $self->{$timeout} = $new_value; 859 $self->{$timeout} = $new_value;
765 delete $self->{$tw}; &$cb; 860 delete $self->{$tw}; &$cb;
766 }; 861 };
767 862
822 917
823The write queue is very simple: you can add data to its end, and 918The write queue is very simple: you can add data to its end, and
824AnyEvent::Handle will automatically try to get rid of it for you. 919AnyEvent::Handle will automatically try to get rid of it for you.
825 920
826When data could be written and the write buffer is shorter then the low 921When data could be written and the write buffer is shorter then the low
827water mark, the C<on_drain> callback will be invoked. 922water mark, the C<on_drain> callback will be invoked once.
828 923
829=over 4 924=over 4
830 925
831=item $handle->on_drain ($cb) 926=item $handle->on_drain ($cb)
832 927
833Sets the C<on_drain> callback or clears it (see the description of 928Sets the C<on_drain> callback or clears it (see the description of
834C<on_drain> in the constructor). 929C<on_drain> in the constructor).
835 930
931This method may invoke callbacks (and therefore the handle might be
932destroyed after it returns).
933
836=cut 934=cut
837 935
838sub on_drain { 936sub on_drain {
839 my ($self, $cb) = @_; 937 my ($self, $cb) = @_;
840 938
844 if $cb && $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf}); 942 if $cb && $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf});
845} 943}
846 944
847=item $handle->push_write ($data) 945=item $handle->push_write ($data)
848 946
849Queues the given scalar to be written. You can push as much data as you 947Queues the given scalar to be written. You can push as much data as
850want (only limited by the available memory), as C<AnyEvent::Handle> 948you want (only limited by the available memory and C<wbuf_max>), as
851buffers it independently of the kernel. 949C<AnyEvent::Handle> buffers it independently of the kernel.
950
951This method may invoke callbacks (and therefore the handle might be
952destroyed after it returns).
852 953
853=cut 954=cut
854 955
855sub _drain_wbuf { 956sub _drain_wbuf {
856 my ($self) = @_; 957 my ($self) = @_;
870 $self->{on_drain}($self) 971 $self->{on_drain}($self)
871 if $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf}) 972 if $self->{low_water_mark} >= (length $self->{wbuf}) + (length $self->{_tls_wbuf})
872 && $self->{on_drain}; 973 && $self->{on_drain};
873 974
874 delete $self->{_ww} unless length $self->{wbuf}; 975 delete $self->{_ww} unless length $self->{wbuf};
875 } elsif ($! != EAGAIN && $! != EINTR && $! != WSAEWOULDBLOCK) { 976 } elsif ($! != EAGAIN && $! != EINTR && $! != EWOULDBLOCK && $! != WSAEWOULDBLOCK) {
876 $self->_error ($!, 1); 977 $self->_error ($!, 1);
877 } 978 }
878 }; 979 };
879 980
880 # try to write data immediately 981 # try to write data immediately
881 $cb->() unless $self->{autocork}; 982 $cb->() unless $self->{autocork};
882 983
883 # if still data left in wbuf, we need to poll 984 # if still data left in wbuf, we need to poll
884 $self->{_ww} = AE::io $self->{fh}, 1, $cb 985 $self->{_ww} = AE::io $self->{fh}, 1, $cb
885 if length $self->{wbuf}; 986 if length $self->{wbuf};
987
988 if (
989 defined $self->{wbuf_max}
990 && $self->{wbuf_max} < length $self->{wbuf}
991 ) {
992 $self->_error (Errno::ENOSPC, 1), return;
993 }
886 }; 994 };
887} 995}
888 996
889our %WH; 997our %WH;
890 998
902 @_ = ($WH{$type} ||= _load_func "$type\::anyevent_write_type" 1010 @_ = ($WH{$type} ||= _load_func "$type\::anyevent_write_type"
903 or Carp::croak "unsupported/unloadable type '$type' passed to AnyEvent::Handle::push_write") 1011 or Carp::croak "unsupported/unloadable type '$type' passed to AnyEvent::Handle::push_write")
904 ->($self, @_); 1012 ->($self, @_);
905 } 1013 }
906 1014
1015 # we downgrade here to avoid hard-to-track-down bugs,
1016 # and diagnose the problem earlier and better.
1017
907 if ($self->{tls}) { 1018 if ($self->{tls}) {
908 $self->{_tls_wbuf} .= $_[0]; 1019 utf8::downgrade $self->{_tls_wbuf} .= $_[0];
909 &_dotls ($self) if $self->{fh}; 1020 &_dotls ($self) if $self->{fh};
910 } else { 1021 } else {
911 $self->{wbuf} .= $_[0]; 1022 utf8::downgrade $self->{wbuf} .= $_[0];
912 $self->_drain_wbuf if $self->{fh}; 1023 $self->_drain_wbuf if $self->{fh};
913 } 1024 }
914} 1025}
915 1026
916=item $handle->push_write (type => @args) 1027=item $handle->push_write (type => @args)
917 1028
918Instead of formatting your data yourself, you can also let this module 1029Instead of formatting your data yourself, you can also let this module
919do the job by specifying a type and type-specific arguments. You 1030do the job by specifying a type and type-specific arguments. You
920can also specify the (fully qualified) name of a package, in which 1031can also specify the (fully qualified) name of a package, in which
921case AnyEvent tries to load the package and then expects to find the 1032case AnyEvent tries to load the package and then expects to find the
922C<anyevent_read_type> function inside (see "custom write types", below). 1033C<anyevent_write_type> function inside (see "custom write types", below).
923 1034
924Predefined types are (if you have ideas for additional types, feel free to 1035Predefined types are (if you have ideas for additional types, feel free to
925drop by and tell us): 1036drop by and tell us):
926 1037
927=over 4 1038=over 4
958 1069
959Encodes the given hash or array reference into a JSON object. Unless you 1070Encodes the given hash or array reference into a JSON object. Unless you
960provide your own JSON object, this means it will be encoded to JSON text 1071provide your own JSON object, this means it will be encoded to JSON text
961in UTF-8. 1072in UTF-8.
962 1073
1074The default encoder might or might not handle every type of JSON value -
1075it might be limited to arrays and objects for security reasons. See the
1076C<json> constructor attribute for more details.
1077
963JSON objects (and arrays) are self-delimiting, so you can write JSON at 1078JSON objects (and arrays) are self-delimiting, so if you only use arrays
964one end of a handle and read them at the other end without using any 1079and hashes, you can write JSON at one end of a handle and read them at the
965additional framing. 1080other end without using any additional framing.
966 1081
967The generated JSON text is guaranteed not to contain any newlines: While 1082The JSON text generated by the default encoder is guaranteed not to
968this module doesn't need delimiters after or between JSON texts to be 1083contain any newlines: While this module doesn't need delimiters after or
969able to read them, many other languages depend on that. 1084between JSON texts to be able to read them, many other languages depend on
1085them.
970 1086
971A simple RPC protocol that interoperates easily with others is to send 1087A simple RPC protocol that interoperates easily with other languages is
972JSON arrays (or objects, although arrays are usually the better choice as 1088to send JSON arrays (or objects, although arrays are usually the better
973they mimic how function argument passing works) and a newline after each 1089choice as they mimic how function argument passing works) and a newline
974JSON text: 1090after each JSON text:
975 1091
976 $handle->push_write (json => ["method", "arg1", "arg2"]); # whatever 1092 $handle->push_write (json => ["method", "arg1", "arg2"]); # whatever
977 $handle->push_write ("\012"); 1093 $handle->push_write ("\012");
978 1094
979An AnyEvent::Handle receiver would simply use the C<json> read type and 1095An AnyEvent::Handle receiver would simply use the C<json> read type and
982 $handle->push_read (json => sub { my $array = $_[1]; ... }); 1098 $handle->push_read (json => sub { my $array = $_[1]; ... });
983 1099
984Other languages could read single lines terminated by a newline and pass 1100Other languages could read single lines terminated by a newline and pass
985this line into their JSON decoder of choice. 1101this line into their JSON decoder of choice.
986 1102
1103=item cbor => $perl_scalar
1104
1105Encodes the given scalar into a CBOR value. Unless you provide your own
1106L<CBOR::XS> object, this means it will be encoded to a CBOR string not
1107using any extensions, if possible.
1108
1109CBOR values are self-delimiting, so you can write CBOR at one end of
1110a handle and read them at the other end without using any additional
1111framing.
1112
1113A simple nd very very fast RPC protocol that interoperates with
1114other languages is to send CBOR and receive CBOR values (arrays are
1115recommended):
1116
1117 $handle->push_write (cbor => ["method", "arg1", "arg2"]); # whatever
1118
1119An AnyEvent::Handle receiver would simply use the C<cbor> read type:
1120
1121 $handle->push_read (cbor => sub { my $array = $_[1]; ... });
1122
987=cut 1123=cut
988 1124
989sub json_coder() { 1125sub json_coder() {
990 eval { require JSON::XS; JSON::XS->new->utf8 } 1126 eval { require JSON::XS; JSON::XS->new->utf8 }
991 || do { require JSON; JSON->new->utf8 } 1127 || do { require JSON::PP; JSON::PP->new->utf8 }
992} 1128}
993 1129
994register_write_type json => sub { 1130register_write_type json => sub {
995 my ($self, $ref) = @_; 1131 my ($self, $ref) = @_;
996 1132
997 my $json = $self->{json} ||= json_coder; 1133 ($self->{json} ||= json_coder)
998
999 $json->encode ($ref) 1134 ->encode ($ref)
1135};
1136
1137sub cbor_coder() {
1138 require CBOR::XS;
1139 CBOR::XS->new
1140}
1141
1142register_write_type cbor => sub {
1143 my ($self, $scalar) = @_;
1144
1145 ($self->{cbor} ||= cbor_coder)
1146 ->encode ($scalar)
1000}; 1147};
1001 1148
1002=item storable => $reference 1149=item storable => $reference
1003 1150
1004Freezes the given reference using L<Storable> and writes it to the 1151Freezes the given reference using L<Storable> and writes it to the
1007=cut 1154=cut
1008 1155
1009register_write_type storable => sub { 1156register_write_type storable => sub {
1010 my ($self, $ref) = @_; 1157 my ($self, $ref) = @_;
1011 1158
1012 require Storable; 1159 require Storable unless $Storable::VERSION;
1013 1160
1014 pack "w/a*", Storable::nfreeze ($ref) 1161 pack "w/a*", Storable::nfreeze ($ref)
1015}; 1162};
1016 1163
1017=back 1164=back
1022before it was actually written. One way to do that is to replace your 1169before it was actually written. One way to do that is to replace your
1023C<on_drain> handler by a callback that shuts down the socket (and set 1170C<on_drain> handler by a callback that shuts down the socket (and set
1024C<low_water_mark> to C<0>). This method is a shorthand for just that, and 1171C<low_water_mark> to C<0>). This method is a shorthand for just that, and
1025replaces the C<on_drain> callback with: 1172replaces the C<on_drain> callback with:
1026 1173
1027 sub { shutdown $_[0]{fh}, 1 } # for push_shutdown 1174 sub { shutdown $_[0]{fh}, 1 }
1028 1175
1029This simply shuts down the write side and signals an EOF condition to the 1176This simply shuts down the write side and signals an EOF condition to the
1030the peer. 1177the peer.
1031 1178
1032You can rely on the normal read queue and C<on_eof> handling 1179You can rely on the normal read queue and C<on_eof> handling
1033afterwards. This is the cleanest way to close a connection. 1180afterwards. This is the cleanest way to close a connection.
1181
1182This method may invoke callbacks (and therefore the handle might be
1183destroyed after it returns).
1034 1184
1035=cut 1185=cut
1036 1186
1037sub push_shutdown { 1187sub push_shutdown {
1038 my ($self) = @_; 1188 my ($self) = @_;
1051 1201
1052Whenever the given C<type> is used, C<push_write> will the function with 1202Whenever the given C<type> is used, C<push_write> will the function with
1053the handle object and the remaining arguments. 1203the handle object and the remaining arguments.
1054 1204
1055The function is supposed to return a single octet string that will be 1205The function is supposed to return a single octet string that will be
1056appended to the write buffer, so you cna mentally treat this function as a 1206appended to the write buffer, so you can mentally treat this function as a
1057"arguments to on-the-wire-format" converter. 1207"arguments to on-the-wire-format" converter.
1058 1208
1059Example: implement a custom write type C<join> that joins the remaining 1209Example: implement a custom write type C<join> that joins the remaining
1060arguments using the first one. 1210arguments using the first one.
1061 1211
1088ways, the "simple" way, using only C<on_read> and the "complex" way, using 1238ways, the "simple" way, using only C<on_read> and the "complex" way, using
1089a queue. 1239a queue.
1090 1240
1091In the simple case, you just install an C<on_read> callback and whenever 1241In the simple case, you just install an C<on_read> callback and whenever
1092new data arrives, it will be called. You can then remove some data (if 1242new data arrives, it will be called. You can then remove some data (if
1093enough is there) from the read buffer (C<< $handle->rbuf >>). Or you cna 1243enough is there) from the read buffer (C<< $handle->rbuf >>). Or you can
1094leave the data there if you want to accumulate more (e.g. when only a 1244leave the data there if you want to accumulate more (e.g. when only a
1095partial message has been received so far). 1245partial message has been received so far), or change the read queue with
1246e.g. C<push_read>.
1096 1247
1097In the more complex case, you want to queue multiple callbacks. In this 1248In the more complex case, you want to queue multiple callbacks. In this
1098case, AnyEvent::Handle will call the first queued callback each time new 1249case, AnyEvent::Handle will call the first queued callback each time new
1099data arrives (also the first time it is queued) and removes it when it has 1250data arrives (also the first time it is queued) and remove it when it has
1100done its job (see C<push_read>, below). 1251done its job (see C<push_read>, below).
1101 1252
1102This way you can, for example, push three line-reads, followed by reading 1253This way you can, for example, push three line-reads, followed by reading
1103a chunk of data, and AnyEvent::Handle will execute them in order. 1254a chunk of data, and AnyEvent::Handle will execute them in order.
1104 1255
1235 1386
1236This replaces the currently set C<on_read> callback, or clears it (when 1387This replaces the currently set C<on_read> callback, or clears it (when
1237the new callback is C<undef>). See the description of C<on_read> in the 1388the new callback is C<undef>). See the description of C<on_read> in the
1238constructor. 1389constructor.
1239 1390
1391This method may invoke callbacks (and therefore the handle might be
1392destroyed after it returns).
1393
1240=cut 1394=cut
1241 1395
1242sub on_read { 1396sub on_read {
1243 my ($self, $cb) = @_; 1397 my ($self, $cb) = @_;
1244 1398
1246 $self->_drain_rbuf if $cb; 1400 $self->_drain_rbuf if $cb;
1247} 1401}
1248 1402
1249=item $handle->rbuf 1403=item $handle->rbuf
1250 1404
1251Returns the read buffer (as a modifiable lvalue). 1405Returns the read buffer (as a modifiable lvalue). You can also access the
1406read buffer directly as the C<< ->{rbuf} >> member, if you want (this is
1407much faster, and no less clean).
1252 1408
1253You can access the read buffer directly as the C<< ->{rbuf} >> 1409The only operation allowed on the read buffer (apart from looking at it)
1254member, if you want. However, the only operation allowed on the 1410is removing data from its beginning. Otherwise modifying or appending to
1255read buffer (apart from looking at it) is removing data from its 1411it is not allowed and will lead to hard-to-track-down bugs.
1256beginning. Otherwise modifying or appending to it is not allowed and will
1257lead to hard-to-track-down bugs.
1258 1412
1259NOTE: The read buffer should only be used or modified if the C<on_read>, 1413NOTE: The read buffer should only be used or modified in the C<on_read>
1260C<push_read> or C<unshift_read> methods are used. The other read methods 1414callback or when C<push_read> or C<unshift_read> are used with a single
1261automatically manage the read buffer. 1415callback (i.e. untyped). Typed C<push_read> and C<unshift_read> methods
1416will manage the read buffer on their own.
1262 1417
1263=cut 1418=cut
1264 1419
1265sub rbuf : lvalue { 1420sub rbuf : lvalue {
1266 $_[0]{rbuf} 1421 $_[0]{rbuf}
1283 1438
1284If enough data was available, then the callback must remove all data it is 1439If enough data was available, then the callback must remove all data it is
1285interested in (which can be none at all) and return a true value. After returning 1440interested in (which can be none at all) and return a true value. After returning
1286true, it will be removed from the queue. 1441true, it will be removed from the queue.
1287 1442
1443These methods may invoke callbacks (and therefore the handle might be
1444destroyed after it returns).
1445
1288=cut 1446=cut
1289 1447
1290our %RH; 1448our %RH;
1291 1449
1292sub register_read_type($$) { 1450sub register_read_type($$) {
1314 my $cb = pop; 1472 my $cb = pop;
1315 1473
1316 if (@_) { 1474 if (@_) {
1317 my $type = shift; 1475 my $type = shift;
1318 1476
1477 $cb = ($RH{$type} ||= _load_func "$type\::anyevent_read_type"
1319 $cb = ($RH{$type} or Carp::croak "unsupported type passed to AnyEvent::Handle::unshift_read") 1478 or Carp::croak "unsupported/unloadable type '$type' passed to AnyEvent::Handle::unshift_read")
1320 ->($self, $cb, @_); 1479 ->($self, $cb, @_);
1321 } 1480 }
1322 1481
1323 unshift @{ $self->{_queue} }, $cb; 1482 unshift @{ $self->{_queue} }, $cb;
1324 $self->_drain_rbuf; 1483 $self->_drain_rbuf;
1346data. 1505data.
1347 1506
1348Example: read 2 bytes. 1507Example: read 2 bytes.
1349 1508
1350 $handle->push_read (chunk => 2, sub { 1509 $handle->push_read (chunk => 2, sub {
1351 warn "yay ", unpack "H*", $_[1]; 1510 say "yay " . unpack "H*", $_[1];
1352 }); 1511 });
1353 1512
1354=cut 1513=cut
1355 1514
1356register_read_type chunk => sub { 1515register_read_type chunk => sub {
1386 1545
1387register_read_type line => sub { 1546register_read_type line => sub {
1388 my ($self, $cb, $eol) = @_; 1547 my ($self, $cb, $eol) = @_;
1389 1548
1390 if (@_ < 3) { 1549 if (@_ < 3) {
1391 # this is more than twice as fast as the generic code below 1550 # this is faster then the generic code below
1392 sub { 1551 sub {
1393 $_[0]{rbuf} =~ s/^([^\015\012]*)(\015?\012)// or return; 1552 (my $pos = index $_[0]{rbuf}, "\012") >= 0
1553 or return;
1394 1554
1555 (my $str = substr $_[0]{rbuf}, 0, $pos + 1, "") =~ s/(\015?\012)\Z// or die;
1395 $cb->($_[0], $1, $2); 1556 $cb->($_[0], $str, "$1");
1396 1 1557 1
1397 } 1558 }
1398 } else { 1559 } else {
1399 $eol = quotemeta $eol unless ref $eol; 1560 $eol = quotemeta $eol unless ref $eol;
1400 $eol = qr|^(.*?)($eol)|s; 1561 $eol = qr|^(.*?)($eol)|s;
1401 1562
1402 sub { 1563 sub {
1403 $_[0]{rbuf} =~ s/$eol// or return; 1564 $_[0]{rbuf} =~ s/$eol// or return;
1404 1565
1405 $cb->($_[0], $1, $2); 1566 $cb->($_[0], "$1", "$2");
1406 1 1567 1
1407 } 1568 }
1408 } 1569 }
1409}; 1570};
1410 1571
1411=item regex => $accept[, $reject[, $skip], $cb->($handle, $data) 1572=item regex => $accept[, $reject[, $skip], $cb->($handle, $data)
1412 1573
1413Makes a regex match against the regex object C<$accept> and returns 1574Makes a regex match against the regex object C<$accept> and returns
1414everything up to and including the match. 1575everything up to and including the match. All the usual regex variables
1576($1, %+ etc.) from the regex match are available in the callback.
1415 1577
1416Example: read a single line terminated by '\n'. 1578Example: read a single line terminated by '\n'.
1417 1579
1418 $handle->push_read (regex => qr<\n>, sub { ... }); 1580 $handle->push_read (regex => qr<\n>, sub { ... });
1419 1581
1432the receive buffer when neither C<$accept> nor C<$reject> match, 1594the receive buffer when neither C<$accept> nor C<$reject> match,
1433and everything preceding and including the match will be accepted 1595and everything preceding and including the match will be accepted
1434unconditionally. This is useful to skip large amounts of data that you 1596unconditionally. This is useful to skip large amounts of data that you
1435know cannot be matched, so that the C<$accept> or C<$reject> regex do not 1597know cannot be matched, so that the C<$accept> or C<$reject> regex do not
1436have to start matching from the beginning. This is purely an optimisation 1598have to start matching from the beginning. This is purely an optimisation
1437and is usually worth only when you expect more than a few kilobytes. 1599and is usually worth it only when you expect more than a few kilobytes.
1438 1600
1439Example: expect a http header, which ends at C<\015\012\015\012>. Since we 1601Example: expect a http header, which ends at C<\015\012\015\012>. Since we
1440expect the header to be very large (it isn't in practise, but...), we use 1602expect the header to be very large (it isn't in practice, but...), we use
1441a skip regex to skip initial portions. The skip regex is tricky in that 1603a skip regex to skip initial portions. The skip regex is tricky in that
1442it only accepts something not ending in either \015 or \012, as these are 1604it only accepts something not ending in either \015 or \012, as these are
1443required for the accept regex. 1605required for the accept regex.
1444 1606
1445 $handle->push_read (regex => 1607 $handle->push_read (regex =>
1458 1620
1459 sub { 1621 sub {
1460 # accept 1622 # accept
1461 if ($$rbuf =~ $accept) { 1623 if ($$rbuf =~ $accept) {
1462 $data .= substr $$rbuf, 0, $+[0], ""; 1624 $data .= substr $$rbuf, 0, $+[0], "";
1463 $cb->($self, $data); 1625 $cb->($_[0], $data);
1464 return 1; 1626 return 1;
1465 } 1627 }
1466 1628
1467 # reject 1629 # reject
1468 if ($reject && $$rbuf =~ $reject) { 1630 if ($reject && $$rbuf =~ $reject) {
1469 $self->_error (Errno::EBADMSG); 1631 $_[0]->_error (Errno::EBADMSG);
1470 } 1632 }
1471 1633
1472 # skip 1634 # skip
1473 if ($skip && $$rbuf =~ $skip) { 1635 if ($skip && $$rbuf =~ $skip) {
1474 $data .= substr $$rbuf, 0, $+[0], ""; 1636 $data .= substr $$rbuf, 0, $+[0], "";
1490 my ($self, $cb) = @_; 1652 my ($self, $cb) = @_;
1491 1653
1492 sub { 1654 sub {
1493 unless ($_[0]{rbuf} =~ s/^(0|[1-9][0-9]*)://) { 1655 unless ($_[0]{rbuf} =~ s/^(0|[1-9][0-9]*)://) {
1494 if ($_[0]{rbuf} =~ /[^0-9]/) { 1656 if ($_[0]{rbuf} =~ /[^0-9]/) {
1495 $self->_error (Errno::EBADMSG); 1657 $_[0]->_error (Errno::EBADMSG);
1496 } 1658 }
1497 return; 1659 return;
1498 } 1660 }
1499 1661
1500 my $len = $1; 1662 my $len = $1;
1501 1663
1502 $self->unshift_read (chunk => $len, sub { 1664 $_[0]->unshift_read (chunk => $len, sub {
1503 my $string = $_[1]; 1665 my $string = $_[1];
1504 $_[0]->unshift_read (chunk => 1, sub { 1666 $_[0]->unshift_read (chunk => 1, sub {
1505 if ($_[1] eq ",") { 1667 if ($_[1] eq ",") {
1506 $cb->($_[0], $string); 1668 $cb->($_[0], $string);
1507 } else { 1669 } else {
1508 $self->_error (Errno::EBADMSG); 1670 $_[0]->_error (Errno::EBADMSG);
1509 } 1671 }
1510 }); 1672 });
1511 }); 1673 });
1512 1674
1513 1 1675 1
1563=item json => $cb->($handle, $hash_or_arrayref) 1725=item json => $cb->($handle, $hash_or_arrayref)
1564 1726
1565Reads a JSON object or array, decodes it and passes it to the 1727Reads a JSON object or array, decodes it and passes it to the
1566callback. When a parse error occurs, an C<EBADMSG> error will be raised. 1728callback. When a parse error occurs, an C<EBADMSG> error will be raised.
1567 1729
1568If a C<json> object was passed to the constructor, then that will be used 1730If a C<json> object was passed to the constructor, then that will be
1569for the final decode, otherwise it will create a JSON coder expecting UTF-8. 1731used for the final decode, otherwise it will create a L<JSON::XS> or
1732L<JSON::PP> coder object expecting UTF-8.
1570 1733
1571This read type uses the incremental parser available with JSON version 1734This read type uses the incremental parser available with JSON version
15722.09 (and JSON::XS version 2.2) and above. You have to provide a 17352.09 (and JSON::XS version 2.2) and above.
1573dependency on your own: this module will load the JSON module, but
1574AnyEvent does not depend on it itself.
1575 1736
1576Since JSON texts are fully self-delimiting, the C<json> read and write 1737Since JSON texts are fully self-delimiting, the C<json> read and write
1577types are an ideal simple RPC protocol: just exchange JSON datagrams. See 1738types are an ideal simple RPC protocol: just exchange JSON datagrams. See
1578the C<json> write type description, above, for an actual example. 1739the C<json> write type description, above, for an actual example.
1579 1740
1583 my ($self, $cb) = @_; 1744 my ($self, $cb) = @_;
1584 1745
1585 my $json = $self->{json} ||= json_coder; 1746 my $json = $self->{json} ||= json_coder;
1586 1747
1587 my $data; 1748 my $data;
1588 my $rbuf = \$self->{rbuf};
1589 1749
1590 sub { 1750 sub {
1591 my $ref = eval { $json->incr_parse ($self->{rbuf}) }; 1751 my $ref = eval { $json->incr_parse ($_[0]{rbuf}) };
1592 1752
1593 if ($ref) { 1753 if ($ref) {
1594 $self->{rbuf} = $json->incr_text; 1754 $_[0]{rbuf} = $json->incr_text;
1595 $json->incr_text = ""; 1755 $json->incr_text = "";
1596 $cb->($self, $ref); 1756 $cb->($_[0], $ref);
1597 1757
1598 1 1758 1
1599 } elsif ($@) { 1759 } elsif ($@) {
1600 # error case 1760 # error case
1601 $json->incr_skip; 1761 $json->incr_skip;
1602 1762
1603 $self->{rbuf} = $json->incr_text; 1763 $_[0]{rbuf} = $json->incr_text;
1604 $json->incr_text = ""; 1764 $json->incr_text = "";
1605 1765
1606 $self->_error (Errno::EBADMSG); 1766 $_[0]->_error (Errno::EBADMSG);
1607 1767
1608 () 1768 ()
1609 } else { 1769 } else {
1610 $self->{rbuf} = ""; 1770 $_[0]{rbuf} = "";
1611 1771
1772 ()
1773 }
1774 }
1775};
1776
1777=item cbor => $cb->($handle, $scalar)
1778
1779Reads a CBOR value, decodes it and passes it to the callback. When a parse
1780error occurs, an C<EBADMSG> error will be raised.
1781
1782If a L<CBOR::XS> object was passed to the constructor, then that will be
1783used for the final decode, otherwise it will create a CBOR coder without
1784enabling any options.
1785
1786You have to provide a dependency to L<CBOR::XS> on your own: this module
1787will load the L<CBOR::XS> module, but AnyEvent does not depend on it
1788itself.
1789
1790Since CBOR values are fully self-delimiting, the C<cbor> read and write
1791types are an ideal simple RPC protocol: just exchange CBOR datagrams. See
1792the C<cbor> write type description, above, for an actual example.
1793
1794=cut
1795
1796register_read_type cbor => sub {
1797 my ($self, $cb) = @_;
1798
1799 my $cbor = $self->{cbor} ||= cbor_coder;
1800
1801 my $data;
1802
1803 sub {
1804 my (@value) = eval { $cbor->incr_parse ($_[0]{rbuf}) };
1805
1806 if (@value) {
1807 $cb->($_[0], @value);
1808
1809 1
1810 } elsif ($@) {
1811 # error case
1812 $cbor->incr_reset;
1813
1814 $_[0]->_error (Errno::EBADMSG);
1815
1816 ()
1817 } else {
1612 () 1818 ()
1613 } 1819 }
1614 } 1820 }
1615}; 1821};
1616 1822
1625=cut 1831=cut
1626 1832
1627register_read_type storable => sub { 1833register_read_type storable => sub {
1628 my ($self, $cb) = @_; 1834 my ($self, $cb) = @_;
1629 1835
1630 require Storable; 1836 require Storable unless $Storable::VERSION;
1631 1837
1632 sub { 1838 sub {
1633 # when we can use 5.10 we can use ".", but for 5.8 we use the re-pack method 1839 # when we can use 5.10 we can use ".", but for 5.8 we use the re-pack method
1634 defined (my $len = eval { unpack "w", $_[0]{rbuf} }) 1840 defined (my $len = eval { unpack "w", $_[0]{rbuf} })
1635 or return; 1841 or return;
1638 1844
1639 # bypass unshift if we already have the remaining chunk 1845 # bypass unshift if we already have the remaining chunk
1640 if ($format + $len <= length $_[0]{rbuf}) { 1846 if ($format + $len <= length $_[0]{rbuf}) {
1641 my $data = substr $_[0]{rbuf}, $format, $len; 1847 my $data = substr $_[0]{rbuf}, $format, $len;
1642 substr $_[0]{rbuf}, 0, $format + $len, ""; 1848 substr $_[0]{rbuf}, 0, $format + $len, "";
1849
1643 $cb->($_[0], Storable::thaw ($data)); 1850 eval { $cb->($_[0], Storable::thaw ($data)); 1 }
1851 or return $_[0]->_error (Errno::EBADMSG);
1644 } else { 1852 } else {
1645 # remove prefix 1853 # remove prefix
1646 substr $_[0]{rbuf}, 0, $format, ""; 1854 substr $_[0]{rbuf}, 0, $format, "";
1647 1855
1648 # read remaining chunk 1856 # read remaining chunk
1649 $_[0]->unshift_read (chunk => $len, sub { 1857 $_[0]->unshift_read (chunk => $len, sub {
1650 if (my $ref = eval { Storable::thaw ($_[1]) }) { 1858 eval { $cb->($_[0], Storable::thaw ($_[1])); 1 }
1651 $cb->($_[0], $ref);
1652 } else {
1653 $self->_error (Errno::EBADMSG); 1859 or $_[0]->_error (Errno::EBADMSG);
1654 }
1655 }); 1860 });
1656 } 1861 }
1657 1862
1658 1 1863 1
1659 } 1864 }
1865};
1866
1867=item tls_detect => $cb->($handle, $detect, $major, $minor)
1868
1869Checks the input stream for a valid SSL or TLS handshake TLSPaintext
1870record without consuming anything. Only SSL version 3 or higher
1871is handled, up to the fictituous protocol 4.x (but both SSL3+ and
1872SSL2-compatible framing is supported).
1873
1874If it detects that the input data is likely TLS, it calls the callback
1875with a true value for C<$detect> and the (on-wire) TLS version as second
1876and third argument (C<$major> is C<3>, and C<$minor> is 0..3 for SSL
18773.0, TLS 1.0, 1.1 and 1.2, respectively). If it detects the input to
1878be definitely not TLS, it calls the callback with a false value for
1879C<$detect>.
1880
1881The callback could use this information to decide whether or not to start
1882TLS negotiation.
1883
1884In all cases the data read so far is passed to the following read
1885handlers.
1886
1887Usually you want to use the C<tls_autostart> read type instead.
1888
1889If you want to design a protocol that works in the presence of TLS
1890dtection, make sure that any non-TLS data doesn't start with the octet 22
1891(ASCII SYN, 16 hex) or 128-255 (i.e. highest bit set). The checks this
1892read type does are a bit more strict, but might losen in the future to
1893accomodate protocol changes.
1894
1895This read type does not rely on L<AnyEvent::TLS> (and thus, not on
1896L<Net::SSLeay>).
1897
1898=item tls_autostart => $tls[, $tls_ctx]
1899
1900Tries to detect a valid SSL or TLS handshake. If one is detected, it tries
1901to start tls by calling C<starttls> with the given arguments.
1902
1903In practise, C<$tls> must be C<accept>, or a Net::SSLeay context that has
1904been configured to accept, as servers do not normally send a handshake on
1905their own and ths cannot be detected in this way.
1906
1907See C<tls_detect> above for more details.
1908
1909Example: give the client a chance to start TLS before accepting a text
1910line.
1911
1912 $hdl->push_read (tls_detect => "accept");
1913 $hdl->push_read (line => sub {
1914 print "received ", ($_[0]{tls} ? "encrypted" : "cleartext"), " <$_[1]>\n";
1915 });
1916
1917=cut
1918
1919register_read_type tls_detect => sub {
1920 my ($self, $cb) = @_;
1921
1922 sub {
1923 # this regex matches a full or partial tls record
1924 if (
1925 # ssl3+: type(22=handshake) major(=3) minor(any) length_hi
1926 $self->{rbuf} =~ /^(?:\z| \x16 (\z| [\x03\x04] (?:\z| . (?:\z| [\x00-\x40] ))))/xs
1927 # ssl2 comapatible: len_hi len_lo type(1) major minor dummy(forlength)
1928 or $self->{rbuf} =~ /^(?:\z| [\x80-\xff] (?:\z| . (?:\z| \x01 (\z| [\x03\x04] (?:\z| . (?:\z| . ))))))/xs
1929 ) {
1930 return if 3 != length $1; # partial match, can't decide yet
1931
1932 # full match, valid TLS record
1933 my ($major, $minor) = unpack "CC", $1;
1934 $cb->($self, "accept", $major + $minor * 0.1);
1935 } else {
1936 # mismatch == guaranteed not TLS
1937 $cb->($self, undef);
1938 }
1939
1940 1
1941 }
1942};
1943
1944register_read_type tls_autostart => sub {
1945 my ($self, @tls) = @_;
1946
1947 $RH{tls_detect}($self, sub {
1948 return unless $_[1];
1949 $_[0]->starttls (@tls);
1950 })
1660}; 1951};
1661 1952
1662=back 1953=back
1663 1954
1664=item custom read types - Package::anyevent_read_type $handle, $cb, @args 1955=item custom read types - Package::anyevent_read_type $handle, $cb, @args
1696Note that AnyEvent::Handle will automatically C<start_read> for you when 1987Note that AnyEvent::Handle will automatically C<start_read> for you when
1697you change the C<on_read> callback or push/unshift a read callback, and it 1988you change the C<on_read> callback or push/unshift a read callback, and it
1698will automatically C<stop_read> for you when neither C<on_read> is set nor 1989will automatically C<stop_read> for you when neither C<on_read> is set nor
1699there are any read requests in the queue. 1990there are any read requests in the queue.
1700 1991
1701These methods will have no effect when in TLS mode (as TLS doesn't support 1992In older versions of this module (<= 5.3), these methods had no effect,
1702half-duplex connections). 1993as TLS does not support half-duplex connections. In current versions they
1994work as expected, as this behaviour is required to avoid certain resource
1995attacks, where the program would be forced to read (and buffer) arbitrary
1996amounts of data before being able to send some data. The drawback is that
1997some readings of the the SSL/TLS specifications basically require this
1998attack to be working, as SSL/TLS implementations might stall sending data
1999during a rehandshake.
2000
2001As a guideline, during the initial handshake, you should not stop reading,
2002and as a client, it might cause problems, depending on your application.
1703 2003
1704=cut 2004=cut
1705 2005
1706sub stop_read { 2006sub stop_read {
1707 my ($self) = @_; 2007 my ($self) = @_;
1708 2008
1709 delete $self->{_rw} unless $self->{tls}; 2009 delete $self->{_rw};
1710} 2010}
1711 2011
1712sub start_read { 2012sub start_read {
1713 my ($self) = @_; 2013 my ($self) = @_;
1714 2014
1715 unless ($self->{_rw} || $self->{_eof}) { 2015 unless ($self->{_rw} || $self->{_eof} || !$self->{fh}) {
1716 Scalar::Util::weaken $self; 2016 Scalar::Util::weaken $self;
1717 2017
1718 $self->{_rw} = AE::io $self->{fh}, 0, sub { 2018 $self->{_rw} = AE::io $self->{fh}, 0, sub {
1719 my $rbuf = \($self->{tls} ? my $buf : $self->{rbuf}); 2019 my $rbuf = \($self->{tls} ? my $buf : $self->{rbuf});
1720 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size} || 8192, length $$rbuf; 2020 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size}, length $$rbuf;
1721 2021
1722 if ($len > 0) { 2022 if ($len > 0) {
1723 $self->{_activity} = $self->{_ractivity} = AE::now; 2023 $self->{_activity} = $self->{_ractivity} = AE::now;
1724 2024
1725 if ($self->{tls}) { 2025 if ($self->{tls}) {
1728 &_dotls ($self); 2028 &_dotls ($self);
1729 } else { 2029 } else {
1730 $self->_drain_rbuf; 2030 $self->_drain_rbuf;
1731 } 2031 }
1732 2032
2033 if ($len == $self->{read_size}) {
2034 $self->{read_size} *= 2;
2035 $self->{read_size} = $self->{max_read_size} || MAX_READ_SIZE
2036 if $self->{read_size} > ($self->{max_read_size} || MAX_READ_SIZE);
2037 }
2038
1733 } elsif (defined $len) { 2039 } elsif (defined $len) {
1734 delete $self->{_rw}; 2040 delete $self->{_rw};
1735 $self->{_eof} = 1; 2041 $self->{_eof} = 1;
1736 $self->_drain_rbuf; 2042 $self->_drain_rbuf;
1737 2043
1738 } elsif ($! != EAGAIN && $! != EINTR && $! != WSAEWOULDBLOCK) { 2044 } elsif ($! != EAGAIN && $! != EINTR && $! != EWOULDBLOCK && $! != WSAEWOULDBLOCK) {
1739 return $self->_error ($!, 1); 2045 return $self->_error ($!, 1);
1740 } 2046 }
1741 }; 2047 };
1742 } 2048 }
1743} 2049}
1749 my ($self, $err) = @_; 2055 my ($self, $err) = @_;
1750 2056
1751 return $self->_error ($!, 1) 2057 return $self->_error ($!, 1)
1752 if $err == Net::SSLeay::ERROR_SYSCALL (); 2058 if $err == Net::SSLeay::ERROR_SYSCALL ();
1753 2059
1754 my $err =Net::SSLeay::ERR_error_string (Net::SSLeay::ERR_get_error ()); 2060 my $err = Net::SSLeay::ERR_error_string (Net::SSLeay::ERR_get_error ());
1755 2061
1756 # reduce error string to look less scary 2062 # reduce error string to look less scary
1757 $err =~ s/^error:[0-9a-fA-F]{8}:[^:]+:([^:]+):/\L$1: /; 2063 $err =~ s/^error:[0-9a-fA-F]{8}:[^:]+:([^:]+):/\L$1: /;
1758 2064
1759 if ($self->{_on_starttls}) { 2065 if ($self->{_on_starttls}) {
1773sub _dotls { 2079sub _dotls {
1774 my ($self) = @_; 2080 my ($self) = @_;
1775 2081
1776 my $tmp; 2082 my $tmp;
1777 2083
1778 if (length $self->{_tls_wbuf}) { 2084 while (length $self->{_tls_wbuf}) {
1779 while (($tmp = Net::SSLeay::write ($self->{tls}, $self->{_tls_wbuf})) > 0) { 2085 if (($tmp = Net::SSLeay::write ($self->{tls}, $self->{_tls_wbuf})) <= 0) {
1780 substr $self->{_tls_wbuf}, 0, $tmp, ""; 2086 $tmp = Net::SSLeay::get_error ($self->{tls}, $tmp);
2087
2088 return $self->_tls_error ($tmp)
2089 if $tmp != $ERROR_WANT_READ
2090 && ($tmp != $ERROR_SYSCALL || $!);
2091
2092 last;
1781 } 2093 }
1782 2094
1783 $tmp = Net::SSLeay::get_error ($self->{tls}, $tmp); 2095 substr $self->{_tls_wbuf}, 0, $tmp, "";
1784 return $self->_tls_error ($tmp)
1785 if $tmp != $ERROR_WANT_READ
1786 && ($tmp != $ERROR_SYSCALL || $!);
1787 } 2096 }
1788 2097
1789 while (defined ($tmp = Net::SSLeay::read ($self->{tls}))) { 2098 while (defined ($tmp = Net::SSLeay::read ($self->{tls}))) {
1790 unless (length $tmp) { 2099 unless (length $tmp) {
1791 $self->{_on_starttls} 2100 $self->{_on_starttls}
1805 $self->{_tls_rbuf} .= $tmp; 2114 $self->{_tls_rbuf} .= $tmp;
1806 $self->_drain_rbuf; 2115 $self->_drain_rbuf;
1807 $self->{tls} or return; # tls session might have gone away in callback 2116 $self->{tls} or return; # tls session might have gone away in callback
1808 } 2117 }
1809 2118
1810 $tmp = Net::SSLeay::get_error ($self->{tls}, -1); 2119 $tmp = Net::SSLeay::get_error ($self->{tls}, -1); # -1 is not neccessarily correct, but Net::SSLeay doesn't tell us
1811 return $self->_tls_error ($tmp) 2120 return $self->_tls_error ($tmp)
1812 if $tmp != $ERROR_WANT_READ 2121 if $tmp != $ERROR_WANT_READ
1813 && ($tmp != $ERROR_SYSCALL || $!); 2122 && ($tmp != $ERROR_SYSCALL || $!);
1814 2123
1815 while (length ($tmp = Net::SSLeay::BIO_read ($self->{_wbio}))) { 2124 while (length ($tmp = Net::SSLeay::BIO_read ($self->{_wbio}))) {
1816 $self->{wbuf} .= $tmp; 2125 $self->{wbuf} .= $tmp;
1817 $self->_drain_wbuf; 2126 $self->_drain_wbuf;
2127 $self->{tls} or return; # tls session might have gone away in callback
1818 } 2128 }
1819 2129
1820 $self->{_on_starttls} 2130 $self->{_on_starttls}
1821 and Net::SSLeay::state ($self->{tls}) == Net::SSLeay::ST_OK () 2131 and Net::SSLeay::state ($self->{tls}) == Net::SSLeay::ST_OK ()
1822 and (delete $self->{_on_starttls})->($self, 1, "TLS/SSL connection established"); 2132 and (delete $self->{_on_starttls})->($self, 1, "TLS/SSL connection established");
1824 2134
1825=item $handle->starttls ($tls[, $tls_ctx]) 2135=item $handle->starttls ($tls[, $tls_ctx])
1826 2136
1827Instead of starting TLS negotiation immediately when the AnyEvent::Handle 2137Instead of starting TLS negotiation immediately when the AnyEvent::Handle
1828object is created, you can also do that at a later time by calling 2138object is created, you can also do that at a later time by calling
1829C<starttls>. 2139C<starttls>. See the C<tls> constructor argument for general info.
1830 2140
1831Starting TLS is currently an asynchronous operation - when you push some 2141Starting TLS is currently an asynchronous operation - when you push some
1832write data and then call C<< ->starttls >> then TLS negotiation will start 2142write data and then call C<< ->starttls >> then TLS negotiation will start
1833immediately, after which the queued write data is then sent. 2143immediately, after which the queued write data is then sent. This might
2144change in future versions, so best make sure you have no outstanding write
2145data when calling this method.
1834 2146
1835The first argument is the same as the C<tls> constructor argument (either 2147The first argument is the same as the C<tls> constructor argument (either
1836C<"connect">, C<"accept"> or an existing Net::SSLeay object). 2148C<"connect">, C<"accept"> or an existing Net::SSLeay object).
1837 2149
1838The second argument is the optional C<AnyEvent::TLS> object that is used 2150The second argument is the optional C<AnyEvent::TLS> object that is used
1844context in C<< $handle->{tls_ctx} >> after this call and can be used or 2156context in C<< $handle->{tls_ctx} >> after this call and can be used or
1845changed to your liking. Note that the handshake might have already started 2157changed to your liking. Note that the handshake might have already started
1846when this function returns. 2158when this function returns.
1847 2159
1848Due to bugs in OpenSSL, it might or might not be possible to do multiple 2160Due to bugs in OpenSSL, it might or might not be possible to do multiple
1849handshakes on the same stream. Best do not attempt to use the stream after 2161handshakes on the same stream. It is best to not attempt to use the
1850stopping TLS. 2162stream after stopping TLS.
2163
2164This method may invoke callbacks (and therefore the handle might be
2165destroyed after it returns).
1851 2166
1852=cut 2167=cut
1853 2168
1854our %TLS_CACHE; #TODO not yet documented, should we? 2169our %TLS_CACHE; #TODO not yet documented, should we?
1855 2170
1857 my ($self, $tls, $ctx) = @_; 2172 my ($self, $tls, $ctx) = @_;
1858 2173
1859 Carp::croak "It is an error to call starttls on an AnyEvent::Handle object while TLS is already active, caught" 2174 Carp::croak "It is an error to call starttls on an AnyEvent::Handle object while TLS is already active, caught"
1860 if $self->{tls}; 2175 if $self->{tls};
1861 2176
2177 unless (defined $AnyEvent::TLS::VERSION) {
2178 eval {
2179 require Net::SSLeay;
2180 require AnyEvent::TLS;
2181 1
2182 } or return $self->_error (Errno::EPROTO, 1, "TLS support not available on this system");
2183 }
2184
1862 $self->{tls} = $tls; 2185 $self->{tls} = $tls;
1863 $self->{tls_ctx} = $ctx if @_ > 2; 2186 $self->{tls_ctx} = $ctx if @_ > 2;
1864 2187
1865 return unless $self->{fh}; 2188 return unless $self->{fh};
1866 2189
1867 require Net::SSLeay;
1868
1869 $ERROR_SYSCALL = Net::SSLeay::ERROR_SYSCALL (); 2190 $ERROR_SYSCALL = Net::SSLeay::ERROR_SYSCALL ();
1870 $ERROR_WANT_READ = Net::SSLeay::ERROR_WANT_READ (); 2191 $ERROR_WANT_READ = Net::SSLeay::ERROR_WANT_READ ();
1871 2192
1872 $tls = delete $self->{tls}; 2193 $tls = delete $self->{tls};
1873 $ctx = $self->{tls_ctx}; 2194 $ctx = $self->{tls_ctx};
1874 2195
1875 local $Carp::CarpLevel = 1; # skip ourselves when creating a new context or session 2196 local $Carp::CarpLevel = 1; # skip ourselves when creating a new context or session
1876 2197
1877 if ("HASH" eq ref $ctx) { 2198 if ("HASH" eq ref $ctx) {
1878 require AnyEvent::TLS;
1879
1880 if ($ctx->{cache}) { 2199 if ($ctx->{cache}) {
1881 my $key = $ctx+0; 2200 my $key = $ctx+0;
1882 $ctx = $TLS_CACHE{$key} ||= new AnyEvent::TLS %$ctx; 2201 $ctx = $TLS_CACHE{$key} ||= new AnyEvent::TLS %$ctx;
1883 } else { 2202 } else {
1884 $ctx = new AnyEvent::TLS %$ctx; 2203 $ctx = new AnyEvent::TLS %$ctx;
1906 Net::SSLeay::CTX_set_mode ($tls, 1|2); 2225 Net::SSLeay::CTX_set_mode ($tls, 1|2);
1907 2226
1908 $self->{_rbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ()); 2227 $self->{_rbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ());
1909 $self->{_wbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ()); 2228 $self->{_wbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ());
1910 2229
1911 Net::SSLeay::BIO_write ($self->{_rbio}, delete $self->{rbuf}); 2230 Net::SSLeay::BIO_write ($self->{_rbio}, $self->{rbuf});
2231 $self->{rbuf} = "";
1912 2232
1913 Net::SSLeay::set_bio ($tls, $self->{_rbio}, $self->{_wbio}); 2233 Net::SSLeay::set_bio ($tls, $self->{_rbio}, $self->{_wbio});
1914 2234
1915 $self->{_on_starttls} = sub { $_[0]{on_starttls}(@_) } 2235 $self->{_on_starttls} = sub { $_[0]{on_starttls}(@_) }
1916 if $self->{on_starttls}; 2236 if $self->{on_starttls};
1921 2241
1922=item $handle->stoptls 2242=item $handle->stoptls
1923 2243
1924Shuts down the SSL connection - this makes a proper EOF handshake by 2244Shuts down the SSL connection - this makes a proper EOF handshake by
1925sending a close notify to the other side, but since OpenSSL doesn't 2245sending a close notify to the other side, but since OpenSSL doesn't
1926support non-blocking shut downs, it is not guarenteed that you can re-use 2246support non-blocking shut downs, it is not guaranteed that you can re-use
1927the stream afterwards. 2247the stream afterwards.
2248
2249This method may invoke callbacks (and therefore the handle might be
2250destroyed after it returns).
1928 2251
1929=cut 2252=cut
1930 2253
1931sub stoptls { 2254sub stoptls {
1932 my ($self) = @_; 2255 my ($self) = @_;
1933 2256
1934 if ($self->{tls}) { 2257 if ($self->{tls} && $self->{fh}) {
1935 Net::SSLeay::shutdown ($self->{tls}); 2258 Net::SSLeay::shutdown ($self->{tls});
1936 2259
1937 &_dotls; 2260 &_dotls;
1938 2261
1939# # we don't give a shit. no, we do, but we can't. no...#d# 2262# # we don't give a shit. no, we do, but we can't. no...#d#
1951 if $self->{tls} > 0; 2274 if $self->{tls} > 0;
1952 2275
1953 delete @$self{qw(_rbio _wbio _tls_wbuf _on_starttls)}; 2276 delete @$self{qw(_rbio _wbio _tls_wbuf _on_starttls)};
1954} 2277}
1955 2278
2279=item $handle->resettls
2280
2281This rarely-used method simply resets and TLS state on the handle, usually
2282causing data loss.
2283
2284One case where it may be useful is when you want to skip over the data in
2285the stream but you are not interested in interpreting it, so data loss is
2286no concern.
2287
2288=cut
2289
2290*resettls = \&_freetls;
2291
1956sub DESTROY { 2292sub DESTROY {
1957 my ($self) = @_; 2293 my ($self) = @_;
1958 2294
1959 &_freetls; 2295 &_freetls;
1960 2296
1969 push @linger, AE::io $fh, 1, sub { 2305 push @linger, AE::io $fh, 1, sub {
1970 my $len = syswrite $fh, $wbuf, length $wbuf; 2306 my $len = syswrite $fh, $wbuf, length $wbuf;
1971 2307
1972 if ($len > 0) { 2308 if ($len > 0) {
1973 substr $wbuf, 0, $len, ""; 2309 substr $wbuf, 0, $len, "";
1974 } else { 2310 } elsif (defined $len || ($! != EAGAIN && $! != EINTR && $! != EWOULDBLOCK && $! != WSAEWOULDBLOCK)) {
1975 @linger = (); # end 2311 @linger = (); # end
1976 } 2312 }
1977 }; 2313 };
1978 push @linger, AE::timer $linger, 0, sub { 2314 push @linger, AE::timer $linger, 0, sub {
1979 @linger = (); 2315 @linger = ();
2016 2352
2017sub AnyEvent::Handle::destroyed::AUTOLOAD { 2353sub AnyEvent::Handle::destroyed::AUTOLOAD {
2018 #nop 2354 #nop
2019} 2355}
2020 2356
2357=item $handle->destroyed
2358
2359Returns false as long as the handle hasn't been destroyed by a call to C<<
2360->destroy >>, true otherwise.
2361
2362Can be useful to decide whether the handle is still valid after some
2363callback possibly destroyed the handle. For example, C<< ->push_write >>,
2364C<< ->starttls >> and other methods can call user callbacks, which in turn
2365can destroy the handle, so work can be avoided by checking sometimes:
2366
2367 $hdl->starttls ("accept");
2368 return if $hdl->destroyed;
2369 $hdl->push_write (...
2370
2371Note that the call to C<push_write> will silently be ignored if the handle
2372has been destroyed, so often you can just ignore the possibility of the
2373handle being destroyed.
2374
2375=cut
2376
2377sub destroyed { 0 }
2378sub AnyEvent::Handle::destroyed::destroyed { 1 }
2379
2021=item AnyEvent::Handle::TLS_CTX 2380=item AnyEvent::Handle::TLS_CTX
2022 2381
2023This function creates and returns the AnyEvent::TLS object used by default 2382This function creates and returns the AnyEvent::TLS object used by default
2024for TLS mode. 2383for TLS mode.
2025 2384
2052 2411
2053It is only safe to "forget" the reference inside EOF or error callbacks, 2412It is only safe to "forget" the reference inside EOF or error callbacks,
2054from within all other callbacks, you need to explicitly call the C<< 2413from within all other callbacks, you need to explicitly call the C<<
2055->destroy >> method. 2414->destroy >> method.
2056 2415
2416=item Why is my C<on_eof> callback never called?
2417
2418Probably because your C<on_error> callback is being called instead: When
2419you have outstanding requests in your read queue, then an EOF is
2420considered an error as you clearly expected some data.
2421
2422To avoid this, make sure you have an empty read queue whenever your handle
2423is supposed to be "idle" (i.e. connection closes are O.K.). You can set
2424an C<on_read> handler that simply pushes the first read requests in the
2425queue.
2426
2427See also the next question, which explains this in a bit more detail.
2428
2429=item How can I serve requests in a loop?
2430
2431Most protocols consist of some setup phase (authentication for example)
2432followed by a request handling phase, where the server waits for requests
2433and handles them, in a loop.
2434
2435There are two important variants: The first (traditional, better) variant
2436handles requests until the server gets some QUIT command, causing it to
2437close the connection first (highly desirable for a busy TCP server). A
2438client dropping the connection is an error, which means this variant can
2439detect an unexpected detection close.
2440
2441To handle this case, always make sure you have a non-empty read queue, by
2442pushing the "read request start" handler on it:
2443
2444 # we assume a request starts with a single line
2445 my @start_request; @start_request = (line => sub {
2446 my ($hdl, $line) = @_;
2447
2448 ... handle request
2449
2450 # push next request read, possibly from a nested callback
2451 $hdl->push_read (@start_request);
2452 });
2453
2454 # auth done, now go into request handling loop
2455 # now push the first @start_request
2456 $hdl->push_read (@start_request);
2457
2458By always having an outstanding C<push_read>, the handle always expects
2459some data and raises the C<EPIPE> error when the connction is dropped
2460unexpectedly.
2461
2462The second variant is a protocol where the client can drop the connection
2463at any time. For TCP, this means that the server machine may run out of
2464sockets easier, and in general, it means you cannot distinguish a protocl
2465failure/client crash from a normal connection close. Nevertheless, these
2466kinds of protocols are common (and sometimes even the best solution to the
2467problem).
2468
2469Having an outstanding read request at all times is possible if you ignore
2470C<EPIPE> errors, but this doesn't help with when the client drops the
2471connection during a request, which would still be an error.
2472
2473A better solution is to push the initial request read in an C<on_read>
2474callback. This avoids an error, as when the server doesn't expect data
2475(i.e. is idly waiting for the next request, an EOF will not raise an
2476error, but simply result in an C<on_eof> callback. It is also a bit slower
2477and simpler:
2478
2479 # auth done, now go into request handling loop
2480 $hdl->on_read (sub {
2481 my ($hdl) = @_;
2482
2483 # called each time we receive data but the read queue is empty
2484 # simply start read the request
2485
2486 $hdl->push_read (line => sub {
2487 my ($hdl, $line) = @_;
2488
2489 ... handle request
2490
2491 # do nothing special when the request has been handled, just
2492 # let the request queue go empty.
2493 });
2494 });
2495
2057=item I get different callback invocations in TLS mode/Why can't I pause 2496=item I get different callback invocations in TLS mode/Why can't I pause
2058reading? 2497reading?
2059 2498
2060Unlike, say, TCP, TLS connections do not consist of two independent 2499Unlike, say, TCP, TLS connections do not consist of two independent
2061communication channels, one for each direction. Or put differently. The 2500communication channels, one for each direction. Or put differently, the
2062read and write directions are not independent of each other: you cannot 2501read and write directions are not independent of each other: you cannot
2063write data unless you are also prepared to read, and vice versa. 2502write data unless you are also prepared to read, and vice versa.
2064 2503
2065This can mean than, in TLS mode, you might get C<on_error> or C<on_eof> 2504This means that, in TLS mode, you might get C<on_error> or C<on_eof>
2066callback invocations when you are not expecting any read data - the reason 2505callback invocations when you are not expecting any read data - the reason
2067is that AnyEvent::Handle always reads in TLS mode. 2506is that AnyEvent::Handle always reads in TLS mode.
2068 2507
2069During the connection, you have to make sure that you always have a 2508During the connection, you have to make sure that you always have a
2070non-empty read-queue, or an C<on_read> watcher. At the end of the 2509non-empty read-queue, or an C<on_read> watcher. At the end of the
2082 $handle->on_eof (undef); 2521 $handle->on_eof (undef);
2083 $handle->on_error (sub { 2522 $handle->on_error (sub {
2084 my $data = delete $_[0]{rbuf}; 2523 my $data = delete $_[0]{rbuf};
2085 }); 2524 });
2086 2525
2526Note that this example removes the C<rbuf> member from the handle object,
2527which is not normally allowed by the API. It is expressly permitted in
2528this case only, as the handle object needs to be destroyed afterwards.
2529
2087The reason to use C<on_error> is that TCP connections, due to latencies 2530The reason to use C<on_error> is that TCP connections, due to latencies
2088and packets loss, might get closed quite violently with an error, when in 2531and packets loss, might get closed quite violently with an error, when in
2089fact, all data has been received. 2532fact all data has been received.
2090 2533
2091It is usually better to use acknowledgements when transferring data, 2534It is usually better to use acknowledgements when transferring data,
2092to make sure the other side hasn't just died and you got the data 2535to make sure the other side hasn't just died and you got the data
2093intact. This is also one reason why so many internet protocols have an 2536intact. This is also one reason why so many internet protocols have an
2094explicit QUIT command. 2537explicit QUIT command.
2101C<low_water_mark> this will be called precisely when all data has been 2544C<low_water_mark> this will be called precisely when all data has been
2102written to the socket: 2545written to the socket:
2103 2546
2104 $handle->push_write (...); 2547 $handle->push_write (...);
2105 $handle->on_drain (sub { 2548 $handle->on_drain (sub {
2106 warn "all data submitted to the kernel\n"; 2549 AE::log debug => "All data submitted to the kernel.";
2107 undef $handle; 2550 undef $handle;
2108 }); 2551 });
2109 2552
2110If you just want to queue some data and then signal EOF to the other side, 2553If you just want to queue some data and then signal EOF to the other side,
2111consider using C<< ->push_shutdown >> instead. 2554consider using C<< ->push_shutdown >> instead.
2112 2555
2113=item I want to contact a TLS/SSL server, I don't care about security. 2556=item I want to contact a TLS/SSL server, I don't care about security.
2114 2557
2115If your TLS server is a pure TLS server (e.g. HTTPS) that only speaks TLS, 2558If your TLS server is a pure TLS server (e.g. HTTPS) that only speaks TLS,
2116simply connect to it and then create the AnyEvent::Handle with the C<tls> 2559connect to it and then create the AnyEvent::Handle with the C<tls>
2117parameter: 2560parameter:
2118 2561
2119 tcp_connect $host, $port, sub { 2562 tcp_connect $host, $port, sub {
2120 my ($fh) = @_; 2563 my ($fh) = @_;
2121 2564
2195When you have intermediate CA certificates that your clients might not 2638When you have intermediate CA certificates that your clients might not
2196know about, just append them to the C<cert_file>. 2639know about, just append them to the C<cert_file>.
2197 2640
2198=back 2641=back
2199 2642
2200
2201=head1 SUBCLASSING AnyEvent::Handle 2643=head1 SUBCLASSING AnyEvent::Handle
2202 2644
2203In many cases, you might want to subclass AnyEvent::Handle. 2645In many cases, you might want to subclass AnyEvent::Handle.
2204 2646
2205To make this easier, a given version of AnyEvent::Handle uses these 2647To make this easier, a given version of AnyEvent::Handle uses these
2221 2663
2222=item * all members not documented here and not prefixed with an underscore 2664=item * all members not documented here and not prefixed with an underscore
2223are free to use in subclasses. 2665are free to use in subclasses.
2224 2666
2225Of course, new versions of AnyEvent::Handle may introduce more "public" 2667Of course, new versions of AnyEvent::Handle may introduce more "public"
2226member variables, but thats just life, at least it is documented. 2668member variables, but that's just life. At least it is documented.
2227 2669
2228=back 2670=back
2229 2671
2230=head1 AUTHOR 2672=head1 AUTHOR
2231 2673
2232Robin Redeker C<< <elmex at ta-sa.org> >>, Marc Lehmann <schmorp@schmorp.de>. 2674Robin Redeker C<< <elmex at ta-sa.org> >>, Marc Lehmann <schmorp@schmorp.de>.
2233 2675
2234=cut 2676=cut
2235 2677
22361; # End of AnyEvent::Handle 26781
2679

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines