ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent/lib/AnyEvent/Handle.pm
Revision: 1.52
Committed: Mon Jun 2 09:10:38 2008 UTC (15 years, 11 months ago) by root
Branch: MAIN
Changes since 1.51: +44 -49 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 elmex 1.1 package AnyEvent::Handle;
2    
3 elmex 1.6 no warnings;
4 elmex 1.1 use strict;
5    
6 root 1.8 use AnyEvent ();
7 root 1.42 use AnyEvent::Util qw(WSAEWOULDBLOCK);
8 root 1.8 use Scalar::Util ();
9     use Carp ();
10     use Fcntl ();
11 root 1.43 use Errno qw(EAGAIN EINTR);
12 elmex 1.1
13     =head1 NAME
14    
15 root 1.22 AnyEvent::Handle - non-blocking I/O on file handles via AnyEvent
16 elmex 1.1
17     =cut
18    
19 root 1.50 our $VERSION = 4.1;
20 elmex 1.1
21     =head1 SYNOPSIS
22    
23     use AnyEvent;
24     use AnyEvent::Handle;
25    
26     my $cv = AnyEvent->condvar;
27    
28 root 1.31 my $handle =
29 elmex 1.2 AnyEvent::Handle->new (
30     fh => \*STDIN,
31     on_eof => sub {
32     $cv->broadcast;
33     },
34     );
35    
36 root 1.31 # send some request line
37     $handle->push_write ("getinfo\015\012");
38    
39     # read the response line
40     $handle->push_read (line => sub {
41     my ($handle, $line) = @_;
42     warn "read line <$line>\n";
43     $cv->send;
44     });
45    
46     $cv->recv;
47 elmex 1.1
48     =head1 DESCRIPTION
49    
50 root 1.8 This module is a helper module to make it easier to do event-based I/O on
51 elmex 1.13 filehandles. For utility functions for doing non-blocking connects and accepts
52     on sockets see L<AnyEvent::Util>.
53 root 1.8
54     In the following, when the documentation refers to of "bytes" then this
55     means characters. As sysread and syswrite are used for all I/O, their
56     treatment of characters applies to this module as well.
57 elmex 1.1
58 root 1.8 All callbacks will be invoked with the handle object as their first
59     argument.
60 elmex 1.1
61     =head1 METHODS
62    
63     =over 4
64    
65     =item B<new (%args)>
66    
67 root 1.8 The constructor supports these arguments (all as key => value pairs).
68 elmex 1.1
69     =over 4
70    
71 root 1.8 =item fh => $filehandle [MANDATORY]
72 elmex 1.1
73     The filehandle this L<AnyEvent::Handle> object will operate on.
74    
75 root 1.8 NOTE: The filehandle will be set to non-blocking (using
76     AnyEvent::Util::fh_nonblocking).
77    
78 root 1.40 =item on_eof => $cb->($handle)
79 root 1.10
80 root 1.52 Set the callback to be called when an end-of-file condition is detcted,
81     i.e. in the case of a socket, when the other side has closed the
82     connection cleanly.
83 root 1.8
84 root 1.16 While not mandatory, it is highly recommended to set an eof callback,
85     otherwise you might end up with a closed socket while you are still
86     waiting for data.
87    
88 root 1.52 =item on_error => $cb->($handle, $fatal)
89 root 1.10
90 root 1.52 This is the error callback, which is called when, well, some error
91     occured, such as not being able to resolve the hostname, failure to
92     connect or a read error.
93    
94     Some errors are fatal (which is indicated by C<$fatal> being true). On
95     fatal errors the handle object will be shut down and will not be
96     usable. Non-fatal errors can be retried by simply returning, but it is
97     recommended to simply ignore this parameter and instead abondon the handle
98     object when this callback is invoked.
99 root 1.8
100 root 1.10 On callback entrance, the value of C<$!> contains the operating system
101 root 1.43 error (or C<ENOSPC>, C<EPIPE>, C<ETIMEDOUT> or C<EBADMSG>).
102 root 1.8
103 root 1.10 While not mandatory, it is I<highly> recommended to set this callback, as
104     you will not be notified of errors otherwise. The default simply calls
105 root 1.52 C<croak>.
106 root 1.8
107 root 1.40 =item on_read => $cb->($handle)
108 root 1.8
109     This sets the default read callback, which is called when data arrives
110 root 1.10 and no read request is in the queue.
111 root 1.8
112     To access (and remove data from) the read buffer, use the C<< ->rbuf >>
113 root 1.40 method or access the C<$handle->{rbuf}> member directly.
114 root 1.8
115     When an EOF condition is detected then AnyEvent::Handle will first try to
116     feed all the remaining data to the queued callbacks and C<on_read> before
117     calling the C<on_eof> callback. If no progress can be made, then a fatal
118     error will be raised (with C<$!> set to C<EPIPE>).
119 elmex 1.1
120 root 1.40 =item on_drain => $cb->($handle)
121 elmex 1.1
122 root 1.8 This sets the callback that is called when the write buffer becomes empty
123     (or when the callback is set and the buffer is empty already).
124 elmex 1.1
125 root 1.8 To append to the write buffer, use the C<< ->push_write >> method.
126 elmex 1.2
127 root 1.43 =item timeout => $fractional_seconds
128    
129     If non-zero, then this enables an "inactivity" timeout: whenever this many
130     seconds pass without a successful read or write on the underlying file
131     handle, the C<on_timeout> callback will be invoked (and if that one is
132 root 1.45 missing, an C<ETIMEDOUT> error will be raised).
133 root 1.43
134     Note that timeout processing is also active when you currently do not have
135     any outstanding read or write requests: If you plan to keep the connection
136     idle then you should disable the timout temporarily or ignore the timeout
137     in the C<on_timeout> callback.
138    
139     Zero (the default) disables this timeout.
140    
141     =item on_timeout => $cb->($handle)
142    
143     Called whenever the inactivity timeout passes. If you return from this
144     callback, then the timeout will be reset as if some activity had happened,
145     so this condition is not fatal in any way.
146    
147 root 1.8 =item rbuf_max => <bytes>
148 elmex 1.2
149 root 1.8 If defined, then a fatal error will be raised (with C<$!> set to C<ENOSPC>)
150     when the read buffer ever (strictly) exceeds this size. This is useful to
151     avoid denial-of-service attacks.
152 elmex 1.2
153 root 1.8 For example, a server accepting connections from untrusted sources should
154     be configured to accept only so-and-so much data that it cannot act on
155     (for example, when expecting a line, an attacker could send an unlimited
156     amount of data without a callback ever being called as long as the line
157     isn't finished).
158 elmex 1.2
159 root 1.8 =item read_size => <bytes>
160 elmex 1.2
161 root 1.8 The default read block size (the amount of bytes this module will try to read
162 root 1.46 during each (loop iteration). Default: C<8192>.
163 root 1.8
164     =item low_water_mark => <bytes>
165    
166     Sets the amount of bytes (default: C<0>) that make up an "empty" write
167     buffer: If the write reaches this size or gets even samller it is
168     considered empty.
169 elmex 1.2
170 root 1.19 =item tls => "accept" | "connect" | Net::SSLeay::SSL object
171    
172     When this parameter is given, it enables TLS (SSL) mode, that means it
173     will start making tls handshake and will transparently encrypt/decrypt
174     data.
175    
176 root 1.26 TLS mode requires Net::SSLeay to be installed (it will be loaded
177     automatically when you try to create a TLS handle).
178    
179 root 1.19 For the TLS server side, use C<accept>, and for the TLS client side of a
180     connection, use C<connect> mode.
181    
182     You can also provide your own TLS connection object, but you have
183     to make sure that you call either C<Net::SSLeay::set_connect_state>
184     or C<Net::SSLeay::set_accept_state> on it before you pass it to
185     AnyEvent::Handle.
186    
187 root 1.26 See the C<starttls> method if you need to start TLs negotiation later.
188    
189 root 1.19 =item tls_ctx => $ssl_ctx
190    
191     Use the given Net::SSLeay::CTX object to create the new TLS connection
192     (unless a connection object was specified directly). If this parameter is
193     missing, then AnyEvent::Handle will use C<AnyEvent::Handle::TLS_CTX>.
194    
195 root 1.40 =item json => JSON or JSON::XS object
196    
197     This is the json coder object used by the C<json> read and write types.
198    
199 root 1.41 If you don't supply it, then AnyEvent::Handle will create and use a
200     suitable one, which will write and expect UTF-8 encoded JSON texts.
201 root 1.40
202     Note that you are responsible to depend on the JSON module if you want to
203     use this functionality, as AnyEvent does not have a dependency itself.
204    
205 root 1.38 =item filter_r => $cb
206    
207     =item filter_w => $cb
208    
209     These exist, but are undocumented at this time.
210    
211 elmex 1.1 =back
212    
213     =cut
214    
215     sub new {
216 root 1.8 my $class = shift;
217    
218     my $self = bless { @_ }, $class;
219    
220     $self->{fh} or Carp::croak "mandatory argument fh is missing";
221    
222     AnyEvent::Util::fh_nonblocking $self->{fh}, 1;
223 elmex 1.1
224 root 1.19 if ($self->{tls}) {
225     require Net::SSLeay;
226     $self->starttls (delete $self->{tls}, delete $self->{tls_ctx});
227     }
228    
229 root 1.43 # $self->on_eof (delete $self->{on_eof} ) if $self->{on_eof}; # nop
230     # $self->on_error (delete $self->{on_error}) if $self->{on_error}; # nop
231     # $self->on_read (delete $self->{on_read} ) if $self->{on_read}; # nop
232 root 1.8 $self->on_drain (delete $self->{on_drain}) if $self->{on_drain};
233 root 1.43
234 root 1.44 $self->{_activity} = AnyEvent->now;
235 root 1.43 $self->_timeout;
236 elmex 1.1
237 root 1.10 $self->start_read;
238    
239 root 1.8 $self
240     }
241 elmex 1.2
242 root 1.8 sub _shutdown {
243     my ($self) = @_;
244 elmex 1.2
245 root 1.46 delete $self->{_tw};
246 root 1.38 delete $self->{_rw};
247     delete $self->{_ww};
248 root 1.8 delete $self->{fh};
249 root 1.52
250     $self->stoptls;
251 root 1.8 }
252    
253 root 1.52 sub _error {
254     my ($self, $errno, $fatal) = @_;
255 root 1.8
256 root 1.52 $self->_shutdown
257     if $fatal;
258 elmex 1.1
259 root 1.52 $! = $errno;
260 root 1.37
261 root 1.52 if ($self->{on_error}) {
262     $self->{on_error}($self, $fatal);
263     } else {
264     Carp::croak "AnyEvent::Handle uncaught error: $!";
265     }
266 elmex 1.1 }
267    
268 root 1.8 =item $fh = $handle->fh
269 elmex 1.1
270 root 1.22 This method returns the file handle of the L<AnyEvent::Handle> object.
271 elmex 1.1
272     =cut
273    
274 root 1.38 sub fh { $_[0]{fh} }
275 elmex 1.1
276 root 1.8 =item $handle->on_error ($cb)
277 elmex 1.1
278 root 1.8 Replace the current C<on_error> callback (see the C<on_error> constructor argument).
279 elmex 1.1
280 root 1.8 =cut
281    
282     sub on_error {
283     $_[0]{on_error} = $_[1];
284     }
285    
286     =item $handle->on_eof ($cb)
287    
288     Replace the current C<on_eof> callback (see the C<on_eof> constructor argument).
289 elmex 1.1
290     =cut
291    
292 root 1.8 sub on_eof {
293     $_[0]{on_eof} = $_[1];
294     }
295    
296 root 1.43 =item $handle->on_timeout ($cb)
297    
298     Replace the current C<on_timeout> callback, or disables the callback
299     (but not the timeout) if C<$cb> = C<undef>. See C<timeout> constructor
300     argument.
301    
302     =cut
303    
304     sub on_timeout {
305     $_[0]{on_timeout} = $_[1];
306     }
307    
308     #############################################################################
309    
310     =item $handle->timeout ($seconds)
311    
312     Configures (or disables) the inactivity timeout.
313    
314     =cut
315    
316     sub timeout {
317     my ($self, $timeout) = @_;
318    
319     $self->{timeout} = $timeout;
320     $self->_timeout;
321     }
322    
323     # reset the timeout watcher, as neccessary
324     # also check for time-outs
325     sub _timeout {
326     my ($self) = @_;
327    
328     if ($self->{timeout}) {
329 root 1.44 my $NOW = AnyEvent->now;
330 root 1.43
331     # when would the timeout trigger?
332     my $after = $self->{_activity} + $self->{timeout} - $NOW;
333    
334     # now or in the past already?
335     if ($after <= 0) {
336     $self->{_activity} = $NOW;
337    
338     if ($self->{on_timeout}) {
339 root 1.48 $self->{on_timeout}($self);
340 root 1.43 } else {
341 root 1.52 $self->_error (&Errno::ETIMEDOUT);
342 root 1.43 }
343    
344     # callbakx could have changed timeout value, optimise
345     return unless $self->{timeout};
346    
347     # calculate new after
348     $after = $self->{timeout};
349     }
350    
351     Scalar::Util::weaken $self;
352    
353     $self->{_tw} ||= AnyEvent->timer (after => $after, cb => sub {
354     delete $self->{_tw};
355     $self->_timeout;
356     });
357     } else {
358     delete $self->{_tw};
359     }
360     }
361    
362 root 1.9 #############################################################################
363    
364     =back
365    
366     =head2 WRITE QUEUE
367    
368     AnyEvent::Handle manages two queues per handle, one for writing and one
369     for reading.
370    
371     The write queue is very simple: you can add data to its end, and
372     AnyEvent::Handle will automatically try to get rid of it for you.
373    
374 elmex 1.20 When data could be written and the write buffer is shorter then the low
375 root 1.9 water mark, the C<on_drain> callback will be invoked.
376    
377     =over 4
378    
379 root 1.8 =item $handle->on_drain ($cb)
380    
381     Sets the C<on_drain> callback or clears it (see the description of
382     C<on_drain> in the constructor).
383    
384     =cut
385    
386     sub on_drain {
387 elmex 1.1 my ($self, $cb) = @_;
388    
389 root 1.8 $self->{on_drain} = $cb;
390    
391     $cb->($self)
392     if $cb && $self->{low_water_mark} >= length $self->{wbuf};
393     }
394    
395     =item $handle->push_write ($data)
396    
397     Queues the given scalar to be written. You can push as much data as you
398     want (only limited by the available memory), as C<AnyEvent::Handle>
399     buffers it independently of the kernel.
400    
401     =cut
402    
403 root 1.17 sub _drain_wbuf {
404     my ($self) = @_;
405 root 1.8
406 root 1.38 if (!$self->{_ww} && length $self->{wbuf}) {
407 root 1.35
408 root 1.8 Scalar::Util::weaken $self;
409 root 1.35
410 root 1.8 my $cb = sub {
411     my $len = syswrite $self->{fh}, $self->{wbuf};
412    
413 root 1.29 if ($len >= 0) {
414 root 1.8 substr $self->{wbuf}, 0, $len, "";
415    
416 root 1.44 $self->{_activity} = AnyEvent->now;
417 root 1.43
418 root 1.8 $self->{on_drain}($self)
419     if $self->{low_water_mark} >= length $self->{wbuf}
420     && $self->{on_drain};
421    
422 root 1.38 delete $self->{_ww} unless length $self->{wbuf};
423 root 1.42 } elsif ($! != EAGAIN && $! != EINTR && $! != WSAEWOULDBLOCK) {
424 root 1.52 $self->_error ($!, 1);
425 elmex 1.1 }
426 root 1.8 };
427    
428 root 1.35 # try to write data immediately
429     $cb->();
430 root 1.8
431 root 1.35 # if still data left in wbuf, we need to poll
432 root 1.38 $self->{_ww} = AnyEvent->io (fh => $self->{fh}, poll => "w", cb => $cb)
433 root 1.35 if length $self->{wbuf};
434 root 1.8 };
435     }
436    
437 root 1.30 our %WH;
438    
439     sub register_write_type($$) {
440     $WH{$_[0]} = $_[1];
441     }
442    
443 root 1.17 sub push_write {
444     my $self = shift;
445    
446 root 1.29 if (@_ > 1) {
447     my $type = shift;
448    
449     @_ = ($WH{$type} or Carp::croak "unsupported type passed to AnyEvent::Handle::push_write")
450     ->($self, @_);
451     }
452    
453 root 1.17 if ($self->{filter_w}) {
454 root 1.48 $self->{filter_w}($self, \$_[0]);
455 root 1.17 } else {
456     $self->{wbuf} .= $_[0];
457     $self->_drain_wbuf;
458     }
459     }
460    
461 root 1.29 =item $handle->push_write (type => @args)
462    
463     Instead of formatting your data yourself, you can also let this module do
464     the job by specifying a type and type-specific arguments.
465    
466 root 1.30 Predefined types are (if you have ideas for additional types, feel free to
467     drop by and tell us):
468 root 1.29
469     =over 4
470    
471     =item netstring => $string
472    
473     Formats the given value as netstring
474     (http://cr.yp.to/proto/netstrings.txt, this is not a recommendation to use them).
475    
476     =cut
477    
478     register_write_type netstring => sub {
479     my ($self, $string) = @_;
480    
481     sprintf "%d:%s,", (length $string), $string
482     };
483    
484 root 1.39 =item json => $array_or_hashref
485    
486 root 1.40 Encodes the given hash or array reference into a JSON object. Unless you
487     provide your own JSON object, this means it will be encoded to JSON text
488     in UTF-8.
489    
490     JSON objects (and arrays) are self-delimiting, so you can write JSON at
491     one end of a handle and read them at the other end without using any
492     additional framing.
493    
494 root 1.41 The generated JSON text is guaranteed not to contain any newlines: While
495     this module doesn't need delimiters after or between JSON texts to be
496     able to read them, many other languages depend on that.
497    
498     A simple RPC protocol that interoperates easily with others is to send
499     JSON arrays (or objects, although arrays are usually the better choice as
500     they mimic how function argument passing works) and a newline after each
501     JSON text:
502    
503     $handle->push_write (json => ["method", "arg1", "arg2"]); # whatever
504     $handle->push_write ("\012");
505    
506     An AnyEvent::Handle receiver would simply use the C<json> read type and
507     rely on the fact that the newline will be skipped as leading whitespace:
508    
509     $handle->push_read (json => sub { my $array = $_[1]; ... });
510    
511     Other languages could read single lines terminated by a newline and pass
512     this line into their JSON decoder of choice.
513    
514 root 1.40 =cut
515    
516     register_write_type json => sub {
517     my ($self, $ref) = @_;
518    
519     require JSON;
520    
521     $self->{json} ? $self->{json}->encode ($ref)
522     : JSON::encode_json ($ref)
523     };
524    
525     =item AnyEvent::Handle::register_write_type type => $coderef->($handle, @args)
526 root 1.30
527     This function (not method) lets you add your own types to C<push_write>.
528     Whenever the given C<type> is used, C<push_write> will invoke the code
529     reference with the handle object and the remaining arguments.
530 root 1.29
531 root 1.30 The code reference is supposed to return a single octet string that will
532     be appended to the write buffer.
533 root 1.29
534 root 1.30 Note that this is a function, and all types registered this way will be
535     global, so try to use unique names.
536 root 1.29
537 root 1.52 =back
538    
539 root 1.30 =cut
540 root 1.29
541 root 1.8 #############################################################################
542    
543 root 1.9 =back
544    
545     =head2 READ QUEUE
546    
547     AnyEvent::Handle manages two queues per handle, one for writing and one
548     for reading.
549    
550     The read queue is more complex than the write queue. It can be used in two
551     ways, the "simple" way, using only C<on_read> and the "complex" way, using
552     a queue.
553    
554     In the simple case, you just install an C<on_read> callback and whenever
555     new data arrives, it will be called. You can then remove some data (if
556     enough is there) from the read buffer (C<< $handle->rbuf >>) if you want
557     or not.
558    
559     In the more complex case, you want to queue multiple callbacks. In this
560     case, AnyEvent::Handle will call the first queued callback each time new
561     data arrives and removes it when it has done its job (see C<push_read>,
562     below).
563    
564     This way you can, for example, push three line-reads, followed by reading
565     a chunk of data, and AnyEvent::Handle will execute them in order.
566    
567     Example 1: EPP protocol parser. EPP sends 4 byte length info, followed by
568     the specified number of bytes which give an XML datagram.
569    
570     # in the default state, expect some header bytes
571     $handle->on_read (sub {
572     # some data is here, now queue the length-header-read (4 octets)
573 root 1.52 shift->unshift_read (chunk => 4, sub {
574 root 1.9 # header arrived, decode
575     my $len = unpack "N", $_[1];
576    
577     # now read the payload
578 root 1.52 shift->unshift_read (chunk => $len, sub {
579 root 1.9 my $xml = $_[1];
580     # handle xml
581     });
582     });
583     });
584    
585     Example 2: Implement a client for a protocol that replies either with
586     "OK" and another line or "ERROR" for one request, and 64 bytes for the
587     second request. Due tot he availability of a full queue, we can just
588     pipeline sending both requests and manipulate the queue as necessary in
589     the callbacks:
590    
591     # request one
592     $handle->push_write ("request 1\015\012");
593    
594     # we expect "ERROR" or "OK" as response, so push a line read
595 root 1.52 $handle->push_read (line => sub {
596 root 1.9 # if we got an "OK", we have to _prepend_ another line,
597     # so it will be read before the second request reads its 64 bytes
598     # which are already in the queue when this callback is called
599     # we don't do this in case we got an error
600     if ($_[1] eq "OK") {
601 root 1.52 $_[0]->unshift_read (line => sub {
602 root 1.9 my $response = $_[1];
603     ...
604     });
605     }
606     });
607    
608     # request two
609     $handle->push_write ("request 2\015\012");
610    
611     # simply read 64 bytes, always
612 root 1.52 $handle->push_read (chunk => 64, sub {
613 root 1.9 my $response = $_[1];
614     ...
615     });
616    
617     =over 4
618    
619 root 1.10 =cut
620    
621 root 1.8 sub _drain_rbuf {
622     my ($self) = @_;
623 elmex 1.1
624 root 1.17 if (
625     defined $self->{rbuf_max}
626     && $self->{rbuf_max} < length $self->{rbuf}
627     ) {
628 root 1.52 return $self->_error (&Errno::ENOSPC, 1);
629 root 1.17 }
630    
631 root 1.11 return if $self->{in_drain};
632 root 1.8 local $self->{in_drain} = 1;
633 elmex 1.1
634 root 1.8 while (my $len = length $self->{rbuf}) {
635     no strict 'refs';
636 root 1.38 if (my $cb = shift @{ $self->{_queue} }) {
637 root 1.29 unless ($cb->($self)) {
638 root 1.38 if ($self->{_eof}) {
639 root 1.10 # no progress can be made (not enough data and no data forthcoming)
640 root 1.52 return $self->_error (&Errno::EPIPE, 1);
641 root 1.10 }
642    
643 root 1.38 unshift @{ $self->{_queue} }, $cb;
644 root 1.8 return;
645     }
646     } elsif ($self->{on_read}) {
647     $self->{on_read}($self);
648    
649     if (
650 root 1.38 $self->{_eof} # if no further data will arrive
651 root 1.8 && $len == length $self->{rbuf} # and no data has been consumed
652 root 1.38 && !@{ $self->{_queue} } # and the queue is still empty
653 root 1.8 && $self->{on_read} # and we still want to read data
654     ) {
655     # then no progress can be made
656 root 1.52 return $self->_error (&Errno::EPIPE, 1);
657 elmex 1.1 }
658 root 1.8 } else {
659     # read side becomes idle
660 root 1.38 delete $self->{_rw};
661 root 1.8 return;
662     }
663     }
664    
665 root 1.48 $self->{on_eof}($self)
666     if $self->{_eof} && $self->{on_eof};
667 elmex 1.1 }
668    
669 root 1.8 =item $handle->on_read ($cb)
670 elmex 1.1
671 root 1.8 This replaces the currently set C<on_read> callback, or clears it (when
672     the new callback is C<undef>). See the description of C<on_read> in the
673     constructor.
674 elmex 1.1
675 root 1.8 =cut
676    
677     sub on_read {
678     my ($self, $cb) = @_;
679 elmex 1.1
680 root 1.8 $self->{on_read} = $cb;
681 elmex 1.1 }
682    
683 root 1.8 =item $handle->rbuf
684    
685     Returns the read buffer (as a modifiable lvalue).
686 elmex 1.1
687 root 1.8 You can access the read buffer directly as the C<< ->{rbuf} >> member, if
688     you want.
689 elmex 1.1
690 root 1.8 NOTE: The read buffer should only be used or modified if the C<on_read>,
691     C<push_read> or C<unshift_read> methods are used. The other read methods
692     automatically manage the read buffer.
693 elmex 1.1
694     =cut
695    
696 elmex 1.2 sub rbuf : lvalue {
697 root 1.8 $_[0]{rbuf}
698 elmex 1.2 }
699 elmex 1.1
700 root 1.8 =item $handle->push_read ($cb)
701    
702     =item $handle->unshift_read ($cb)
703    
704     Append the given callback to the end of the queue (C<push_read>) or
705     prepend it (C<unshift_read>).
706    
707     The callback is called each time some additional read data arrives.
708 elmex 1.1
709 elmex 1.20 It must check whether enough data is in the read buffer already.
710 elmex 1.1
711 root 1.8 If not enough data is available, it must return the empty list or a false
712     value, in which case it will be called repeatedly until enough data is
713     available (or an error condition is detected).
714    
715     If enough data was available, then the callback must remove all data it is
716     interested in (which can be none at all) and return a true value. After returning
717     true, it will be removed from the queue.
718 elmex 1.1
719     =cut
720    
721 root 1.30 our %RH;
722    
723     sub register_read_type($$) {
724     $RH{$_[0]} = $_[1];
725     }
726    
727 root 1.8 sub push_read {
728 root 1.28 my $self = shift;
729     my $cb = pop;
730    
731     if (@_) {
732     my $type = shift;
733    
734     $cb = ($RH{$type} or Carp::croak "unsupported type passed to AnyEvent::Handle::push_read")
735     ->($self, $cb, @_);
736     }
737 elmex 1.1
738 root 1.38 push @{ $self->{_queue} }, $cb;
739 root 1.8 $self->_drain_rbuf;
740 elmex 1.1 }
741    
742 root 1.8 sub unshift_read {
743 root 1.28 my $self = shift;
744     my $cb = pop;
745    
746     if (@_) {
747     my $type = shift;
748    
749     $cb = ($RH{$type} or Carp::croak "unsupported type passed to AnyEvent::Handle::unshift_read")
750     ->($self, $cb, @_);
751     }
752    
753 root 1.8
754 root 1.38 unshift @{ $self->{_queue} }, $cb;
755 root 1.8 $self->_drain_rbuf;
756     }
757 elmex 1.1
758 root 1.28 =item $handle->push_read (type => @args, $cb)
759 elmex 1.1
760 root 1.28 =item $handle->unshift_read (type => @args, $cb)
761 elmex 1.1
762 root 1.28 Instead of providing a callback that parses the data itself you can chose
763     between a number of predefined parsing formats, for chunks of data, lines
764     etc.
765 elmex 1.1
766 root 1.30 Predefined types are (if you have ideas for additional types, feel free to
767     drop by and tell us):
768 root 1.28
769     =over 4
770    
771 root 1.40 =item chunk => $octets, $cb->($handle, $data)
772 root 1.28
773     Invoke the callback only once C<$octets> bytes have been read. Pass the
774     data read to the callback. The callback will never be called with less
775     data.
776    
777     Example: read 2 bytes.
778    
779     $handle->push_read (chunk => 2, sub {
780     warn "yay ", unpack "H*", $_[1];
781     });
782 elmex 1.1
783     =cut
784    
785 root 1.28 register_read_type chunk => sub {
786     my ($self, $cb, $len) = @_;
787 elmex 1.1
788 root 1.8 sub {
789     $len <= length $_[0]{rbuf} or return;
790 elmex 1.12 $cb->($_[0], substr $_[0]{rbuf}, 0, $len, "");
791 root 1.8 1
792     }
793 root 1.28 };
794 root 1.8
795 root 1.28 # compatibility with older API
796 root 1.8 sub push_read_chunk {
797 root 1.28 $_[0]->push_read (chunk => $_[1], $_[2]);
798 root 1.8 }
799 elmex 1.1
800 root 1.8 sub unshift_read_chunk {
801 root 1.28 $_[0]->unshift_read (chunk => $_[1], $_[2]);
802 elmex 1.1 }
803    
804 root 1.40 =item line => [$eol, ]$cb->($handle, $line, $eol)
805 elmex 1.1
806 root 1.8 The callback will be called only once a full line (including the end of
807     line marker, C<$eol>) has been read. This line (excluding the end of line
808     marker) will be passed to the callback as second argument (C<$line>), and
809     the end of line marker as the third argument (C<$eol>).
810 elmex 1.1
811 root 1.8 The end of line marker, C<$eol>, can be either a string, in which case it
812     will be interpreted as a fixed record end marker, or it can be a regex
813     object (e.g. created by C<qr>), in which case it is interpreted as a
814     regular expression.
815 elmex 1.1
816 root 1.8 The end of line marker argument C<$eol> is optional, if it is missing (NOT
817     undef), then C<qr|\015?\012|> is used (which is good for most internet
818     protocols).
819 elmex 1.1
820 root 1.8 Partial lines at the end of the stream will never be returned, as they are
821     not marked by the end of line marker.
822 elmex 1.1
823 root 1.8 =cut
824 elmex 1.1
825 root 1.28 register_read_type line => sub {
826     my ($self, $cb, $eol) = @_;
827 elmex 1.1
828 root 1.28 $eol = qr|(\015?\012)| if @_ < 3;
829 root 1.14 $eol = quotemeta $eol unless ref $eol;
830     $eol = qr|^(.*?)($eol)|s;
831 elmex 1.1
832 root 1.8 sub {
833     $_[0]{rbuf} =~ s/$eol// or return;
834 elmex 1.1
835 elmex 1.12 $cb->($_[0], $1, $2);
836 root 1.8 1
837     }
838 root 1.28 };
839 elmex 1.1
840 root 1.28 # compatibility with older API
841 root 1.8 sub push_read_line {
842 root 1.28 my $self = shift;
843     $self->push_read (line => @_);
844 root 1.10 }
845    
846     sub unshift_read_line {
847 root 1.28 my $self = shift;
848     $self->unshift_read (line => @_);
849 root 1.10 }
850    
851 root 1.40 =item netstring => $cb->($handle, $string)
852 root 1.29
853     A netstring (http://cr.yp.to/proto/netstrings.txt, this is not an endorsement).
854    
855     Throws an error with C<$!> set to EBADMSG on format violations.
856    
857     =cut
858    
859     register_read_type netstring => sub {
860     my ($self, $cb) = @_;
861    
862     sub {
863     unless ($_[0]{rbuf} =~ s/^(0|[1-9][0-9]*)://) {
864     if ($_[0]{rbuf} =~ /[^0-9]/) {
865 root 1.52 $self->_error (&Errno::EBADMSG);
866 root 1.29 }
867     return;
868     }
869    
870     my $len = $1;
871    
872     $self->unshift_read (chunk => $len, sub {
873     my $string = $_[1];
874     $_[0]->unshift_read (chunk => 1, sub {
875     if ($_[1] eq ",") {
876     $cb->($_[0], $string);
877     } else {
878 root 1.52 $self->_error (&Errno::EBADMSG);
879 root 1.29 }
880     });
881     });
882    
883     1
884     }
885     };
886    
887 root 1.40 =item regex => $accept[, $reject[, $skip], $cb->($handle, $data)
888 root 1.36
889     Makes a regex match against the regex object C<$accept> and returns
890     everything up to and including the match.
891    
892     Example: read a single line terminated by '\n'.
893    
894     $handle->push_read (regex => qr<\n>, sub { ... });
895    
896     If C<$reject> is given and not undef, then it determines when the data is
897     to be rejected: it is matched against the data when the C<$accept> regex
898     does not match and generates an C<EBADMSG> error when it matches. This is
899     useful to quickly reject wrong data (to avoid waiting for a timeout or a
900     receive buffer overflow).
901    
902     Example: expect a single decimal number followed by whitespace, reject
903     anything else (not the use of an anchor).
904    
905     $handle->push_read (regex => qr<^[0-9]+\s>, qr<[^0-9]>, sub { ... });
906    
907     If C<$skip> is given and not C<undef>, then it will be matched against
908     the receive buffer when neither C<$accept> nor C<$reject> match,
909     and everything preceding and including the match will be accepted
910     unconditionally. This is useful to skip large amounts of data that you
911     know cannot be matched, so that the C<$accept> or C<$reject> regex do not
912     have to start matching from the beginning. This is purely an optimisation
913     and is usually worth only when you expect more than a few kilobytes.
914    
915     Example: expect a http header, which ends at C<\015\012\015\012>. Since we
916     expect the header to be very large (it isn't in practise, but...), we use
917     a skip regex to skip initial portions. The skip regex is tricky in that
918     it only accepts something not ending in either \015 or \012, as these are
919     required for the accept regex.
920    
921     $handle->push_read (regex =>
922     qr<\015\012\015\012>,
923     undef, # no reject
924     qr<^.*[^\015\012]>,
925     sub { ... });
926    
927     =cut
928    
929     register_read_type regex => sub {
930     my ($self, $cb, $accept, $reject, $skip) = @_;
931    
932     my $data;
933     my $rbuf = \$self->{rbuf};
934    
935     sub {
936     # accept
937     if ($$rbuf =~ $accept) {
938     $data .= substr $$rbuf, 0, $+[0], "";
939     $cb->($self, $data);
940     return 1;
941     }
942    
943     # reject
944     if ($reject && $$rbuf =~ $reject) {
945 root 1.52 $self->_error (&Errno::EBADMSG);
946 root 1.36 }
947    
948     # skip
949     if ($skip && $$rbuf =~ $skip) {
950     $data .= substr $$rbuf, 0, $+[0], "";
951     }
952    
953     ()
954     }
955     };
956    
957 root 1.40 =item json => $cb->($handle, $hash_or_arrayref)
958    
959     Reads a JSON object or array, decodes it and passes it to the callback.
960    
961     If a C<json> object was passed to the constructor, then that will be used
962     for the final decode, otherwise it will create a JSON coder expecting UTF-8.
963    
964     This read type uses the incremental parser available with JSON version
965     2.09 (and JSON::XS version 2.2) and above. You have to provide a
966     dependency on your own: this module will load the JSON module, but
967     AnyEvent does not depend on it itself.
968    
969     Since JSON texts are fully self-delimiting, the C<json> read and write
970 root 1.41 types are an ideal simple RPC protocol: just exchange JSON datagrams. See
971     the C<json> write type description, above, for an actual example.
972 root 1.40
973     =cut
974    
975     register_read_type json => sub {
976     my ($self, $cb, $accept, $reject, $skip) = @_;
977    
978     require JSON;
979    
980     my $data;
981     my $rbuf = \$self->{rbuf};
982    
983 root 1.41 my $json = $self->{json} ||= JSON->new->utf8;
984 root 1.40
985     sub {
986     my $ref = $json->incr_parse ($self->{rbuf});
987    
988     if ($ref) {
989     $self->{rbuf} = $json->incr_text;
990     $json->incr_text = "";
991     $cb->($self, $ref);
992    
993     1
994     } else {
995     $self->{rbuf} = "";
996     ()
997     }
998     }
999     };
1000    
1001 root 1.28 =back
1002    
1003 root 1.40 =item AnyEvent::Handle::register_read_type type => $coderef->($handle, $cb, @args)
1004 root 1.30
1005     This function (not method) lets you add your own types to C<push_read>.
1006    
1007     Whenever the given C<type> is used, C<push_read> will invoke the code
1008     reference with the handle object, the callback and the remaining
1009     arguments.
1010    
1011     The code reference is supposed to return a callback (usually a closure)
1012     that works as a plain read callback (see C<< ->push_read ($cb) >>).
1013    
1014     It should invoke the passed callback when it is done reading (remember to
1015 root 1.40 pass C<$handle> as first argument as all other callbacks do that).
1016 root 1.30
1017     Note that this is a function, and all types registered this way will be
1018     global, so try to use unique names.
1019    
1020     For examples, see the source of this module (F<perldoc -m AnyEvent::Handle>,
1021     search for C<register_read_type>)).
1022    
1023 root 1.10 =item $handle->stop_read
1024    
1025     =item $handle->start_read
1026    
1027 root 1.18 In rare cases you actually do not want to read anything from the
1028 root 1.10 socket. In this case you can call C<stop_read>. Neither C<on_read> no
1029 root 1.22 any queued callbacks will be executed then. To start reading again, call
1030 root 1.10 C<start_read>.
1031    
1032     =cut
1033    
1034     sub stop_read {
1035     my ($self) = @_;
1036 elmex 1.1
1037 root 1.38 delete $self->{_rw};
1038 root 1.8 }
1039 elmex 1.1
1040 root 1.10 sub start_read {
1041     my ($self) = @_;
1042    
1043 root 1.38 unless ($self->{_rw} || $self->{_eof}) {
1044 root 1.10 Scalar::Util::weaken $self;
1045    
1046 root 1.38 $self->{_rw} = AnyEvent->io (fh => $self->{fh}, poll => "r", cb => sub {
1047 root 1.17 my $rbuf = $self->{filter_r} ? \my $buf : \$self->{rbuf};
1048     my $len = sysread $self->{fh}, $$rbuf, $self->{read_size} || 8192, length $$rbuf;
1049 root 1.10
1050     if ($len > 0) {
1051 root 1.44 $self->{_activity} = AnyEvent->now;
1052 root 1.43
1053 root 1.17 $self->{filter_r}
1054 root 1.48 ? $self->{filter_r}($self, $rbuf)
1055 root 1.17 : $self->_drain_rbuf;
1056 root 1.10
1057     } elsif (defined $len) {
1058 root 1.38 delete $self->{_rw};
1059     $self->{_eof} = 1;
1060 root 1.17 $self->_drain_rbuf;
1061 root 1.10
1062 root 1.42 } elsif ($! != EAGAIN && $! != EINTR && $! != WSAEWOULDBLOCK) {
1063 root 1.52 return $self->_error ($!, 1);
1064 root 1.10 }
1065     });
1066     }
1067 elmex 1.1 }
1068    
1069 root 1.19 sub _dotls {
1070     my ($self) = @_;
1071    
1072 root 1.38 if (length $self->{_tls_wbuf}) {
1073     while ((my $len = Net::SSLeay::write ($self->{tls}, $self->{_tls_wbuf})) > 0) {
1074     substr $self->{_tls_wbuf}, 0, $len, "";
1075 root 1.22 }
1076 root 1.19 }
1077    
1078 root 1.38 if (defined (my $buf = Net::SSLeay::BIO_read ($self->{_wbio}))) {
1079 root 1.19 $self->{wbuf} .= $buf;
1080     $self->_drain_wbuf;
1081     }
1082    
1083 root 1.23 while (defined (my $buf = Net::SSLeay::read ($self->{tls}))) {
1084     $self->{rbuf} .= $buf;
1085     $self->_drain_rbuf;
1086     }
1087    
1088 root 1.24 my $err = Net::SSLeay::get_error ($self->{tls}, -1);
1089    
1090     if ($err!= Net::SSLeay::ERROR_WANT_READ ()) {
1091 root 1.23 if ($err == Net::SSLeay::ERROR_SYSCALL ()) {
1092 root 1.52 return $self->_error ($!, 1);
1093 root 1.23 } elsif ($err == Net::SSLeay::ERROR_SSL ()) {
1094 root 1.52 return $self->_error (&Errno::EIO, 1);
1095 root 1.19 }
1096 root 1.23
1097     # all others are fine for our purposes
1098 root 1.19 }
1099     }
1100    
1101 root 1.25 =item $handle->starttls ($tls[, $tls_ctx])
1102    
1103     Instead of starting TLS negotiation immediately when the AnyEvent::Handle
1104     object is created, you can also do that at a later time by calling
1105     C<starttls>.
1106    
1107     The first argument is the same as the C<tls> constructor argument (either
1108     C<"connect">, C<"accept"> or an existing Net::SSLeay object).
1109    
1110     The second argument is the optional C<Net::SSLeay::CTX> object that is
1111     used when AnyEvent::Handle has to create its own TLS connection object.
1112    
1113 root 1.38 The TLS connection object will end up in C<< $handle->{tls} >> after this
1114     call and can be used or changed to your liking. Note that the handshake
1115     might have already started when this function returns.
1116    
1117 root 1.25 =cut
1118    
1119 root 1.19 sub starttls {
1120     my ($self, $ssl, $ctx) = @_;
1121    
1122 root 1.25 $self->stoptls;
1123    
1124 root 1.19 if ($ssl eq "accept") {
1125     $ssl = Net::SSLeay::new ($ctx || TLS_CTX ());
1126     Net::SSLeay::set_accept_state ($ssl);
1127     } elsif ($ssl eq "connect") {
1128     $ssl = Net::SSLeay::new ($ctx || TLS_CTX ());
1129     Net::SSLeay::set_connect_state ($ssl);
1130     }
1131    
1132     $self->{tls} = $ssl;
1133    
1134 root 1.21 # basically, this is deep magic (because SSL_read should have the same issues)
1135     # but the openssl maintainers basically said: "trust us, it just works".
1136     # (unfortunately, we have to hardcode constants because the abysmally misdesigned
1137     # and mismaintained ssleay-module doesn't even offer them).
1138 root 1.27 # http://www.mail-archive.com/openssl-dev@openssl.org/msg22420.html
1139 root 1.21 Net::SSLeay::CTX_set_mode ($self->{tls},
1140 root 1.34 (eval { local $SIG{__DIE__}; Net::SSLeay::MODE_ENABLE_PARTIAL_WRITE () } || 1)
1141     | (eval { local $SIG{__DIE__}; Net::SSLeay::MODE_ACCEPT_MOVING_WRITE_BUFFER () } || 2));
1142 root 1.21
1143 root 1.38 $self->{_rbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ());
1144     $self->{_wbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ());
1145 root 1.19
1146 root 1.38 Net::SSLeay::set_bio ($ssl, $self->{_rbio}, $self->{_wbio});
1147 root 1.19
1148     $self->{filter_w} = sub {
1149 root 1.38 $_[0]{_tls_wbuf} .= ${$_[1]};
1150 root 1.19 &_dotls;
1151     };
1152     $self->{filter_r} = sub {
1153 root 1.38 Net::SSLeay::BIO_write ($_[0]{_rbio}, ${$_[1]});
1154 root 1.19 &_dotls;
1155     };
1156     }
1157    
1158 root 1.25 =item $handle->stoptls
1159    
1160     Destroys the SSL connection, if any. Partial read or write data will be
1161     lost.
1162    
1163     =cut
1164    
1165     sub stoptls {
1166     my ($self) = @_;
1167    
1168     Net::SSLeay::free (delete $self->{tls}) if $self->{tls};
1169 root 1.38
1170     delete $self->{_rbio};
1171     delete $self->{_wbio};
1172     delete $self->{_tls_wbuf};
1173 root 1.25 delete $self->{filter_r};
1174     delete $self->{filter_w};
1175     }
1176    
1177 root 1.19 sub DESTROY {
1178     my $self = shift;
1179    
1180 root 1.25 $self->stoptls;
1181 root 1.19 }
1182    
1183     =item AnyEvent::Handle::TLS_CTX
1184    
1185     This function creates and returns the Net::SSLeay::CTX object used by
1186     default for TLS mode.
1187    
1188     The context is created like this:
1189    
1190     Net::SSLeay::load_error_strings;
1191     Net::SSLeay::SSLeay_add_ssl_algorithms;
1192     Net::SSLeay::randomize;
1193    
1194     my $CTX = Net::SSLeay::CTX_new;
1195    
1196     Net::SSLeay::CTX_set_options $CTX, Net::SSLeay::OP_ALL
1197    
1198     =cut
1199    
1200     our $TLS_CTX;
1201    
1202     sub TLS_CTX() {
1203     $TLS_CTX || do {
1204     require Net::SSLeay;
1205    
1206     Net::SSLeay::load_error_strings ();
1207     Net::SSLeay::SSLeay_add_ssl_algorithms ();
1208     Net::SSLeay::randomize ();
1209    
1210     $TLS_CTX = Net::SSLeay::CTX_new ();
1211    
1212     Net::SSLeay::CTX_set_options ($TLS_CTX, Net::SSLeay::OP_ALL ());
1213    
1214     $TLS_CTX
1215     }
1216     }
1217    
1218 elmex 1.1 =back
1219    
1220 root 1.38 =head1 SUBCLASSING AnyEvent::Handle
1221    
1222     In many cases, you might want to subclass AnyEvent::Handle.
1223    
1224     To make this easier, a given version of AnyEvent::Handle uses these
1225     conventions:
1226    
1227     =over 4
1228    
1229     =item * all constructor arguments become object members.
1230    
1231     At least initially, when you pass a C<tls>-argument to the constructor it
1232     will end up in C<< $handle->{tls} >>. Those members might be changes or
1233     mutated later on (for example C<tls> will hold the TLS connection object).
1234    
1235     =item * other object member names are prefixed with an C<_>.
1236    
1237     All object members not explicitly documented (internal use) are prefixed
1238     with an underscore character, so the remaining non-C<_>-namespace is free
1239     for use for subclasses.
1240    
1241     =item * all members not documented here and not prefixed with an underscore
1242     are free to use in subclasses.
1243    
1244     Of course, new versions of AnyEvent::Handle may introduce more "public"
1245     member variables, but thats just life, at least it is documented.
1246    
1247     =back
1248    
1249 elmex 1.1 =head1 AUTHOR
1250    
1251 root 1.8 Robin Redeker C<< <elmex at ta-sa.org> >>, Marc Lehmann <schmorp@schmorp.de>.
1252 elmex 1.1
1253     =cut
1254    
1255     1; # End of AnyEvent::Handle