ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/AnyEvent/lib/AnyEvent/Handle.pm
(Generate patch)

Comparing AnyEvent/lib/AnyEvent/Handle.pm (file contents):
Revision 1.7 by root, Thu May 1 16:35:40 2008 UTC vs.
Revision 1.58 by root, Wed Jun 4 22:51:15 2008 UTC

1package AnyEvent::Handle; 1package AnyEvent::Handle;
2 2
3no warnings; 3no warnings;
4use strict; 4use strict;
5 5
6use AnyEvent; 6use AnyEvent ();
7use IO::Handle; 7use AnyEvent::Util qw(WSAEWOULDBLOCK);
8use Scalar::Util ();
9use Carp ();
10use Fcntl ();
8use Errno qw/EAGAIN EINTR/; 11use Errno qw(EAGAIN EINTR);
9 12
10=head1 NAME 13=head1 NAME
11 14
12AnyEvent::Handle - non-blocking I/O on filehandles via AnyEvent 15AnyEvent::Handle - non-blocking I/O on file handles via AnyEvent
13 16
14=head1 VERSION
15
16Version 0.01
17
18=cut 17=cut
19 18
20our $VERSION = '0.01'; 19our $VERSION = 4.13;
21 20
22=head1 SYNOPSIS 21=head1 SYNOPSIS
23 22
24 use AnyEvent; 23 use AnyEvent;
25 use AnyEvent::Handle; 24 use AnyEvent::Handle;
26 25
27 my $cv = AnyEvent->condvar; 26 my $cv = AnyEvent->condvar;
28 27
29 my $ae_fh = AnyEvent::Handle->new (fh => \*STDIN); 28 my $handle =
30
31 $ae_fh->on_eof (sub { $cv->broadcast });
32
33 $ae_fh->readlines (sub {
34 my ($ae_fh, @lines) = @_;
35 for (@lines) {
36 chomp;
37 print "Line: $_";
38 }
39 });
40
41 # or use the constructor to pass the callback:
42
43 my $ae_fh2 =
44 AnyEvent::Handle->new ( 29 AnyEvent::Handle->new (
45 fh => \*STDIN, 30 fh => \*STDIN,
46 on_eof => sub { 31 on_eof => sub {
47 $cv->broadcast; 32 $cv->broadcast;
48 }, 33 },
34 );
35
36 # send some request line
37 $handle->push_write ("getinfo\015\012");
38
39 # read the response line
40 $handle->push_read (line => sub {
41 my ($handle, $line) = @_;
42 warn "read line <$line>\n";
43 $cv->send;
44 });
45
46 $cv->recv;
47
48=head1 DESCRIPTION
49
50This module is a helper module to make it easier to do event-based I/O on
51filehandles. For utility functions for doing non-blocking connects and accepts
52on sockets see L<AnyEvent::Util>.
53
54In the following, when the documentation refers to of "bytes" then this
55means characters. As sysread and syswrite are used for all I/O, their
56treatment of characters applies to this module as well.
57
58All callbacks will be invoked with the handle object as their first
59argument.
60
61=head1 METHODS
62
63=over 4
64
65=item B<new (%args)>
66
67The constructor supports these arguments (all as key => value pairs).
68
69=over 4
70
71=item fh => $filehandle [MANDATORY]
72
73The filehandle this L<AnyEvent::Handle> object will operate on.
74
75NOTE: The filehandle will be set to non-blocking (using
76AnyEvent::Util::fh_nonblocking).
77
78=item on_eof => $cb->($handle)
79
80Set the callback to be called when an end-of-file condition is detcted,
81i.e. in the case of a socket, when the other side has closed the
82connection cleanly.
83
84While not mandatory, it is highly recommended to set an eof callback,
85otherwise you might end up with a closed socket while you are still
86waiting for data.
87
88=item on_error => $cb->($handle, $fatal)
89
90This is the error callback, which is called when, well, some error
91occured, such as not being able to resolve the hostname, failure to
92connect or a read error.
93
94Some errors are fatal (which is indicated by C<$fatal> being true). On
95fatal errors the handle object will be shut down and will not be
96usable. Non-fatal errors can be retried by simply returning, but it is
97recommended to simply ignore this parameter and instead abondon the handle
98object when this callback is invoked.
99
100On callback entrance, the value of C<$!> contains the operating system
101error (or C<ENOSPC>, C<EPIPE>, C<ETIMEDOUT> or C<EBADMSG>).
102
103While not mandatory, it is I<highly> recommended to set this callback, as
104you will not be notified of errors otherwise. The default simply calls
105C<croak>.
106
107=item on_read => $cb->($handle)
108
109This sets the default read callback, which is called when data arrives
110and no read request is in the queue.
111
112To access (and remove data from) the read buffer, use the C<< ->rbuf >>
113method or access the C<$handle->{rbuf}> member directly.
114
115When an EOF condition is detected then AnyEvent::Handle will first try to
116feed all the remaining data to the queued callbacks and C<on_read> before
117calling the C<on_eof> callback. If no progress can be made, then a fatal
118error will be raised (with C<$!> set to C<EPIPE>).
119
120=item on_drain => $cb->($handle)
121
122This sets the callback that is called when the write buffer becomes empty
123(or when the callback is set and the buffer is empty already).
124
125To append to the write buffer, use the C<< ->push_write >> method.
126
127=item timeout => $fractional_seconds
128
129If non-zero, then this enables an "inactivity" timeout: whenever this many
130seconds pass without a successful read or write on the underlying file
131handle, the C<on_timeout> callback will be invoked (and if that one is
132missing, an C<ETIMEDOUT> error will be raised).
133
134Note that timeout processing is also active when you currently do not have
135any outstanding read or write requests: If you plan to keep the connection
136idle then you should disable the timout temporarily or ignore the timeout
137in the C<on_timeout> callback.
138
139Zero (the default) disables this timeout.
140
141=item on_timeout => $cb->($handle)
142
143Called whenever the inactivity timeout passes. If you return from this
144callback, then the timeout will be reset as if some activity had happened,
145so this condition is not fatal in any way.
146
147=item rbuf_max => <bytes>
148
149If defined, then a fatal error will be raised (with C<$!> set to C<ENOSPC>)
150when the read buffer ever (strictly) exceeds this size. This is useful to
151avoid denial-of-service attacks.
152
153For example, a server accepting connections from untrusted sources should
154be configured to accept only so-and-so much data that it cannot act on
155(for example, when expecting a line, an attacker could send an unlimited
156amount of data without a callback ever being called as long as the line
157isn't finished).
158
159=item read_size => <bytes>
160
161The default read block size (the amount of bytes this module will try to read
162during each (loop iteration). Default: C<8192>.
163
164=item low_water_mark => <bytes>
165
166Sets the amount of bytes (default: C<0>) that make up an "empty" write
167buffer: If the write reaches this size or gets even samller it is
168considered empty.
169
170=item tls => "accept" | "connect" | Net::SSLeay::SSL object
171
172When this parameter is given, it enables TLS (SSL) mode, that means it
173will start making tls handshake and will transparently encrypt/decrypt
174data.
175
176TLS mode requires Net::SSLeay to be installed (it will be loaded
177automatically when you try to create a TLS handle).
178
179For the TLS server side, use C<accept>, and for the TLS client side of a
180connection, use C<connect> mode.
181
182You can also provide your own TLS connection object, but you have
183to make sure that you call either C<Net::SSLeay::set_connect_state>
184or C<Net::SSLeay::set_accept_state> on it before you pass it to
185AnyEvent::Handle.
186
187See the C<starttls> method if you need to start TLs negotiation later.
188
189=item tls_ctx => $ssl_ctx
190
191Use the given Net::SSLeay::CTX object to create the new TLS connection
192(unless a connection object was specified directly). If this parameter is
193missing, then AnyEvent::Handle will use C<AnyEvent::Handle::TLS_CTX>.
194
195=item json => JSON or JSON::XS object
196
197This is the json coder object used by the C<json> read and write types.
198
199If you don't supply it, then AnyEvent::Handle will create and use a
200suitable one, which will write and expect UTF-8 encoded JSON texts.
201
202Note that you are responsible to depend on the JSON module if you want to
203use this functionality, as AnyEvent does not have a dependency itself.
204
205=item filter_r => $cb
206
207=item filter_w => $cb
208
209These exist, but are undocumented at this time.
210
211=back
212
213=cut
214
215sub new {
216 my $class = shift;
217
218 my $self = bless { @_ }, $class;
219
220 $self->{fh} or Carp::croak "mandatory argument fh is missing";
221
222 AnyEvent::Util::fh_nonblocking $self->{fh}, 1;
223
224 if ($self->{tls}) {
225 require Net::SSLeay;
226 $self->starttls (delete $self->{tls}, delete $self->{tls_ctx});
227 }
228
229 $self->{_activity} = AnyEvent->now;
230 $self->_timeout;
231
232 $self->on_drain (delete $self->{on_drain}) if $self->{on_drain};
233 $self->on_read (delete $self->{on_read} ) if $self->{on_read};
234
235 $self
236}
237
238sub _shutdown {
239 my ($self) = @_;
240
241 delete $self->{_tw};
242 delete $self->{_rw};
243 delete $self->{_ww};
244 delete $self->{fh};
245
246 $self->stoptls;
247}
248
249sub _error {
250 my ($self, $errno, $fatal) = @_;
251
252 $self->_shutdown
253 if $fatal;
254
255 $! = $errno;
256
257 if ($self->{on_error}) {
258 $self->{on_error}($self, $fatal);
259 } else {
260 Carp::croak "AnyEvent::Handle uncaught error: $!";
261 }
262}
263
264=item $fh = $handle->fh
265
266This method returns the file handle of the L<AnyEvent::Handle> object.
267
268=cut
269
270sub fh { $_[0]{fh} }
271
272=item $handle->on_error ($cb)
273
274Replace the current C<on_error> callback (see the C<on_error> constructor argument).
275
276=cut
277
278sub on_error {
279 $_[0]{on_error} = $_[1];
280}
281
282=item $handle->on_eof ($cb)
283
284Replace the current C<on_eof> callback (see the C<on_eof> constructor argument).
285
286=cut
287
288sub on_eof {
289 $_[0]{on_eof} = $_[1];
290}
291
292=item $handle->on_timeout ($cb)
293
294Replace the current C<on_timeout> callback, or disables the callback
295(but not the timeout) if C<$cb> = C<undef>. See C<timeout> constructor
296argument.
297
298=cut
299
300sub on_timeout {
301 $_[0]{on_timeout} = $_[1];
302}
303
304#############################################################################
305
306=item $handle->timeout ($seconds)
307
308Configures (or disables) the inactivity timeout.
309
310=cut
311
312sub timeout {
313 my ($self, $timeout) = @_;
314
315 $self->{timeout} = $timeout;
316 $self->_timeout;
317}
318
319# reset the timeout watcher, as neccessary
320# also check for time-outs
321sub _timeout {
322 my ($self) = @_;
323
324 if ($self->{timeout}) {
325 my $NOW = AnyEvent->now;
326
327 # when would the timeout trigger?
328 my $after = $self->{_activity} + $self->{timeout} - $NOW;
329
330 # now or in the past already?
331 if ($after <= 0) {
332 $self->{_activity} = $NOW;
333
334 if ($self->{on_timeout}) {
335 $self->{on_timeout}($self);
336 } else {
337 $self->_error (&Errno::ETIMEDOUT);
338 }
339
340 # callback could have changed timeout value, optimise
341 return unless $self->{timeout};
342
343 # calculate new after
344 $after = $self->{timeout};
345 }
346
347 Scalar::Util::weaken $self;
348 return unless $self; # ->error could have destroyed $self
349
350 $self->{_tw} ||= AnyEvent->timer (after => $after, cb => sub {
351 delete $self->{_tw};
352 $self->_timeout;
353 });
354 } else {
355 delete $self->{_tw};
356 }
357}
358
359#############################################################################
360
361=back
362
363=head2 WRITE QUEUE
364
365AnyEvent::Handle manages two queues per handle, one for writing and one
366for reading.
367
368The write queue is very simple: you can add data to its end, and
369AnyEvent::Handle will automatically try to get rid of it for you.
370
371When data could be written and the write buffer is shorter then the low
372water mark, the C<on_drain> callback will be invoked.
373
374=over 4
375
376=item $handle->on_drain ($cb)
377
378Sets the C<on_drain> callback or clears it (see the description of
379C<on_drain> in the constructor).
380
381=cut
382
383sub on_drain {
384 my ($self, $cb) = @_;
385
386 $self->{on_drain} = $cb;
387
388 $cb->($self)
389 if $cb && $self->{low_water_mark} >= length $self->{wbuf};
390}
391
392=item $handle->push_write ($data)
393
394Queues the given scalar to be written. You can push as much data as you
395want (only limited by the available memory), as C<AnyEvent::Handle>
396buffers it independently of the kernel.
397
398=cut
399
400sub _drain_wbuf {
401 my ($self) = @_;
402
403 if (!$self->{_ww} && length $self->{wbuf}) {
404
405 Scalar::Util::weaken $self;
406
407 my $cb = sub {
408 my $len = syswrite $self->{fh}, $self->{wbuf};
409
410 if ($len >= 0) {
411 substr $self->{wbuf}, 0, $len, "";
412
413 $self->{_activity} = AnyEvent->now;
414
415 $self->{on_drain}($self)
416 if $self->{low_water_mark} >= length $self->{wbuf}
417 && $self->{on_drain};
418
419 delete $self->{_ww} unless length $self->{wbuf};
420 } elsif ($! != EAGAIN && $! != EINTR && $! != WSAEWOULDBLOCK) {
421 $self->_error ($!, 1);
422 }
423 };
424
425 # try to write data immediately
426 $cb->();
427
428 # if still data left in wbuf, we need to poll
429 $self->{_ww} = AnyEvent->io (fh => $self->{fh}, poll => "w", cb => $cb)
430 if length $self->{wbuf};
431 };
432}
433
434our %WH;
435
436sub register_write_type($$) {
437 $WH{$_[0]} = $_[1];
438}
439
440sub push_write {
441 my $self = shift;
442
443 if (@_ > 1) {
444 my $type = shift;
445
446 @_ = ($WH{$type} or Carp::croak "unsupported type passed to AnyEvent::Handle::push_write")
447 ->($self, @_);
448 }
449
450 if ($self->{filter_w}) {
451 $self->{filter_w}($self, \$_[0]);
452 } else {
453 $self->{wbuf} .= $_[0];
454 $self->_drain_wbuf;
455 }
456}
457
458=item $handle->push_write (type => @args)
459
460Instead of formatting your data yourself, you can also let this module do
461the job by specifying a type and type-specific arguments.
462
463Predefined types are (if you have ideas for additional types, feel free to
464drop by and tell us):
465
466=over 4
467
468=item netstring => $string
469
470Formats the given value as netstring
471(http://cr.yp.to/proto/netstrings.txt, this is not a recommendation to use them).
472
473=cut
474
475register_write_type netstring => sub {
476 my ($self, $string) = @_;
477
478 sprintf "%d:%s,", (length $string), $string
479};
480
481=item json => $array_or_hashref
482
483Encodes the given hash or array reference into a JSON object. Unless you
484provide your own JSON object, this means it will be encoded to JSON text
485in UTF-8.
486
487JSON objects (and arrays) are self-delimiting, so you can write JSON at
488one end of a handle and read them at the other end without using any
489additional framing.
490
491The generated JSON text is guaranteed not to contain any newlines: While
492this module doesn't need delimiters after or between JSON texts to be
493able to read them, many other languages depend on that.
494
495A simple RPC protocol that interoperates easily with others is to send
496JSON arrays (or objects, although arrays are usually the better choice as
497they mimic how function argument passing works) and a newline after each
498JSON text:
499
500 $handle->push_write (json => ["method", "arg1", "arg2"]); # whatever
501 $handle->push_write ("\012");
502
503An AnyEvent::Handle receiver would simply use the C<json> read type and
504rely on the fact that the newline will be skipped as leading whitespace:
505
506 $handle->push_read (json => sub { my $array = $_[1]; ... });
507
508Other languages could read single lines terminated by a newline and pass
509this line into their JSON decoder of choice.
510
511=cut
512
513register_write_type json => sub {
514 my ($self, $ref) = @_;
515
516 require JSON;
517
518 $self->{json} ? $self->{json}->encode ($ref)
519 : JSON::encode_json ($ref)
520};
521
522=back
523
524=item AnyEvent::Handle::register_write_type type => $coderef->($handle, @args)
525
526This function (not method) lets you add your own types to C<push_write>.
527Whenever the given C<type> is used, C<push_write> will invoke the code
528reference with the handle object and the remaining arguments.
529
530The code reference is supposed to return a single octet string that will
531be appended to the write buffer.
532
533Note that this is a function, and all types registered this way will be
534global, so try to use unique names.
535
536=cut
537
538#############################################################################
539
540=back
541
542=head2 READ QUEUE
543
544AnyEvent::Handle manages two queues per handle, one for writing and one
545for reading.
546
547The read queue is more complex than the write queue. It can be used in two
548ways, the "simple" way, using only C<on_read> and the "complex" way, using
549a queue.
550
551In the simple case, you just install an C<on_read> callback and whenever
552new data arrives, it will be called. You can then remove some data (if
553enough is there) from the read buffer (C<< $handle->rbuf >>) if you want
554or not.
555
556In the more complex case, you want to queue multiple callbacks. In this
557case, AnyEvent::Handle will call the first queued callback each time new
558data arrives and removes it when it has done its job (see C<push_read>,
559below).
560
561This way you can, for example, push three line-reads, followed by reading
562a chunk of data, and AnyEvent::Handle will execute them in order.
563
564Example 1: EPP protocol parser. EPP sends 4 byte length info, followed by
565the specified number of bytes which give an XML datagram.
566
567 # in the default state, expect some header bytes
568 $handle->on_read (sub {
569 # some data is here, now queue the length-header-read (4 octets)
570 shift->unshift_read (chunk => 4, sub {
571 # header arrived, decode
572 my $len = unpack "N", $_[1];
573
574 # now read the payload
575 shift->unshift_read (chunk => $len, sub {
576 my $xml = $_[1];
577 # handle xml
578 });
579 });
580 });
581
582Example 2: Implement a client for a protocol that replies either with
583"OK" and another line or "ERROR" for one request, and 64 bytes for the
584second request. Due tot he availability of a full queue, we can just
585pipeline sending both requests and manipulate the queue as necessary in
586the callbacks:
587
588 # request one
589 $handle->push_write ("request 1\015\012");
590
591 # we expect "ERROR" or "OK" as response, so push a line read
592 $handle->push_read (line => sub {
593 # if we got an "OK", we have to _prepend_ another line,
594 # so it will be read before the second request reads its 64 bytes
595 # which are already in the queue when this callback is called
596 # we don't do this in case we got an error
597 if ($_[1] eq "OK") {
49 on_readline => sub { 598 $_[0]->unshift_read (line => sub {
50 my ($ae_fh, @lines) = @_; 599 my $response = $_[1];
51 for (@lines) { 600 ...
52 chomp; 601 });
53 print "Line: $_"; 602 }
603 });
604
605 # request two
606 $handle->push_write ("request 2\015\012");
607
608 # simply read 64 bytes, always
609 $handle->push_read (chunk => 64, sub {
610 my $response = $_[1];
611 ...
612 });
613
614=over 4
615
616=cut
617
618sub _drain_rbuf {
619 my ($self) = @_;
620
621 if (
622 defined $self->{rbuf_max}
623 && $self->{rbuf_max} < length $self->{rbuf}
624 ) {
625 return $self->_error (&Errno::ENOSPC, 1);
626 }
627
628 return if $self->{in_drain};
629 local $self->{in_drain} = 1;
630
631 while (my $len = length $self->{rbuf}) {
632 no strict 'refs';
633 if (my $cb = shift @{ $self->{_queue} }) {
634 unless ($cb->($self)) {
635 if ($self->{_eof}) {
636 # no progress can be made (not enough data and no data forthcoming)
637 return $self->_error (&Errno::EPIPE, 1);
54 } 638 }
639
640 unshift @{ $self->{_queue} }, $cb;
641 last;
55 } 642 }
56 );
57
58 $cv->wait;
59
60=head1 DESCRIPTION
61
62This module is a helper module to make it easier to do non-blocking I/O
63on filehandles (and sockets, see L<AnyEvent::Socket>).
64
65The event loop is provided by L<AnyEvent>.
66
67=head1 METHODS
68
69=over 4
70
71=item B<new (%args)>
72
73The constructor has these arguments:
74
75=over 4
76
77=item fh => $filehandle
78
79The filehandle this L<AnyEvent::Handle> object will operate on.
80
81NOTE: The filehandle will be set to non-blocking.
82
83=item read_block_size => $size
84
85The default read block size use for reads via the C<on_read>
86method.
87
88=item on_read => $cb
89
90=item on_eof => $cb
91
92=item on_error => $cb
93
94These are shortcuts, that will call the corresponding method and set the callback to C<$cb>.
95
96=item on_readline => $cb
97
98The C<readlines> method is called with the default separated and C<$cb> as callback
99for you.
100
101=back
102
103=cut
104
105sub new {
106 my $this = shift;
107 my $class = ref($this) || $this;
108 my $self = {
109 read_block_size => 4096,
110 rbuf => '',
111 @_
112 };
113 bless $self, $class;
114
115 $self->{fh}->blocking (0) if $self->{fh};
116
117 if ($self->{on_read}) {
118 $self->on_read ($self->{on_read});
119
120 } elsif ($self->{on_readline}) { 643 } elsif ($self->{on_read}) {
121 $self->readlines ($self->{on_readline}); 644 $self->{on_read}($self);
122 645
123 } elsif ($self->{on_eof}) { 646 if (
647 $len == length $self->{rbuf} # if no data has been consumed
648 && !@{ $self->{_queue} } # and the queue is still empty
649 && $self->{on_read} # but we still have on_read
650 ) {
651 # no further data will arrive
652 # so no progress can be made
653 return $self->_error (&Errno::EPIPE, 1)
654 if $self->{_eof};
655
656 last; # more data might arrive
657 }
658 } else {
659 # read side becomes idle
660 delete $self->{_rw};
661 last;
662 }
663 }
664
665 $self->{on_eof}($self)
124 $self->on_eof ($self->{on_eof}); 666 if $self->{_eof} && $self->{on_eof};
125 667
126 } elsif ($self->{on_error}) { 668 # may need to restart read watcher
127 $self->on_eof ($self->{on_error}); 669 unless ($self->{_rw}) {
670 $self->start_read
671 if $self->{on_read} || @{ $self->{_queue} };
128 } 672 }
129
130 return $self
131} 673}
132 674
133=item B<fh> 675=item $handle->on_read ($cb)
134 676
135This method returns the filehandle of the L<AnyEvent::Handle> object. 677This replaces the currently set C<on_read> callback, or clears it (when
136 678the new callback is C<undef>). See the description of C<on_read> in the
137=cut 679constructor.
138
139sub fh { $_[0]->{fh} }
140
141=item B<on_read ($callback)>
142
143This method installs a C<$callback> that will be called
144when new data arrived. You can access the read buffer via the C<rbuf>
145method (see below).
146
147The first argument of the C<$callback> will be the L<AnyEvent::Handle> object.
148 680
149=cut 681=cut
150 682
151sub on_read { 683sub on_read {
152 my ($self, $cb) = @_; 684 my ($self, $cb) = @_;
685
153 $self->{on_read} = $cb; 686 $self->{on_read} = $cb;
687 $self->_drain_rbuf if $cb;
688}
154 689
155 unless (defined $self->{on_read}) { 690=item $handle->rbuf
156 delete $self->{on_read_w}; 691
157 return; 692Returns the read buffer (as a modifiable lvalue).
693
694You can access the read buffer directly as the C<< ->{rbuf} >> member, if
695you want.
696
697NOTE: The read buffer should only be used or modified if the C<on_read>,
698C<push_read> or C<unshift_read> methods are used. The other read methods
699automatically manage the read buffer.
700
701=cut
702
703sub rbuf : lvalue {
704 $_[0]{rbuf}
705}
706
707=item $handle->push_read ($cb)
708
709=item $handle->unshift_read ($cb)
710
711Append the given callback to the end of the queue (C<push_read>) or
712prepend it (C<unshift_read>).
713
714The callback is called each time some additional read data arrives.
715
716It must check whether enough data is in the read buffer already.
717
718If not enough data is available, it must return the empty list or a false
719value, in which case it will be called repeatedly until enough data is
720available (or an error condition is detected).
721
722If enough data was available, then the callback must remove all data it is
723interested in (which can be none at all) and return a true value. After returning
724true, it will be removed from the queue.
725
726=cut
727
728our %RH;
729
730sub register_read_type($$) {
731 $RH{$_[0]} = $_[1];
732}
733
734sub push_read {
735 my $self = shift;
736 my $cb = pop;
737
738 if (@_) {
739 my $type = shift;
740
741 $cb = ($RH{$type} or Carp::croak "unsupported type passed to AnyEvent::Handle::push_read")
742 ->($self, $cb, @_);
743 }
744
745 push @{ $self->{_queue} }, $cb;
746 $self->_drain_rbuf;
747}
748
749sub unshift_read {
750 my $self = shift;
751 my $cb = pop;
752
753 if (@_) {
754 my $type = shift;
755
756 $cb = ($RH{$type} or Carp::croak "unsupported type passed to AnyEvent::Handle::unshift_read")
757 ->($self, $cb, @_);
758 }
759
760
761 unshift @{ $self->{_queue} }, $cb;
762 $self->_drain_rbuf;
763}
764
765=item $handle->push_read (type => @args, $cb)
766
767=item $handle->unshift_read (type => @args, $cb)
768
769Instead of providing a callback that parses the data itself you can chose
770between a number of predefined parsing formats, for chunks of data, lines
771etc.
772
773Predefined types are (if you have ideas for additional types, feel free to
774drop by and tell us):
775
776=over 4
777
778=item chunk => $octets, $cb->($handle, $data)
779
780Invoke the callback only once C<$octets> bytes have been read. Pass the
781data read to the callback. The callback will never be called with less
782data.
783
784Example: read 2 bytes.
785
786 $handle->push_read (chunk => 2, sub {
787 warn "yay ", unpack "H*", $_[1];
158 } 788 });
159 789
160 $self->{on_read_w} = 790=cut
161 AnyEvent->io (poll => 'r', fh => $self->{fh}, cb => sub { 791
162 #d# warn "READ:[$self->{read_size}] $self->{read_block_size} : ".length ($self->{rbuf})."\n"; 792register_read_type chunk => sub {
163 my $rbuf_len = length $self->{rbuf}; 793 my ($self, $cb, $len) = @_;
164 my $l; 794
165 if (defined $self->{read_size}) { 795 sub {
166 $l = sysread $self->{fh}, $self->{rbuf}, 796 $len <= length $_[0]{rbuf} or return;
167 ($self->{read_size} - $rbuf_len), $rbuf_len; 797 $cb->($_[0], substr $_[0]{rbuf}, 0, $len, "");
168 } else { 798 1
169 $l = sysread $self->{fh}, $self->{rbuf}, $self->{read_block_size}, $rbuf_len; 799 }
800};
801
802# compatibility with older API
803sub push_read_chunk {
804 $_[0]->push_read (chunk => $_[1], $_[2]);
805}
806
807sub unshift_read_chunk {
808 $_[0]->unshift_read (chunk => $_[1], $_[2]);
809}
810
811=item line => [$eol, ]$cb->($handle, $line, $eol)
812
813The callback will be called only once a full line (including the end of
814line marker, C<$eol>) has been read. This line (excluding the end of line
815marker) will be passed to the callback as second argument (C<$line>), and
816the end of line marker as the third argument (C<$eol>).
817
818The end of line marker, C<$eol>, can be either a string, in which case it
819will be interpreted as a fixed record end marker, or it can be a regex
820object (e.g. created by C<qr>), in which case it is interpreted as a
821regular expression.
822
823The end of line marker argument C<$eol> is optional, if it is missing (NOT
824undef), then C<qr|\015?\012|> is used (which is good for most internet
825protocols).
826
827Partial lines at the end of the stream will never be returned, as they are
828not marked by the end of line marker.
829
830=cut
831
832register_read_type line => sub {
833 my ($self, $cb, $eol) = @_;
834
835 $eol = qr|(\015?\012)| if @_ < 3;
836 $eol = quotemeta $eol unless ref $eol;
837 $eol = qr|^(.*?)($eol)|s;
838
839 sub {
840 $_[0]{rbuf} =~ s/$eol// or return;
841
842 $cb->($_[0], $1, $2);
843 1
844 }
845};
846
847# compatibility with older API
848sub push_read_line {
849 my $self = shift;
850 $self->push_read (line => @_);
851}
852
853sub unshift_read_line {
854 my $self = shift;
855 $self->unshift_read (line => @_);
856}
857
858=item netstring => $cb->($handle, $string)
859
860A netstring (http://cr.yp.to/proto/netstrings.txt, this is not an endorsement).
861
862Throws an error with C<$!> set to EBADMSG on format violations.
863
864=cut
865
866register_read_type netstring => sub {
867 my ($self, $cb) = @_;
868
869 sub {
870 unless ($_[0]{rbuf} =~ s/^(0|[1-9][0-9]*)://) {
871 if ($_[0]{rbuf} =~ /[^0-9]/) {
872 $self->_error (&Errno::EBADMSG);
170 } 873 }
171 #d# warn "READL $l [$self->{rbuf}]\n"; 874 return;
875 }
172 876
173 if (not defined $l) { 877 my $len = $1;
174 return if $! == EAGAIN || $! == EINTR;
175 $self->{on_error}->($self) if $self->{on_error};
176 delete $self->{on_read_w};
177 878
178 } elsif ($l == 0) { 879 $self->unshift_read (chunk => $len, sub {
179 $self->{on_eof}->($self) if $self->{on_eof}; 880 my $string = $_[1];
180 delete $self->{on_read_w}; 881 $_[0]->unshift_read (chunk => 1, sub {
181 882 if ($_[1] eq ",") {
883 $cb->($_[0], $string);
182 } else { 884 } else {
183 $self->{on_read}->($self); 885 $self->_error (&Errno::EBADMSG);
886 }
887 });
888 });
889
890 1
891 }
892};
893
894=item regex => $accept[, $reject[, $skip], $cb->($handle, $data)
895
896Makes a regex match against the regex object C<$accept> and returns
897everything up to and including the match.
898
899Example: read a single line terminated by '\n'.
900
901 $handle->push_read (regex => qr<\n>, sub { ... });
902
903If C<$reject> is given and not undef, then it determines when the data is
904to be rejected: it is matched against the data when the C<$accept> regex
905does not match and generates an C<EBADMSG> error when it matches. This is
906useful to quickly reject wrong data (to avoid waiting for a timeout or a
907receive buffer overflow).
908
909Example: expect a single decimal number followed by whitespace, reject
910anything else (not the use of an anchor).
911
912 $handle->push_read (regex => qr<^[0-9]+\s>, qr<[^0-9]>, sub { ... });
913
914If C<$skip> is given and not C<undef>, then it will be matched against
915the receive buffer when neither C<$accept> nor C<$reject> match,
916and everything preceding and including the match will be accepted
917unconditionally. This is useful to skip large amounts of data that you
918know cannot be matched, so that the C<$accept> or C<$reject> regex do not
919have to start matching from the beginning. This is purely an optimisation
920and is usually worth only when you expect more than a few kilobytes.
921
922Example: expect a http header, which ends at C<\015\012\015\012>. Since we
923expect the header to be very large (it isn't in practise, but...), we use
924a skip regex to skip initial portions. The skip regex is tricky in that
925it only accepts something not ending in either \015 or \012, as these are
926required for the accept regex.
927
928 $handle->push_read (regex =>
929 qr<\015\012\015\012>,
930 undef, # no reject
931 qr<^.*[^\015\012]>,
932 sub { ... });
933
934=cut
935
936register_read_type regex => sub {
937 my ($self, $cb, $accept, $reject, $skip) = @_;
938
939 my $data;
940 my $rbuf = \$self->{rbuf};
941
942 sub {
943 # accept
944 if ($$rbuf =~ $accept) {
945 $data .= substr $$rbuf, 0, $+[0], "";
946 $cb->($self, $data);
947 return 1;
948 }
949
950 # reject
951 if ($reject && $$rbuf =~ $reject) {
952 $self->_error (&Errno::EBADMSG);
953 }
954
955 # skip
956 if ($skip && $$rbuf =~ $skip) {
957 $data .= substr $$rbuf, 0, $+[0], "";
958 }
959
960 ()
961 }
962};
963
964=item json => $cb->($handle, $hash_or_arrayref)
965
966Reads a JSON object or array, decodes it and passes it to the callback.
967
968If a C<json> object was passed to the constructor, then that will be used
969for the final decode, otherwise it will create a JSON coder expecting UTF-8.
970
971This read type uses the incremental parser available with JSON version
9722.09 (and JSON::XS version 2.2) and above. You have to provide a
973dependency on your own: this module will load the JSON module, but
974AnyEvent does not depend on it itself.
975
976Since JSON texts are fully self-delimiting, the C<json> read and write
977types are an ideal simple RPC protocol: just exchange JSON datagrams. See
978the C<json> write type description, above, for an actual example.
979
980=cut
981
982register_read_type json => sub {
983 my ($self, $cb, $accept, $reject, $skip) = @_;
984
985 require JSON;
986
987 my $data;
988 my $rbuf = \$self->{rbuf};
989
990 my $json = $self->{json} ||= JSON->new->utf8;
991
992 sub {
993 my $ref = $json->incr_parse ($self->{rbuf});
994
995 if ($ref) {
996 $self->{rbuf} = $json->incr_text;
997 $json->incr_text = "";
998 $cb->($self, $ref);
999
1000 1
1001 } else {
1002 $self->{rbuf} = "";
1003 ()
1004 }
1005 }
1006};
1007
1008=back
1009
1010=item AnyEvent::Handle::register_read_type type => $coderef->($handle, $cb, @args)
1011
1012This function (not method) lets you add your own types to C<push_read>.
1013
1014Whenever the given C<type> is used, C<push_read> will invoke the code
1015reference with the handle object, the callback and the remaining
1016arguments.
1017
1018The code reference is supposed to return a callback (usually a closure)
1019that works as a plain read callback (see C<< ->push_read ($cb) >>).
1020
1021It should invoke the passed callback when it is done reading (remember to
1022pass C<$handle> as first argument as all other callbacks do that).
1023
1024Note that this is a function, and all types registered this way will be
1025global, so try to use unique names.
1026
1027For examples, see the source of this module (F<perldoc -m AnyEvent::Handle>,
1028search for C<register_read_type>)).
1029
1030=item $handle->stop_read
1031
1032=item $handle->start_read
1033
1034In rare cases you actually do not want to read anything from the
1035socket. In this case you can call C<stop_read>. Neither C<on_read> nor
1036any queued callbacks will be executed then. To start reading again, call
1037C<start_read>.
1038
1039Note that AnyEvent::Handle will automatically C<start_read> for you when
1040you change the C<on_read> callback or push/unshift a read callback, and it
1041will automatically C<stop_read> for you when neither C<on_read> is set nor
1042there are any read requests in the queue.
1043
1044=cut
1045
1046sub stop_read {
1047 my ($self) = @_;
1048
1049 delete $self->{_rw};
1050}
1051
1052sub start_read {
1053 my ($self) = @_;
1054
1055 unless ($self->{_rw} || $self->{_eof}) {
1056 Scalar::Util::weaken $self;
1057
1058 $self->{_rw} = AnyEvent->io (fh => $self->{fh}, poll => "r", cb => sub {
1059 my $rbuf = $self->{filter_r} ? \my $buf : \$self->{rbuf};
1060 my $len = sysread $self->{fh}, $$rbuf, $self->{read_size} || 8192, length $$rbuf;
1061
1062 if ($len > 0) {
1063 $self->{_activity} = AnyEvent->now;
1064
1065 $self->{filter_r}
1066 ? $self->{filter_r}($self, $rbuf)
1067 : $self->_drain_rbuf;
1068
1069 } elsif (defined $len) {
1070 delete $self->{_rw};
1071 $self->{_eof} = 1;
1072 $self->_drain_rbuf;
1073
1074 } elsif ($! != EAGAIN && $! != EINTR && $! != WSAEWOULDBLOCK) {
1075 return $self->_error ($!, 1);
184 } 1076 }
185 }); 1077 });
1078 }
186} 1079}
187 1080
188=item B<on_error ($callback)> 1081sub _dotls {
189
190Whenever a read or write operation resulted in an error the C<$callback>
191will be called.
192
193The first argument of C<$callback> will be the L<AnyEvent::Handle> object itself.
194The error is given as errno in C<$!>.
195
196=cut
197
198sub on_error {
199 $_[0]->{on_error} = $_[1];
200}
201
202=item B<on_eof ($callback)>
203
204Installs the C<$callback> that will be called when the end of file is
205encountered in a read operation this C<$callback> will be called. The first
206argument will be the L<AnyEvent::Handle> object itself.
207
208=cut
209
210sub on_eof {
211 $_[0]->{on_eof} = $_[1];
212}
213
214=item B<rbuf>
215
216Returns a reference to the read buffer.
217
218NOTE: The read buffer should only be used or modified if the C<on_read>
219method is used directly. The C<read> and C<readlines> methods will provide
220the read data to their callbacks.
221
222=cut
223
224sub rbuf : lvalue {
225 $_[0]->{rbuf}
226}
227
228=item B<read ($len, $callback)>
229
230Will read exactly C<$len> bytes from the filehandle and call the C<$callback>
231if done so. The first argument to the C<$callback> will be the L<AnyEvent::Handle>
232object itself and the second argument the read data.
233
234NOTE: This method will override any callbacks installed via the C<on_read> method.
235
236=cut
237
238sub read {
239 my ($self, $len, $cb) = @_; 1082 my ($self) = @_;
240 1083
241 $self->{read_cb} = $cb; 1084 my $buf;
242 my $old_blk_size = $self->{read_block_size};
243 $self->{read_block_size} = $len;
244 1085
245 $self->on_read (sub { 1086 if (length $self->{_tls_wbuf}) {
246 #d# warn "OFOFO $len || ".length($_[0]->{rbuf})."||\n"; 1087 while ((my $len = Net::SSLeay::write ($self->{tls}, $self->{_tls_wbuf})) > 0) {
247 1088 substr $self->{_tls_wbuf}, 0, $len, "";
248 if ($len == length $_[0]->{rbuf}) {
249 $_[0]->{read_block_size} = $old_blk_size;
250 $_[0]->on_read (undef);
251 $_[0]->{read_cb}->($_[0], (substr $self->{rbuf}, 0, $len, ''));
252 } 1089 }
253 }); 1090 }
254}
255 1091
256=item B<readlines ($callback)> 1092 if (length ($buf = Net::SSLeay::BIO_read ($self->{_wbio}))) {
257 1093 $self->{wbuf} .= $buf;
258=item B<readlines ($sep, $callback)> 1094 $self->_drain_wbuf;
259
260This method will read lines from the filehandle, separated by C<$sep> or C<"\n">
261if C<$sep> is not provided. C<$sep> will be used as "line" separated.
262
263The C<$callback> will be called when at least one
264line could be read. The first argument to the C<$callback> will be the L<AnyEvent::Handle>
265object itself and the rest of the arguments will be the read lines.
266
267NOTE: This method will override any callbacks installed via the C<on_read> method.
268
269=cut
270
271sub readlines {
272 my ($self, $sep, $cb) = @_;
273
274 if (ref $sep) {
275 $cb = $sep;
276 $sep = "\n";
277
278 } elsif (not defined $sep) {
279 $sep = "\n";
280 } 1095 }
281 1096
282 my $sep_len = length $sep; 1097 while (defined ($buf = Net::SSLeay::read ($self->{tls}))) {
283 1098 if (length $buf) {
284 $self->{on_readline} = $cb; 1099 $self->{rbuf} .= $buf;
285 1100 $self->_drain_rbuf;
286 $self->on_read (sub { 1101 } else {
287 my @lines; 1102 # let's treat SSL-eof as we treat normal EOF
288 my $rb = \$_[0]->{rbuf}; 1103 $self->{_eof} = 1;
289 my $pos; 1104 $self->_shutdown;
290 while (($pos = index ($$rb, $sep)) >= 0) { 1105 return;
291 push @lines, substr $$rb, 0, $pos + $sep_len, '';
292 } 1106 }
293 $self->{on_readline}->($_[0], @lines); 1107 }
1108
1109 my $err = Net::SSLeay::get_error ($self->{tls}, -1);
1110
1111 if ($err!= Net::SSLeay::ERROR_WANT_READ ()) {
1112 if ($err == Net::SSLeay::ERROR_SYSCALL ()) {
1113 return $self->_error ($!, 1);
1114 } elsif ($err == Net::SSLeay::ERROR_SSL ()) {
1115 return $self->_error (&Errno::EIO, 1);
1116 }
1117
1118 # all others are fine for our purposes
1119 }
1120}
1121
1122=item $handle->starttls ($tls[, $tls_ctx])
1123
1124Instead of starting TLS negotiation immediately when the AnyEvent::Handle
1125object is created, you can also do that at a later time by calling
1126C<starttls>.
1127
1128The first argument is the same as the C<tls> constructor argument (either
1129C<"connect">, C<"accept"> or an existing Net::SSLeay object).
1130
1131The second argument is the optional C<Net::SSLeay::CTX> object that is
1132used when AnyEvent::Handle has to create its own TLS connection object.
1133
1134The TLS connection object will end up in C<< $handle->{tls} >> after this
1135call and can be used or changed to your liking. Note that the handshake
1136might have already started when this function returns.
1137
1138=cut
1139
1140sub starttls {
1141 my ($self, $ssl, $ctx) = @_;
1142
1143 $self->stoptls;
1144
1145 if ($ssl eq "accept") {
1146 $ssl = Net::SSLeay::new ($ctx || TLS_CTX ());
1147 Net::SSLeay::set_accept_state ($ssl);
1148 } elsif ($ssl eq "connect") {
1149 $ssl = Net::SSLeay::new ($ctx || TLS_CTX ());
1150 Net::SSLeay::set_connect_state ($ssl);
1151 }
1152
1153 $self->{tls} = $ssl;
1154
1155 # basically, this is deep magic (because SSL_read should have the same issues)
1156 # but the openssl maintainers basically said: "trust us, it just works".
1157 # (unfortunately, we have to hardcode constants because the abysmally misdesigned
1158 # and mismaintained ssleay-module doesn't even offer them).
1159 # http://www.mail-archive.com/openssl-dev@openssl.org/msg22420.html
1160 Net::SSLeay::CTX_set_mode ($self->{tls},
1161 (eval { local $SIG{__DIE__}; Net::SSLeay::MODE_ENABLE_PARTIAL_WRITE () } || 1)
1162 | (eval { local $SIG{__DIE__}; Net::SSLeay::MODE_ACCEPT_MOVING_WRITE_BUFFER () } || 2));
1163
1164 $self->{_rbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ());
1165 $self->{_wbio} = Net::SSLeay::BIO_new (Net::SSLeay::BIO_s_mem ());
1166
1167 Net::SSLeay::set_bio ($ssl, $self->{_rbio}, $self->{_wbio});
1168
1169 $self->{filter_w} = sub {
1170 $_[0]{_tls_wbuf} .= ${$_[1]};
1171 &_dotls;
294 }); 1172 };
1173 $self->{filter_r} = sub {
1174 Net::SSLeay::BIO_write ($_[0]{_rbio}, ${$_[1]});
1175 &_dotls;
1176 };
295} 1177}
296 1178
297=item B<write ($data)> 1179=item $handle->stoptls
298 1180
299=item B<write ($callback)> 1181Destroys the SSL connection, if any. Partial read or write data will be
1182lost.
300 1183
301=item B<write ($data, $callback)>
302
303This method will write C<$data> to the filehandle and call the C<$callback>
304afterwards. If only C<$callback> is provided it will be called when the
305write buffer becomes empty the next time (or immediately if it already is empty).
306
307=cut 1184=cut
308 1185
309sub write { 1186sub stoptls {
310 my ($self, $data, $cb) = @_;
311 if (ref $data) { $cb = $data; undef $data }
312 push @{$self->{write_bufs}}, [$data, $cb];
313 $self->_check_writer;
314}
315
316sub _check_writer {
317 my ($self) = @_; 1187 my ($self) = @_;
318 1188
319 if ($self->{write_w}) { 1189 Net::SSLeay::free (delete $self->{tls}) if $self->{tls};
320 unless ($self->{write_cb}) {
321 while (@{$self->{write_bufs}} && not defined $self->{write_bufs}->[0]->[1]) {
322 my $wba = shift @{$self->{write_bufs}};
323 $self->{wbuf} .= $wba->[0];
324 }
325 }
326 return;
327 }
328 1190
329 my $wba = shift @{$self->{write_bufs}} 1191 delete $self->{_rbio};
330 or return; 1192 delete $self->{_wbio};
331 1193 delete $self->{_tls_wbuf};
332 unless (defined $wba->[0]) { 1194 delete $self->{filter_r};
333 $wba->[1]->($self) if $wba->[1];
334 $self->_check_writer;
335 return;
336 }
337
338 $self->{wbuf} = $wba->[0];
339 $self->{write_cb} = $wba->[1];
340
341 $self->{write_w} =
342 AnyEvent->io (poll => 'w', fh => $self->{fh}, cb => sub {
343 my $l = syswrite $self->{fh}, $self->{wbuf}, length $self->{wbuf};
344
345 if (not defined $l) {
346 return if $! == EAGAIN || $! == EINTR;
347 delete $self->{write_w}; 1195 delete $self->{filter_w};
348 $self->{on_error}->($self) if $self->{on_error}; 1196}
349 1197
350 } else { 1198sub DESTROY {
351 substr $self->{wbuf}, 0, $l, ''; 1199 my $self = shift;
352 1200
353 if (length ($self->{wbuf}) == 0) { 1201 $self->stoptls;
354 $self->{write_cb}->($self) if $self->{write_cb}; 1202}
355 1203
356 delete $self->{write_w}; 1204=item AnyEvent::Handle::TLS_CTX
357 delete $self->{wbuf};
358 delete $self->{write_cb};
359 1205
360 $self->_check_writer; 1206This function creates and returns the Net::SSLeay::CTX object used by
361 } 1207default for TLS mode.
362 } 1208
363 }); 1209The context is created like this:
1210
1211 Net::SSLeay::load_error_strings;
1212 Net::SSLeay::SSLeay_add_ssl_algorithms;
1213 Net::SSLeay::randomize;
1214
1215 my $CTX = Net::SSLeay::CTX_new;
1216
1217 Net::SSLeay::CTX_set_options $CTX, Net::SSLeay::OP_ALL
1218
1219=cut
1220
1221our $TLS_CTX;
1222
1223sub TLS_CTX() {
1224 $TLS_CTX || do {
1225 require Net::SSLeay;
1226
1227 Net::SSLeay::load_error_strings ();
1228 Net::SSLeay::SSLeay_add_ssl_algorithms ();
1229 Net::SSLeay::randomize ();
1230
1231 $TLS_CTX = Net::SSLeay::CTX_new ();
1232
1233 Net::SSLeay::CTX_set_options ($TLS_CTX, Net::SSLeay::OP_ALL ());
1234
1235 $TLS_CTX
1236 }
364} 1237}
365 1238
366=back 1239=back
367 1240
1241=head1 SUBCLASSING AnyEvent::Handle
1242
1243In many cases, you might want to subclass AnyEvent::Handle.
1244
1245To make this easier, a given version of AnyEvent::Handle uses these
1246conventions:
1247
1248=over 4
1249
1250=item * all constructor arguments become object members.
1251
1252At least initially, when you pass a C<tls>-argument to the constructor it
1253will end up in C<< $handle->{tls} >>. Those members might be changes or
1254mutated later on (for example C<tls> will hold the TLS connection object).
1255
1256=item * other object member names are prefixed with an C<_>.
1257
1258All object members not explicitly documented (internal use) are prefixed
1259with an underscore character, so the remaining non-C<_>-namespace is free
1260for use for subclasses.
1261
1262=item * all members not documented here and not prefixed with an underscore
1263are free to use in subclasses.
1264
1265Of course, new versions of AnyEvent::Handle may introduce more "public"
1266member variables, but thats just life, at least it is documented.
1267
1268=back
1269
368=head1 AUTHOR 1270=head1 AUTHOR
369 1271
370Robin Redeker, C<< <elmex at ta-sa.org> >> 1272Robin Redeker C<< <elmex at ta-sa.org> >>, Marc Lehmann <schmorp@schmorp.de>.
371 1273
372=cut 1274=cut
373 1275
3741; # End of AnyEvent::Handle 12761; # End of AnyEvent::Handle

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines