… | |
… | |
28 | limited support. |
28 | limited support. |
29 | |
29 | |
30 | METHODS |
30 | METHODS |
31 | http_get $url, key => value..., $cb->($data, $headers) |
31 | http_get $url, key => value..., $cb->($data, $headers) |
32 | Executes an HTTP-GET request. See the http_request function for |
32 | Executes an HTTP-GET request. See the http_request function for |
33 | details on additional parameters. |
33 | details on additional parameters and the return value. |
34 | |
34 | |
35 | http_head $url, key => value..., $cb->($data, $headers) |
35 | http_head $url, key => value..., $cb->($data, $headers) |
36 | Executes an HTTP-HEAD request. See the http_request function for |
36 | Executes an HTTP-HEAD request. See the http_request function for |
37 | details on additional parameters. |
37 | details on additional parameters and the return value. |
38 | |
38 | |
39 | http_post $url, $body, key => value..., $cb->($data, $headers) |
39 | http_post $url, $body, key => value..., $cb->($data, $headers) |
40 | Executes an HTTP-POST request with a request body of $body. See the |
40 | Executes an HTTP-POST request with a request body of $body. See the |
41 | http_request function for details on additional parameters. |
41 | http_request function for details on additional parameters and the |
|
|
42 | return value. |
42 | |
43 | |
43 | http_request $method => $url, key => value..., $cb->($data, $headers) |
44 | http_request $method => $url, key => value..., $cb->($data, $headers) |
44 | Executes a HTTP request of type $method (e.g. "GET", "POST"). The |
45 | Executes a HTTP request of type $method (e.g. "GET", "POST"). The |
45 | URL must be an absolute http or https URL. |
46 | URL must be an absolute http or https URL. |
46 | |
47 | |
|
|
48 | When called in void context, nothing is returned. In other contexts, |
|
|
49 | "http_request" returns a "cancellation guard" - you have to keep the |
|
|
50 | object at least alive until the callback get called. If the object |
|
|
51 | gets destroyed before the callbakc is called, the request will be |
|
|
52 | cancelled. |
|
|
53 | |
47 | The callback will be called with the response data as first argument |
54 | The callback will be called with the response body data as first |
48 | (or "undef" if it wasn't available due to errors), and a hash-ref |
55 | argument (or "undef" if an error occured), and a hash-ref with |
49 | with response headers as second argument. |
56 | response headers as second argument. |
50 | |
57 | |
51 | All the headers in that hash are lowercased. In addition to the |
58 | All the headers in that hash are lowercased. In addition to the |
52 | response headers, the "pseudo-headers" "HTTPVersion", "Status" and |
59 | response headers, the "pseudo-headers" "HTTPVersion", "Status" and |
53 | "Reason" contain the three parts of the HTTP Status-Line of the same |
60 | "Reason" contain the three parts of the HTTP Status-Line of the same |
54 | name. The pseudo-header "URL" contains the original URL (which can |
61 | name. The pseudo-header "URL" contains the original URL (which can |
55 | differ from the requested URL when following redirects). |
62 | differ from the requested URL when following redirects). |
56 | |
63 | |
57 | If the server sends a header multiple lines, then their contents |
64 | If the server sends a header multiple times, then their contents |
58 | will be joined together with "\x00". |
65 | will be joined together with a comma (","), as per the HTTP spec. |
59 | |
66 | |
60 | If an internal error occurs, such as not being able to resolve a |
67 | If an internal error occurs, such as not being able to resolve a |
61 | hostname, then $data will be "undef", "$headers->{Status}" will be |
68 | hostname, then $data will be "undef", "$headers->{Status}" will be |
62 | 599 and the "Reason" pseudo-header will contain an error message. |
69 | "59x" (usually 599) and the "Reason" pseudo-header will contain an |
|
|
70 | error message. |
63 | |
71 | |
64 | A typical callback might look like this: |
72 | A typical callback might look like this: |
65 | |
73 | |
66 | sub { |
74 | sub { |
67 | my ($body, $hdr) = @_; |
75 | my ($body, $hdr) = @_; |
… | |
… | |
108 | loosely based on the original netscape specification. |
116 | loosely based on the original netscape specification. |
109 | |
117 | |
110 | The $hash_ref must be an (initially empty) hash reference which |
118 | The $hash_ref must be an (initially empty) hash reference which |
111 | will get updated automatically. It is possible to save the |
119 | will get updated automatically. It is possible to save the |
112 | cookie_jar to persistent storage with something like JSON or |
120 | cookie_jar to persistent storage with something like JSON or |
113 | Storable, but this is not recommended, as expire times are |
121 | Storable, but this is not recommended, as expiry times are |
114 | currently being ignored. |
122 | currently being ignored. |
115 | |
123 | |
116 | Note that this cookie implementation is not of very high |
124 | Note that this cookie implementation is not of very high |
117 | quality, nor meant to be complete. If you want complete cookie |
125 | quality, nor meant to be complete. If you want complete cookie |
118 | management you have to do that on your own. "cookie_jar" is |
126 | management you have to do that on your own. "cookie_jar" is |
119 | meant as a quick fix to get some cookie-using sites working. |
127 | meant as a quick fix to get some cookie-using sites working. |
120 | Cookies are a privacy disaster, do not use them unless required |
128 | Cookies are a privacy disaster, do not use them unless required |
121 | to. |
129 | to. |
|
|
130 | |
|
|
131 | tls_ctx => $scheme | $tls_ctx |
|
|
132 | Specifies the AnyEvent::TLS context to be used for https |
|
|
133 | connections. This parameter follows the same rules as the |
|
|
134 | "tls_ctx" parameter to AnyEvent::Handle, but additionally, the |
|
|
135 | two strings "low" or "high" can be specified, which give you a |
|
|
136 | predefined low-security (no verification, highest compatibility) |
|
|
137 | and high-security (CA and common-name verification) TLS context. |
|
|
138 | |
|
|
139 | The default for this option is "low", which could be interpreted |
|
|
140 | as "give me the page, no matter what". |
|
|
141 | |
|
|
142 | on_header => $callback->($headers) |
|
|
143 | When specified, this callback will be called with the header |
|
|
144 | hash as soon as headers have been successfully received from the |
|
|
145 | remote server (not on locally-generated errors). |
|
|
146 | |
|
|
147 | It has to return either true (in which case AnyEvent::HTTP will |
|
|
148 | continue), or false, in which case AnyEvent::HTTP will cancel |
|
|
149 | the download (and call the finish callback with an error code of |
|
|
150 | 598). |
|
|
151 | |
|
|
152 | This callback is useful, among other things, to quickly reject |
|
|
153 | unwanted content, which, if it is supposed to be rare, can be |
|
|
154 | faster than first doing a "HEAD" request. |
|
|
155 | |
|
|
156 | Example: cancel the request unless the content-type is |
|
|
157 | "text/html". |
|
|
158 | |
|
|
159 | on_header => sub { |
|
|
160 | $_[0]{"content-type"} =~ /^text\/html\s*(?:;|$)/ |
|
|
161 | }, |
|
|
162 | |
|
|
163 | on_body => $callback->($partial_body, $headers) |
|
|
164 | When specified, all body data will be passed to this callback |
|
|
165 | instead of to the completion callback. The completion callback |
|
|
166 | will get the empty string instead of the body data. |
|
|
167 | |
|
|
168 | It has to return either true (in which case AnyEvent::HTTP will |
|
|
169 | continue), or false, in which case AnyEvent::HTTP will cancel |
|
|
170 | the download (and call the completion callback with an error |
|
|
171 | code of 598). |
|
|
172 | |
|
|
173 | This callback is useful when the data is too large to be held in |
|
|
174 | memory (so the callback writes it to a file) or when only some |
|
|
175 | information should be extracted, or when the body should be |
|
|
176 | processed incrementally. |
|
|
177 | |
|
|
178 | It is usually preferred over doing your own body handling via |
|
|
179 | "want_body_handle". |
|
|
180 | |
|
|
181 | want_body_handle => $enable |
|
|
182 | When enabled (default is disabled), the behaviour of |
|
|
183 | AnyEvent::HTTP changes considerably: after parsing the headers, |
|
|
184 | and instead of downloading the body (if any), the completion |
|
|
185 | callback will be called. Instead of the $body argument |
|
|
186 | containing the body data, the callback will receive the |
|
|
187 | AnyEvent::Handle object associated with the connection. In error |
|
|
188 | cases, "undef" will be passed. When there is no body (e.g. |
|
|
189 | status 304), the empty string will be passed. |
|
|
190 | |
|
|
191 | The handle object might or might not be in TLS mode, might be |
|
|
192 | connected to a proxy, be a persistent connection etc., and |
|
|
193 | configured in unspecified ways. The user is responsible for this |
|
|
194 | handle (it will not be used by this module anymore). |
|
|
195 | |
|
|
196 | This is useful with some push-type services, where, after the |
|
|
197 | initial headers, an interactive protocol is used (typical |
|
|
198 | example would be the push-style twitter API which starts a |
|
|
199 | JSON/XML stream). |
|
|
200 | |
|
|
201 | If you think you need this, first have a look at "on_body", to |
|
|
202 | see if that doesn'T solve your problem in a better way. |
122 | |
203 | |
123 | Example: make a simple HTTP GET request for http://www.nethype.de/ |
204 | Example: make a simple HTTP GET request for http://www.nethype.de/ |
124 | |
205 | |
125 | http_request GET => "http://www.nethype.de/", sub { |
206 | http_request GET => "http://www.nethype.de/", sub { |
126 | my ($body, $hdr) = @_; |
207 | my ($body, $hdr) = @_; |
… | |
… | |
138 | use Data::Dumper; |
219 | use Data::Dumper; |
139 | print Dumper $hdr; |
220 | print Dumper $hdr; |
140 | } |
221 | } |
141 | ; |
222 | ; |
142 | |
223 | |
|
|
224 | Example: make another simple HTTP GET request, but immediately try |
|
|
225 | to cancel it. |
|
|
226 | |
|
|
227 | my $request = http_request GET => "http://www.nethype.de/", sub { |
|
|
228 | my ($body, $hdr) = @_; |
|
|
229 | print "$body\n"; |
|
|
230 | }; |
|
|
231 | |
|
|
232 | undef $request; |
|
|
233 | |
143 | GLOBAL FUNCTIONS AND VARIABLES |
234 | GLOBAL FUNCTIONS AND VARIABLES |
144 | AnyEvent::HTTP::set_proxy "proxy-url" |
235 | AnyEvent::HTTP::set_proxy "proxy-url" |
145 | Sets the default proxy server to use. The proxy-url must begin with |
236 | Sets the default proxy server to use. The proxy-url must begin with |
146 | a string of the form "http://host:port" (optionally "https:..."). |
237 | a string of the form "http://host:port" (optionally "https:..."). |
147 | |
238 | |
148 | $AnyEvent::HTTP::MAX_RECURSE |
239 | $AnyEvent::HTTP::MAX_RECURSE |
149 | The default value for the "recurse" request parameter (default: 10). |
240 | The default value for the "recurse" request parameter (default: 10). |
150 | |
241 | |
151 | $AnyEvent::HTTP::USERAGENT |
242 | $AnyEvent::HTTP::USERAGENT |
152 | The default value for the "User-Agent" header (the default is |
243 | The default value for the "User-Agent" header (the default is |
153 | "Mozilla/5.0 (compatible; AnyEvent::HTTP/$VERSION; |
244 | "Mozilla/5.0 (compatible; U; AnyEvent-HTTP/$VERSION; |
154 | +http://software.schmorp.de/pkg/AnyEvent)"). |
245 | +http://software.schmorp.de/pkg/AnyEvent)"). |
155 | |
246 | |
156 | $AnyEvent::HTTP::MAX_PERSISTENT |
247 | $AnyEvent::HTTP::MAX_PER_HOST |
157 | The maximum number of persistent connections to keep open (default: |
248 | The maximum number of concurrent conenctions to the same host |
158 | 8). |
249 | (identified by the hostname). If the limit is exceeded, then the |
|
|
250 | additional requests are queued until previous connections are |
|
|
251 | closed. |
159 | |
252 | |
160 | Not implemented currently. |
253 | The default value for this is 4, and it is highly advisable to not |
161 | |
254 | increase it. |
162 | $AnyEvent::HTTP::PERSISTENT_TIMEOUT |
|
|
163 | The maximum time to cache a persistent connection, in seconds |
|
|
164 | (default: 2). |
|
|
165 | |
|
|
166 | Not implemented currently. |
|
|
167 | |
255 | |
168 | $AnyEvent::HTTP::ACTIVE |
256 | $AnyEvent::HTTP::ACTIVE |
169 | The number of active connections. This is not the number of |
257 | The number of active connections. This is not the number of |
170 | currently running requests, but the number of currently open and |
258 | currently running requests, but the number of currently open and |
171 | non-idle TCP connections. This number of can be useful for |
259 | non-idle TCP connections. This number of can be useful for |
… | |
… | |
176 | |
264 | |
177 | AUTHOR |
265 | AUTHOR |
178 | Marc Lehmann <schmorp@schmorp.de> |
266 | Marc Lehmann <schmorp@schmorp.de> |
179 | http://home.schmorp.de/ |
267 | http://home.schmorp.de/ |
180 | |
268 | |
|
|
269 | With many thanks to Дмитрий Шалашов, who provided |
|
|
270 | countless testcases and bugreports. |
|
|
271 | |