ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/JSON-XS/XS.pm
Revision: 1.41
Committed: Mon Jun 11 03:45:26 2007 UTC (16 years, 11 months ago) by root
Branch: MAIN
CVS Tags: rel-1_24
Changes since 1.40: +6 -5 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3     JSON::XS - JSON serialising/deserialising, done correctly and fast
4    
5     =head1 SYNOPSIS
6    
7     use JSON::XS;
8    
9 root 1.22 # exported functions, they croak on error
10     # and expect/generate UTF-8
11 root 1.12
12     $utf8_encoded_json_text = to_json $perl_hash_or_arrayref;
13     $perl_hash_or_arrayref = from_json $utf8_encoded_json_text;
14    
15 root 1.22 # objToJson and jsonToObj aliases to to_json and from_json
16     # are exported for compatibility to the JSON module,
17     # but should not be used in new code.
18 root 1.21
19 root 1.22 # OO-interface
20 root 1.12
21     $coder = JSON::XS->new->ascii->pretty->allow_nonref;
22     $pretty_printed_unencoded = $coder->encode ($perl_scalar);
23     $perl_scalar = $coder->decode ($unicode_json_text);
24    
25 root 1.1 =head1 DESCRIPTION
26    
27 root 1.2 This module converts Perl data structures to JSON and vice versa. Its
28     primary goal is to be I<correct> and its secondary goal is to be
29     I<fast>. To reach the latter goal it was written in C.
30    
31     As this is the n-th-something JSON module on CPAN, what was the reason
32     to write yet another JSON module? While it seems there are many JSON
33     modules, none of them correctly handle all corner cases, and in most cases
34     their maintainers are unresponsive, gone missing, or not listening to bug
35     reports for other reasons.
36    
37     See COMPARISON, below, for a comparison to some other JSON modules.
38    
39 root 1.10 See MAPPING, below, on how JSON::XS maps perl values to JSON values and
40     vice versa.
41    
42 root 1.2 =head2 FEATURES
43    
44 root 1.1 =over 4
45    
46 root 1.21 =item * correct unicode handling
47 root 1.2
48 root 1.10 This module knows how to handle Unicode, and even documents how and when
49     it does so.
50 root 1.2
51     =item * round-trip integrity
52    
53     When you serialise a perl data structure using only datatypes supported
54     by JSON, the deserialised data structure is identical on the Perl level.
55 root 1.21 (e.g. the string "2.0" doesn't suddenly become "2" just because it looks
56     like a number).
57 root 1.2
58     =item * strict checking of JSON correctness
59    
60 root 1.16 There is no guessing, no generating of illegal JSON texts by default,
61 root 1.10 and only JSON is accepted as input by default (the latter is a security
62     feature).
63 root 1.2
64     =item * fast
65    
66 root 1.10 Compared to other JSON modules, this module compares favourably in terms
67     of speed, too.
68 root 1.2
69     =item * simple to use
70    
71     This module has both a simple functional interface as well as an OO
72     interface.
73    
74     =item * reasonably versatile output formats
75    
76 root 1.10 You can choose between the most compact guarenteed single-line format
77 root 1.21 possible (nice for simple line-based protocols), a pure-ascii format
78     (for when your transport is not 8-bit clean, still supports the whole
79     unicode range), or a pretty-printed format (for when you want to read that
80     stuff). Or you can combine those features in whatever way you like.
81 root 1.2
82     =back
83    
84 root 1.1 =cut
85    
86     package JSON::XS;
87    
88 root 1.20 use strict;
89    
90 root 1.1 BEGIN {
91 root 1.40 our $VERSION = '1.24';
92 root 1.20 our @ISA = qw(Exporter);
93 root 1.1
94 root 1.21 our @EXPORT = qw(to_json from_json objToJson jsonToObj);
95 root 1.1 require Exporter;
96    
97     require XSLoader;
98     XSLoader::load JSON::XS::, $VERSION;
99     }
100    
101 root 1.2 =head1 FUNCTIONAL INTERFACE
102    
103     The following convinience methods are provided by this module. They are
104     exported by default:
105    
106     =over 4
107    
108 root 1.16 =item $json_text = to_json $perl_scalar
109 root 1.2
110     Converts the given Perl data structure (a simple scalar or a reference to
111     a hash or array) to a UTF-8 encoded, binary string (that is, the string contains
112     octets only). Croaks on error.
113    
114 root 1.16 This function call is functionally identical to:
115 root 1.2
116 root 1.16 $json_text = JSON::XS->new->utf8->encode ($perl_scalar)
117    
118     except being faster.
119    
120     =item $perl_scalar = from_json $json_text
121 root 1.2
122     The opposite of C<to_json>: expects an UTF-8 (binary) string and tries to
123 root 1.16 parse that as an UTF-8 encoded JSON text, returning the resulting simple
124 root 1.2 scalar or reference. Croaks on error.
125    
126 root 1.16 This function call is functionally identical to:
127    
128     $perl_scalar = JSON::XS->new->utf8->decode ($json_text)
129    
130     except being faster.
131 root 1.2
132     =back
133    
134 root 1.23
135 root 1.2 =head1 OBJECT-ORIENTED INTERFACE
136    
137     The object oriented interface lets you configure your own encoding or
138     decoding style, within the limits of supported formats.
139    
140     =over 4
141    
142     =item $json = new JSON::XS
143    
144     Creates a new JSON::XS object that can be used to de/encode JSON
145     strings. All boolean flags described below are by default I<disabled>.
146 root 1.1
147 root 1.2 The mutators for flags all return the JSON object again and thus calls can
148     be chained:
149    
150 root 1.16 my $json = JSON::XS->new->utf8->space_after->encode ({a => [1,2]})
151 root 1.3 => {"a": [1, 2]}
152 root 1.2
153 root 1.7 =item $json = $json->ascii ([$enable])
154 root 1.2
155 root 1.16 If C<$enable> is true (or missing), then the C<encode> method will not
156     generate characters outside the code range C<0..127> (which is ASCII). Any
157     unicode characters outside that range will be escaped using either a
158     single \uXXXX (BMP characters) or a double \uHHHH\uLLLLL escape sequence,
159 root 1.32 as per RFC4627. The resulting encoded JSON text can be treated as a native
160     unicode string, an ascii-encoded, latin1-encoded or UTF-8 encoded string,
161     or any other superset of ASCII.
162 root 1.2
163     If C<$enable> is false, then the C<encode> method will not escape Unicode
164 root 1.33 characters unless required by the JSON syntax or other flags. This results
165     in a faster and more compact format.
166    
167     The main use for this flag is to produce JSON texts that can be
168     transmitted over a 7-bit channel, as the encoded JSON texts will not
169     contain any 8 bit characters.
170 root 1.2
171 root 1.16 JSON::XS->new->ascii (1)->encode ([chr 0x10401])
172     => ["\ud801\udc01"]
173 root 1.3
174 root 1.33 =item $json = $json->latin1 ([$enable])
175    
176     If C<$enable> is true (or missing), then the C<encode> method will encode
177     the resulting JSON text as latin1 (or iso-8859-1), escaping any characters
178     outside the code range C<0..255>. The resulting string can be treated as a
179     latin1-encoded JSON text or a native unicode string. The C<decode> method
180     will not be affected in any way by this flag, as C<decode> by default
181     expects unicode, which is a strict superset of latin1.
182    
183     If C<$enable> is false, then the C<encode> method will not escape Unicode
184     characters unless required by the JSON syntax or other flags.
185    
186     The main use for this flag is efficiently encoding binary data as JSON
187     text, as most octets will not be escaped, resulting in a smaller encoded
188     size. The disadvantage is that the resulting JSON text is encoded
189     in latin1 (and must correctly be treated as such when storing and
190     transfering), a rare encoding for JSON. It is therefore most useful when
191     you want to store data structures known to contain binary data efficiently
192     in files or databases, not when talking to other JSON encoders/decoders.
193    
194     JSON::XS->new->latin1->encode (["\x{89}\x{abc}"]
195     => ["\x{89}\\u0abc"] # (perl syntax, U+abc escaped, U+89 not)
196    
197 root 1.7 =item $json = $json->utf8 ([$enable])
198 root 1.2
199 root 1.7 If C<$enable> is true (or missing), then the C<encode> method will encode
200 root 1.16 the JSON result into UTF-8, as required by many protocols, while the
201 root 1.7 C<decode> method expects to be handled an UTF-8-encoded string. Please
202     note that UTF-8-encoded strings do not contain any characters outside the
203 root 1.16 range C<0..255>, they are thus useful for bytewise/binary I/O. In future
204     versions, enabling this option might enable autodetection of the UTF-16
205     and UTF-32 encoding families, as described in RFC4627.
206 root 1.2
207     If C<$enable> is false, then the C<encode> method will return the JSON
208     string as a (non-encoded) unicode string, while C<decode> expects thus a
209     unicode string. Any decoding or encoding (e.g. to UTF-8 or UTF-16) needs
210     to be done yourself, e.g. using the Encode module.
211    
212 root 1.16 Example, output UTF-16BE-encoded JSON:
213    
214     use Encode;
215     $jsontext = encode "UTF-16BE", JSON::XS->new->encode ($object);
216    
217     Example, decode UTF-32LE-encoded JSON:
218    
219     use Encode;
220     $object = JSON::XS->new->decode (decode "UTF-32LE", $jsontext);
221 root 1.12
222 root 1.7 =item $json = $json->pretty ([$enable])
223 root 1.2
224     This enables (or disables) all of the C<indent>, C<space_before> and
225 root 1.3 C<space_after> (and in the future possibly more) flags in one call to
226 root 1.2 generate the most readable (or most compact) form possible.
227    
228 root 1.12 Example, pretty-print some simple structure:
229    
230 root 1.3 my $json = JSON::XS->new->pretty(1)->encode ({a => [1,2]})
231     =>
232     {
233     "a" : [
234     1,
235     2
236     ]
237     }
238    
239 root 1.7 =item $json = $json->indent ([$enable])
240 root 1.2
241 root 1.7 If C<$enable> is true (or missing), then the C<encode> method will use a multiline
242 root 1.2 format as output, putting every array member or object/hash key-value pair
243     into its own line, identing them properly.
244    
245     If C<$enable> is false, no newlines or indenting will be produced, and the
246 root 1.16 resulting JSON text is guarenteed not to contain any C<newlines>.
247 root 1.2
248 root 1.16 This setting has no effect when decoding JSON texts.
249 root 1.2
250 root 1.7 =item $json = $json->space_before ([$enable])
251 root 1.2
252 root 1.7 If C<$enable> is true (or missing), then the C<encode> method will add an extra
253 root 1.2 optional space before the C<:> separating keys from values in JSON objects.
254    
255     If C<$enable> is false, then the C<encode> method will not add any extra
256     space at those places.
257    
258 root 1.16 This setting has no effect when decoding JSON texts. You will also
259     most likely combine this setting with C<space_after>.
260 root 1.2
261 root 1.12 Example, space_before enabled, space_after and indent disabled:
262    
263     {"key" :"value"}
264    
265 root 1.7 =item $json = $json->space_after ([$enable])
266 root 1.2
267 root 1.7 If C<$enable> is true (or missing), then the C<encode> method will add an extra
268 root 1.2 optional space after the C<:> separating keys from values in JSON objects
269     and extra whitespace after the C<,> separating key-value pairs and array
270     members.
271    
272     If C<$enable> is false, then the C<encode> method will not add any extra
273     space at those places.
274    
275 root 1.16 This setting has no effect when decoding JSON texts.
276 root 1.2
277 root 1.12 Example, space_before and indent disabled, space_after enabled:
278    
279     {"key": "value"}
280    
281 root 1.7 =item $json = $json->canonical ([$enable])
282 root 1.2
283 root 1.7 If C<$enable> is true (or missing), then the C<encode> method will output JSON objects
284 root 1.2 by sorting their keys. This is adding a comparatively high overhead.
285    
286     If C<$enable> is false, then the C<encode> method will output key-value
287     pairs in the order Perl stores them (which will likely change between runs
288     of the same script).
289    
290     This option is useful if you want the same data structure to be encoded as
291 root 1.16 the same JSON text (given the same overall settings). If it is disabled,
292 root 1.2 the same hash migh be encoded differently even if contains the same data,
293     as key-value pairs have no inherent ordering in Perl.
294    
295 root 1.16 This setting has no effect when decoding JSON texts.
296 root 1.2
297 root 1.7 =item $json = $json->allow_nonref ([$enable])
298 root 1.3
299 root 1.7 If C<$enable> is true (or missing), then the C<encode> method can convert a
300 root 1.3 non-reference into its corresponding string, number or null JSON value,
301     which is an extension to RFC4627. Likewise, C<decode> will accept those JSON
302     values instead of croaking.
303    
304     If C<$enable> is false, then the C<encode> method will croak if it isn't
305 root 1.16 passed an arrayref or hashref, as JSON texts must either be an object
306 root 1.3 or array. Likewise, C<decode> will croak if given something that is not a
307     JSON object or array.
308    
309 root 1.12 Example, encode a Perl scalar as JSON value with enabled C<allow_nonref>,
310     resulting in an invalid JSON text:
311    
312     JSON::XS->new->allow_nonref->encode ("Hello, World!")
313     => "Hello, World!"
314    
315 root 1.7 =item $json = $json->shrink ([$enable])
316    
317     Perl usually over-allocates memory a bit when allocating space for
318 root 1.24 strings. This flag optionally resizes strings generated by either
319 root 1.7 C<encode> or C<decode> to their minimum size possible. This can save
320 root 1.16 memory when your JSON texts are either very very long or you have many
321 root 1.8 short strings. It will also try to downgrade any strings to octet-form
322     if possible: perl stores strings internally either in an encoding called
323     UTF-X or in octet-form. The latter cannot store everything but uses less
324 root 1.24 space in general (and some buggy Perl or C code might even rely on that
325     internal representation being used).
326 root 1.7
327 root 1.24 The actual definition of what shrink does might change in future versions,
328     but it will always try to save space at the expense of time.
329    
330     If C<$enable> is true (or missing), the string returned by C<encode> will
331     be shrunk-to-fit, while all strings generated by C<decode> will also be
332     shrunk-to-fit.
333 root 1.7
334     If C<$enable> is false, then the normal perl allocation algorithms are used.
335     If you work with your data, then this is likely to be faster.
336    
337     In the future, this setting might control other things, such as converting
338     strings that look like integers or floats into integers or floats
339     internally (there is no difference on the Perl level), saving space.
340    
341 root 1.23 =item $json = $json->max_depth ([$maximum_nesting_depth])
342    
343 root 1.28 Sets the maximum nesting level (default C<512>) accepted while encoding
344 root 1.23 or decoding. If the JSON text or Perl data structure has an equal or
345     higher nesting level then this limit, then the encoder and decoder will
346     stop and croak at that point.
347    
348     Nesting level is defined by number of hash- or arrayrefs that the encoder
349     needs to traverse to reach a given point or the number of C<{> or C<[>
350     characters without their matching closing parenthesis crossed to reach a
351     given character in a string.
352    
353     Setting the maximum depth to one disallows any nesting, so that ensures
354     that the object is only a single hash/object or array.
355    
356     The argument to C<max_depth> will be rounded up to the next nearest power
357     of two.
358    
359     See SECURITY CONSIDERATIONS, below, for more info on why this is useful.
360    
361 root 1.16 =item $json_text = $json->encode ($perl_scalar)
362 root 1.2
363     Converts the given Perl data structure (a simple scalar or a reference
364     to a hash or array) to its JSON representation. Simple scalars will be
365     converted into JSON string or number sequences, while references to arrays
366     become JSON arrays and references to hashes become JSON objects. Undefined
367     Perl values (e.g. C<undef>) become JSON C<null> values. Neither C<true>
368     nor C<false> values will be generated.
369 root 1.1
370 root 1.16 =item $perl_scalar = $json->decode ($json_text)
371 root 1.1
372 root 1.16 The opposite of C<encode>: expects a JSON text and tries to parse it,
373 root 1.2 returning the resulting simple scalar or reference. Croaks on error.
374 root 1.1
375 root 1.2 JSON numbers and strings become simple Perl scalars. JSON arrays become
376     Perl arrayrefs and JSON objects become Perl hashrefs. C<true> becomes
377     C<1>, C<false> becomes C<0> and C<null> becomes C<undef>.
378 root 1.1
379 root 1.34 =item ($perl_scalar, $characters) = $json->decode_prefix ($json_text)
380    
381     This works like the C<decode> method, but instead of raising an exception
382     when there is trailing garbage after the first JSON object, it will
383     silently stop parsing there and return the number of characters consumed
384     so far.
385    
386     This is useful if your JSON texts are not delimited by an outer protocol
387     (which is not the brightest thing to do in the first place) and you need
388     to know where the JSON text ends.
389    
390     JSON::XS->new->decode_prefix ("[1] the tail")
391     => ([], 3)
392    
393 root 1.1 =back
394    
395 root 1.23
396 root 1.10 =head1 MAPPING
397    
398     This section describes how JSON::XS maps Perl values to JSON values and
399     vice versa. These mappings are designed to "do the right thing" in most
400     circumstances automatically, preserving round-tripping characteristics
401     (what you put in comes out as something equivalent).
402    
403     For the more enlightened: note that in the following descriptions,
404     lowercase I<perl> refers to the Perl interpreter, while uppcercase I<Perl>
405     refers to the abstract Perl language itself.
406    
407 root 1.39
408 root 1.10 =head2 JSON -> PERL
409    
410     =over 4
411    
412     =item object
413    
414     A JSON object becomes a reference to a hash in Perl. No ordering of object
415 root 1.14 keys is preserved (JSON does not preserver object key ordering itself).
416 root 1.10
417     =item array
418    
419     A JSON array becomes a reference to an array in Perl.
420    
421     =item string
422    
423     A JSON string becomes a string scalar in Perl - Unicode codepoints in JSON
424     are represented by the same codepoints in the Perl string, so no manual
425     decoding is necessary.
426    
427     =item number
428    
429     A JSON number becomes either an integer or numeric (floating point)
430     scalar in perl, depending on its range and any fractional parts. On the
431     Perl level, there is no difference between those as Perl handles all the
432     conversion details, but an integer may take slightly less memory and might
433     represent more values exactly than (floating point) numbers.
434    
435     =item true, false
436    
437     These JSON atoms become C<0>, C<1>, respectively. Information is lost in
438     this process. Future versions might represent those values differently,
439     but they will be guarenteed to act like these integers would normally in
440     Perl.
441    
442     =item null
443    
444     A JSON null atom becomes C<undef> in Perl.
445    
446     =back
447    
448 root 1.39
449 root 1.10 =head2 PERL -> JSON
450    
451     The mapping from Perl to JSON is slightly more difficult, as Perl is a
452     truly typeless language, so we can only guess which JSON type is meant by
453     a Perl value.
454    
455     =over 4
456    
457     =item hash references
458    
459     Perl hash references become JSON objects. As there is no inherent ordering
460 root 1.25 in hash keys (or JSON objects), they will usually be encoded in a
461     pseudo-random order that can change between runs of the same program but
462     stays generally the same within a single run of a program. JSON::XS can
463     optionally sort the hash keys (determined by the I<canonical> flag), so
464     the same datastructure will serialise to the same JSON text (given same
465     settings and version of JSON::XS), but this incurs a runtime overhead
466     and is only rarely useful, e.g. when you want to compare some JSON text
467     against another for equality.
468 root 1.10
469     =item array references
470    
471     Perl array references become JSON arrays.
472    
473 root 1.25 =item other references
474    
475     Other unblessed references are generally not allowed and will cause an
476     exception to be thrown, except for references to the integers C<0> and
477     C<1>, which get turned into C<false> and C<true> atoms in JSON. You can
478     also use C<JSON::XS::false> and C<JSON::XS::true> to improve readability.
479    
480     to_json [\0,JSON::XS::true] # yields [false,true]
481    
482 root 1.10 =item blessed objects
483    
484     Blessed objects are not allowed. JSON::XS currently tries to encode their
485     underlying representation (hash- or arrayref), but this behaviour might
486     change in future versions.
487    
488     =item simple scalars
489    
490     Simple Perl scalars (any scalar that is not a reference) are the most
491     difficult objects to encode: JSON::XS will encode undefined scalars as
492     JSON null value, scalars that have last been used in a string context
493     before encoding as JSON strings and anything else as number value:
494    
495     # dump as number
496     to_json [2] # yields [2]
497     to_json [-3.0e17] # yields [-3e+17]
498     my $value = 5; to_json [$value] # yields [5]
499    
500     # used as string, so dump as string
501     print $value;
502     to_json [$value] # yields ["5"]
503    
504     # undef becomes null
505     to_json [undef] # yields [null]
506    
507     You can force the type to be a string by stringifying it:
508    
509     my $x = 3.1; # some variable containing a number
510     "$x"; # stringified
511     $x .= ""; # another, more awkward way to stringify
512     print $x; # perl does it for you, too, quite often
513    
514     You can force the type to be a number by numifying it:
515    
516     my $x = "3"; # some variable containing a string
517     $x += 0; # numify it, ensuring it will be dumped as a number
518     $x *= 1; # same thing, the choise is yours.
519    
520     You can not currently output JSON booleans or force the type in other,
521     less obscure, ways. Tell me if you need this capability.
522    
523     =back
524    
525 root 1.23
526 root 1.3 =head1 COMPARISON
527    
528     As already mentioned, this module was created because none of the existing
529     JSON modules could be made to work correctly. First I will describe the
530     problems (or pleasures) I encountered with various existing JSON modules,
531 root 1.4 followed by some benchmark values. JSON::XS was designed not to suffer
532     from any of these problems or limitations.
533 root 1.3
534     =over 4
535    
536 root 1.5 =item JSON 1.07
537 root 1.3
538     Slow (but very portable, as it is written in pure Perl).
539    
540     Undocumented/buggy Unicode handling (how JSON handles unicode values is
541     undocumented. One can get far by feeding it unicode strings and doing
542     en-/decoding oneself, but unicode escapes are not working properly).
543    
544     No roundtripping (strings get clobbered if they look like numbers, e.g.
545     the string C<2.0> will encode to C<2.0> instead of C<"2.0">, and that will
546     decode into the number 2.
547    
548 root 1.5 =item JSON::PC 0.01
549 root 1.3
550     Very fast.
551    
552     Undocumented/buggy Unicode handling.
553    
554     No roundtripping.
555    
556 root 1.4 Has problems handling many Perl values (e.g. regex results and other magic
557     values will make it croak).
558 root 1.3
559     Does not even generate valid JSON (C<{1,2}> gets converted to C<{1:2}>
560 root 1.16 which is not a valid JSON text.
561 root 1.3
562     Unmaintained (maintainer unresponsive for many months, bugs are not
563     getting fixed).
564    
565 root 1.5 =item JSON::Syck 0.21
566 root 1.3
567     Very buggy (often crashes).
568    
569 root 1.4 Very inflexible (no human-readable format supported, format pretty much
570     undocumented. I need at least a format for easy reading by humans and a
571     single-line compact format for use in a protocol, and preferably a way to
572 root 1.16 generate ASCII-only JSON texts).
573 root 1.3
574     Completely broken (and confusingly documented) Unicode handling (unicode
575     escapes are not working properly, you need to set ImplicitUnicode to
576     I<different> values on en- and decoding to get symmetric behaviour).
577    
578     No roundtripping (simple cases work, but this depends on wether the scalar
579     value was used in a numeric context or not).
580    
581     Dumping hashes may skip hash values depending on iterator state.
582    
583     Unmaintained (maintainer unresponsive for many months, bugs are not
584     getting fixed).
585    
586     Does not check input for validity (i.e. will accept non-JSON input and
587     return "something" instead of raising an exception. This is a security
588     issue: imagine two banks transfering money between each other using
589     JSON. One bank might parse a given non-JSON request and deduct money,
590     while the other might reject the transaction with a syntax error. While a
591     good protocol will at least recover, that is extra unnecessary work and
592     the transaction will still not succeed).
593    
594 root 1.5 =item JSON::DWIW 0.04
595 root 1.3
596     Very fast. Very natural. Very nice.
597    
598     Undocumented unicode handling (but the best of the pack. Unicode escapes
599     still don't get parsed properly).
600    
601     Very inflexible.
602    
603     No roundtripping.
604    
605 root 1.16 Does not generate valid JSON texts (key strings are often unquoted, empty keys
606 root 1.4 result in nothing being output)
607    
608 root 1.3 Does not check input for validity.
609    
610     =back
611    
612 root 1.39
613     =head2 JSON and YAML
614    
615     You often hear that JSON is a subset (or a close subset) of YAML. This is,
616     however, a mass hysteria and very far from the truth. In general, there is
617     no way to configure JSON::XS to output a data structure as valid YAML.
618    
619 root 1.41 If you really must use JSON::XS to generate YAML, you should use this
620 root 1.39 algorithm (subject to change in future versions):
621    
622     my $to_yaml = JSON::XS->new->utf8->space_after (1);
623     my $yaml = $to_yaml->encode ($ref) . "\n";
624    
625     This will usually generate JSON texts that also parse as valid
626 root 1.41 YAML. Please note that YAML has hardcoded limits on (simple) object key
627     lengths that JSON doesn't have, so you should make sure that your hash
628     keys are noticably shorter than the 1024 characters YAML allows.
629 root 1.39
630     There might be other incompatibilities that I am not aware of. In general
631     you should not try to generate YAML with a JSON generator or vice versa,
632 root 1.41 or try to parse JSON with a YAML parser or vice versa: chances are high
633     that you will run into severe interoperability problems.
634 root 1.39
635    
636 root 1.3 =head2 SPEED
637    
638 root 1.4 It seems that JSON::XS is surprisingly fast, as shown in the following
639     tables. They have been generated with the help of the C<eg/bench> program
640     in the JSON::XS distribution, to make it easy to compare on your own
641     system.
642    
643 root 1.37 First comes a comparison between various modules using a very short
644     single-line JSON string:
645 root 1.18
646 root 1.37 {"method": "handleMessage", "params": ["user1", "we were just talking"], \
647 root 1.38 "id": null, "array":[1,11,234,-5,1e5,1e7, true, false]}
648 root 1.18
649 root 1.39 It shows the number of encodes/decodes per second (JSON::XS uses
650     the functional interface, while JSON::XS/2 uses the OO interface
651     with pretty-printing and hashkey sorting enabled, JSON::XS/3 enables
652     shrink). Higher is better:
653 root 1.4
654     module | encode | decode |
655     -----------|------------|------------|
656 root 1.38 JSON | 7645.468 | 4208.613 |
657 root 1.40 JSON::DWIW | 40721.398 | 77101.176 |
658 root 1.38 JSON::PC | 65948.176 | 78251.940 |
659 root 1.40 JSON::Syck | 22844.793 | 26479.192 |
660 root 1.38 JSON::XS | 388361.481 | 199728.762 |
661     JSON::XS/2 | 218453.333 | 192399.266 |
662     JSON::XS/3 | 338250.323 | 192399.266 |
663 root 1.40 Storable | 15779.925 | 14169.946 |
664 root 1.4 -----------+------------+------------+
665    
666 root 1.37 That is, JSON::XS is about five times faster than JSON::DWIW on encoding,
667 root 1.38 about three times faster on decoding, and over fourty times faster
668 root 1.37 than JSON, even with pretty-printing and key sorting. It also compares
669     favourably to Storable for small amounts of data.
670 root 1.4
671 root 1.13 Using a longer test string (roughly 18KB, generated from Yahoo! Locals
672 root 1.4 search API (http://nanoref.com/yahooapis/mgPdGg):
673    
674     module | encode | decode |
675     -----------|------------|------------|
676 root 1.37 JSON | 254.685 | 37.665 |
677 root 1.40 JSON::DWIW | 843.343 | 1049.731 |
678 root 1.37 JSON::PC | 3602.116 | 2307.352 |
679 root 1.40 JSON::Syck | 505.107 | 787.899 |
680     JSON::XS | 5747.196 | 3690.220 |
681     JSON::XS/2 | 3968.121 | 3676.634 |
682     JSON::XS/3 | 6105.246 | 3662.508 |
683     Storable | 4417.337 | 5285.161 |
684 root 1.4 -----------+------------+------------+
685    
686 root 1.40 Again, JSON::XS leads by far (except for Storable which non-surprisingly
687     decodes faster).
688 root 1.4
689 root 1.18 On large strings containing lots of high unicode characters, some modules
690     (such as JSON::PC) seem to decode faster than JSON::XS, but the result
691     will be broken due to missing (or wrong) unicode handling. Others refuse
692     to decode or encode properly, so it was impossible to prepare a fair
693     comparison table for that case.
694 root 1.13
695 root 1.11
696 root 1.23 =head1 SECURITY CONSIDERATIONS
697    
698     When you are using JSON in a protocol, talking to untrusted potentially
699     hostile creatures requires relatively few measures.
700    
701     First of all, your JSON decoder should be secure, that is, should not have
702     any buffer overflows. Obviously, this module should ensure that and I am
703     trying hard on making that true, but you never know.
704    
705     Second, you need to avoid resource-starving attacks. That means you should
706     limit the size of JSON texts you accept, or make sure then when your
707     resources run out, thats just fine (e.g. by using a separate process that
708     can crash safely). The size of a JSON text in octets or characters is
709     usually a good indication of the size of the resources required to decode
710     it into a Perl structure.
711    
712     Third, JSON::XS recurses using the C stack when decoding objects and
713     arrays. The C stack is a limited resource: for instance, on my amd64
714 root 1.28 machine with 8MB of stack size I can decode around 180k nested arrays but
715     only 14k nested JSON objects (due to perl itself recursing deeply on croak
716     to free the temporary). If that is exceeded, the program crashes. to be
717     conservative, the default nesting limit is set to 512. If your process
718     has a smaller stack, you should adjust this setting accordingly with the
719     C<max_depth> method.
720 root 1.23
721     And last but least, something else could bomb you that I forgot to think
722 root 1.30 of. In that case, you get to keep the pieces. I am always open for hints,
723 root 1.23 though...
724    
725 root 1.11
726 root 1.4 =head1 BUGS
727    
728     While the goal of this module is to be correct, that unfortunately does
729     not mean its bug-free, only that I think its design is bug-free. It is
730 root 1.23 still relatively early in its development. If you keep reporting bugs they
731     will be fixed swiftly, though.
732 root 1.4
733 root 1.2 =cut
734    
735 root 1.25 sub true() { \1 }
736     sub false() { \0 }
737    
738 root 1.2 1;
739    
740 root 1.1 =head1 AUTHOR
741    
742     Marc Lehmann <schmorp@schmorp.de>
743     http://home.schmorp.de/
744    
745     =cut
746