ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/JSON-XS/XS.pm
(Generate patch)

Comparing JSON-XS/XS.pm (file contents):
Revision 1.44 by root, Mon Jun 25 04:08:17 2007 UTC vs.
Revision 1.62 by root, Thu Oct 11 22:52:52 2007 UTC

1=encoding utf-8
2
1=head1 NAME 3=head1 NAME
2 4
3JSON::XS - JSON serialising/deserialising, done correctly and fast 5JSON::XS - JSON serialising/deserialising, done correctly and fast
6
7JSON::XS - 正しくて高速な JSON シリアライザ/デシリアライザ
8 (http://fleur.hio.jp/perldoc/mix/lib/JSON/XS.html)
4 9
5=head1 SYNOPSIS 10=head1 SYNOPSIS
6 11
7 use JSON::XS; 12 use JSON::XS;
8 13
9 # exported functions, they croak on error 14 # exported functions, they croak on error
10 # and expect/generate UTF-8 15 # and expect/generate UTF-8
11 16
12 $utf8_encoded_json_text = to_json $perl_hash_or_arrayref; 17 $utf8_encoded_json_text = to_json $perl_hash_or_arrayref;
13 $perl_hash_or_arrayref = from_json $utf8_encoded_json_text; 18 $perl_hash_or_arrayref = from_json $utf8_encoded_json_text;
14
15 # objToJson and jsonToObj aliases to to_json and from_json
16 # are exported for compatibility to the JSON module,
17 # but should not be used in new code.
18 19
19 # OO-interface 20 # OO-interface
20 21
21 $coder = JSON::XS->new->ascii->pretty->allow_nonref; 22 $coder = JSON::XS->new->ascii->pretty->allow_nonref;
22 $pretty_printed_unencoded = $coder->encode ($perl_scalar); 23 $pretty_printed_unencoded = $coder->encode ($perl_scalar);
85 86
86package JSON::XS; 87package JSON::XS;
87 88
88use strict; 89use strict;
89 90
90our $VERSION = '1.4'; 91our $VERSION = '1.5';
91our @ISA = qw(Exporter); 92our @ISA = qw(Exporter);
92 93
93our @EXPORT = qw(to_json from_json objToJson jsonToObj); 94our @EXPORT = qw(to_json from_json);
94 95
95use Exporter; 96use Exporter;
96use XSLoader; 97use XSLoader;
97 98
98=head1 FUNCTIONAL INTERFACE 99=head1 FUNCTIONAL INTERFACE
282 283
283Example, space_before and indent disabled, space_after enabled: 284Example, space_before and indent disabled, space_after enabled:
284 285
285 {"key": "value"} 286 {"key": "value"}
286 287
288=item $json = $json->relaxed ([$enable])
289
290If C<$enable> is true (or missing), then C<decode> will accept some
291extensions to normal JSON syntax (see below). C<encode> will not be
292affected in anyway. I<Be aware that this option makes you accept invalid
293JSON texts as if they were valid!>. I suggest only to use this option to
294parse application-specific files written by humans (configuration files,
295resource files etc.)
296
297If C<$enable> is false (the default), then C<decode> will only accept
298valid JSON texts.
299
300Currently accepted extensions are:
301
302=over 4
303
304=item * list items can have an end-comma
305
306JSON I<separates> array elements and key-value pairs with commas. This
307can be annoying if you write JSON texts manually and want to be able to
308quickly append elements, so this extension accepts comma at the end of
309such items not just between them:
310
311 [
312 1,
313 2, <- this comma not normally allowed
314 ]
315 {
316 "k1": "v1",
317 "k2": "v2", <- this comma not normally allowed
318 }
319
320=item * shell-style '#'-comments
321
322Whenever JSON allows whitespace, shell-style comments are additionally
323allowed. They are terminated by the first carriage-return or line-feed
324character, after which more white-space and comments are allowed.
325
326 [
327 1, # this comment not allowed in JSON
328 # neither this one...
329 ]
330
331=back
332
287=item $json = $json->canonical ([$enable]) 333=item $json = $json->canonical ([$enable])
288 334
289If C<$enable> is true (or missing), then the C<encode> method will output JSON objects 335If C<$enable> is true (or missing), then the C<encode> method will output JSON objects
290by sorting their keys. This is adding a comparatively high overhead. 336by sorting their keys. This is adding a comparatively high overhead.
291 337
341 387
342The C<TO_JSON> method may safely call die if it wants. If C<TO_JSON> 388The C<TO_JSON> method may safely call die if it wants. If C<TO_JSON>
343returns other blessed objects, those will be handled in the same 389returns other blessed objects, those will be handled in the same
344way. C<TO_JSON> must take care of not causing an endless recursion cycle 390way. C<TO_JSON> must take care of not causing an endless recursion cycle
345(== crash) in this case. The name of C<TO_JSON> was chosen because other 391(== crash) in this case. The name of C<TO_JSON> was chosen because other
346methods called by the Perl core (== not the user of the object) are 392methods called by the Perl core (== not by the user of the object) are
347usually in upper case letters and to avoid collisions with the C<to_json> 393usually in upper case letters and to avoid collisions with the C<to_json>
348function. 394function.
349 395
396This setting does not yet influence C<decode> in any way, but in the
397future, global hooks might get installed that influence C<decode> and are
398enabled by this setting.
399
350If C<$enable> is false, then the C<allow_blessed> setting will decide what 400If C<$enable> is false, then the C<allow_blessed> setting will decide what
351to do when a blessed object is found. 401to do when a blessed object is found.
402
403=item $json = $json->filter_json_object ([$coderef->($hashref)])
404
405When C<$coderef> is specified, it will be called from C<decode> each
406time it decodes a JSON object. The only argument is a reference to the
407newly-created hash. If the code references returns a single scalar (which
408need not be a reference), this value (i.e. a copy of that scalar to avoid
409aliasing) is inserted into the deserialised data structure. If it returns
410an empty list (NOTE: I<not> C<undef>, which is a valid scalar), the
411original deserialised hash will be inserted. This setting can slow down
412decoding considerably.
413
414When C<$coderef> is omitted or undefined, any existing callback will
415be removed and C<decode> will not change the deserialised hash in any
416way.
417
418Example, convert all JSON objects into the integer 5:
419
420 my $js = JSON::XS->new->filter_json_object (sub { 5 });
421 # returns [5]
422 $js->decode ('[{}]')
423 # throw an exception because allow_nonref is not enabled
424 # so a lone 5 is not allowed.
425 $js->decode ('{"a":1, "b":2}');
426
427=item $json = $json->filter_json_single_key_object ($key [=> $coderef->($value)])
428
429Works remotely similar to C<filter_json_object>, but is only called for
430JSON objects having a single key named C<$key>.
431
432This C<$coderef> is called before the one specified via
433C<filter_json_object>, if any. It gets passed the single value in the JSON
434object. If it returns a single value, it will be inserted into the data
435structure. If it returns nothing (not even C<undef> but the empty list),
436the callback from C<filter_json_object> will be called next, as if no
437single-key callback were specified.
438
439If C<$coderef> is omitted or undefined, the corresponding callback will be
440disabled. There can only ever be one callback for a given key.
441
442As this callback gets called less often then the C<filter_json_object>
443one, decoding speed will not usually suffer as much. Therefore, single-key
444objects make excellent targets to serialise Perl objects into, especially
445as single-key JSON objects are as close to the type-tagged value concept
446as JSON gets (its basically an ID/VALUE tuple). Of course, JSON does not
447support this in any way, so you need to make sure your data never looks
448like a serialised Perl hash.
449
450Typical names for the single object key are C<__class_whatever__>, or
451C<$__dollars_are_rarely_used__$> or C<}ugly_brace_placement>, or even
452things like C<__class_md5sum(classname)__>, to reduce the risk of clashing
453with real hashes.
454
455Example, decode JSON objects of the form C<< { "__widget__" => <id> } >>
456into the corresponding C<< $WIDGET{<id>} >> object:
457
458 # return whatever is in $WIDGET{5}:
459 JSON::XS
460 ->new
461 ->filter_json_single_key_object (__widget__ => sub {
462 $WIDGET{ $_[0] }
463 })
464 ->decode ('{"__widget__": 5')
465
466 # this can be used with a TO_JSON method in some "widget" class
467 # for serialisation to json:
468 sub WidgetBase::TO_JSON {
469 my ($self) = @_;
470
471 unless ($self->{id}) {
472 $self->{id} = ..get..some..id..;
473 $WIDGET{$self->{id}} = $self;
474 }
475
476 { __widget__ => $self->{id} }
477 }
352 478
353=item $json = $json->shrink ([$enable]) 479=item $json = $json->shrink ([$enable])
354 480
355Perl usually over-allocates memory a bit when allocating space for 481Perl usually over-allocates memory a bit when allocating space for
356strings. This flag optionally resizes strings generated by either 482strings. This flag optionally resizes strings generated by either
389given character in a string. 515given character in a string.
390 516
391Setting the maximum depth to one disallows any nesting, so that ensures 517Setting the maximum depth to one disallows any nesting, so that ensures
392that the object is only a single hash/object or array. 518that the object is only a single hash/object or array.
393 519
394The argument to C<max_depth> will be rounded up to the next nearest power 520The argument to C<max_depth> will be rounded up to the next highest power
395of two. 521of two. If no argument is given, the highest possible setting will be
522used, which is rarely useful.
523
524See SECURITY CONSIDERATIONS, below, for more info on why this is useful.
525
526=item $json = $json->max_size ([$maximum_string_size])
527
528Set the maximum length a JSON text may have (in bytes) where decoding is
529being attempted. The default is C<0>, meaning no limit. When C<decode>
530is called on a string longer then this number of characters it will not
531attempt to decode the string but throw an exception. This setting has no
532effect on C<encode> (yet).
533
534The argument to C<max_size> will be rounded up to the next B<highest>
535power of two (so may be more than requested). If no argument is given, the
536limit check will be deactivated (same as when C<0> is specified).
396 537
397See SECURITY CONSIDERATIONS, below, for more info on why this is useful. 538See SECURITY CONSIDERATIONS, below, for more info on why this is useful.
398 539
399=item $json_text = $json->encode ($perl_scalar) 540=item $json_text = $json->encode ($perl_scalar)
400 541
462are represented by the same codepoints in the Perl string, so no manual 603are represented by the same codepoints in the Perl string, so no manual
463decoding is necessary. 604decoding is necessary.
464 605
465=item number 606=item number
466 607
467A JSON number becomes either an integer or numeric (floating point) 608A JSON number becomes either an integer, numeric (floating point) or
468scalar in perl, depending on its range and any fractional parts. On the 609string scalar in perl, depending on its range and any fractional parts. On
469Perl level, there is no difference between those as Perl handles all the 610the Perl level, there is no difference between those as Perl handles all
470conversion details, but an integer may take slightly less memory and might 611the conversion details, but an integer may take slightly less memory and
471represent more values exactly than (floating point) numbers. 612might represent more values exactly than (floating point) numbers.
613
614If the number consists of digits only, JSON::XS will try to represent
615it as an integer value. If that fails, it will try to represent it as
616a numeric (floating point) value if that is possible without loss of
617precision. Otherwise it will preserve the number as a string value.
618
619Numbers containing a fractional or exponential part will always be
620represented as numeric (floating point) values, possibly at a loss of
621precision.
622
623This might create round-tripping problems as numbers might become strings,
624but as Perl is typeless there is no other way to do it.
472 625
473=item true, false 626=item true, false
474 627
475These JSON atoms become C<JSON::XS::true> and C<JSON::XS::false>, 628These JSON atoms become C<JSON::XS::true> and C<JSON::XS::false>,
476respectively. They are overloaded to act almost exactly like the numbers 629respectively. They are overloaded to act almost exactly like the numbers
518 to_json [\0,JSON::XS::true] # yields [false,true] 671 to_json [\0,JSON::XS::true] # yields [false,true]
519 672
520=item JSON::XS::true, JSON::XS::false 673=item JSON::XS::true, JSON::XS::false
521 674
522These special values become JSON true and JSON false values, 675These special values become JSON true and JSON false values,
523respectively. You cna alos use C<\1> and C<\0> directly if you want. 676respectively. You can also use C<\1> and C<\0> directly if you want.
524 677
525=item blessed objects 678=item blessed objects
526 679
527Blessed objects are not allowed. JSON::XS currently tries to encode their 680Blessed objects are not allowed. JSON::XS currently tries to encode their
528underlying representation (hash- or arrayref), but this behaviour might 681underlying representation (hash- or arrayref), but this behaviour might
692It shows the number of encodes/decodes per second (JSON::XS uses 845It shows the number of encodes/decodes per second (JSON::XS uses
693the functional interface, while JSON::XS/2 uses the OO interface 846the functional interface, while JSON::XS/2 uses the OO interface
694with pretty-printing and hashkey sorting enabled, JSON::XS/3 enables 847with pretty-printing and hashkey sorting enabled, JSON::XS/3 enables
695shrink). Higher is better: 848shrink). Higher is better:
696 849
850 Storable | 15779.925 | 14169.946 |
851 -----------+------------+------------+
697 module | encode | decode | 852 module | encode | decode |
698 -----------|------------|------------| 853 -----------|------------|------------|
699 JSON | 7645.468 | 4208.613 | 854 JSON | 4990.842 | 4088.813 |
700 JSON::DWIW | 40721.398 | 77101.176 | 855 JSON::DWIW | 51653.990 | 71575.154 |
701 JSON::PC | 65948.176 | 78251.940 | 856 JSON::PC | 65948.176 | 74631.744 |
702 JSON::Syck | 22844.793 | 26479.192 | 857 JSON::PP | 8931.652 | 3817.168 |
858 JSON::Syck | 24877.248 | 27776.848 |
703 JSON::XS | 388361.481 | 199728.762 | 859 JSON::XS | 388361.481 | 227951.304 |
704 JSON::XS/2 | 218453.333 | 192399.266 | 860 JSON::XS/2 | 227951.304 | 218453.333 |
705 JSON::XS/3 | 338250.323 | 192399.266 | 861 JSON::XS/3 | 338250.323 | 218453.333 |
706 Storable | 15779.925 | 14169.946 | 862 Storable | 16500.016 | 135300.129 |
707 -----------+------------+------------+ 863 -----------+------------+------------+
708 864
709That is, JSON::XS is about five times faster than JSON::DWIW on encoding, 865That is, JSON::XS is about five times faster than JSON::DWIW on encoding,
710about three times faster on decoding, and over fourty times faster 866about three times faster on decoding, and over fourty times faster
711than JSON, even with pretty-printing and key sorting. It also compares 867than JSON, even with pretty-printing and key sorting. It also compares
714Using a longer test string (roughly 18KB, generated from Yahoo! Locals 870Using a longer test string (roughly 18KB, generated from Yahoo! Locals
715search API (http://nanoref.com/yahooapis/mgPdGg): 871search API (http://nanoref.com/yahooapis/mgPdGg):
716 872
717 module | encode | decode | 873 module | encode | decode |
718 -----------|------------|------------| 874 -----------|------------|------------|
719 JSON | 254.685 | 37.665 | 875 JSON | 55.260 | 34.971 |
720 JSON::DWIW | 843.343 | 1049.731 | 876 JSON::DWIW | 825.228 | 1082.513 |
721 JSON::PC | 3602.116 | 2307.352 | 877 JSON::PC | 3571.444 | 2394.829 |
878 JSON::PP | 210.987 | 32.574 |
722 JSON::Syck | 505.107 | 787.899 | 879 JSON::Syck | 552.551 | 787.544 |
723 JSON::XS | 5747.196 | 3690.220 | 880 JSON::XS | 5780.463 | 4854.519 |
724 JSON::XS/2 | 3968.121 | 3676.634 | 881 JSON::XS/2 | 3869.998 | 4798.975 |
725 JSON::XS/3 | 6105.246 | 3662.508 | 882 JSON::XS/3 | 5862.880 | 4798.975 |
726 Storable | 4417.337 | 5285.161 | 883 Storable | 4445.002 | 5235.027 |
727 -----------+------------+------------+ 884 -----------+------------+------------+
728 885
729Again, JSON::XS leads by far (except for Storable which non-surprisingly 886Again, JSON::XS leads by far (except for Storable which non-surprisingly
730decodes faster). 887decodes faster).
731 888
748Second, you need to avoid resource-starving attacks. That means you should 905Second, you need to avoid resource-starving attacks. That means you should
749limit the size of JSON texts you accept, or make sure then when your 906limit the size of JSON texts you accept, or make sure then when your
750resources run out, thats just fine (e.g. by using a separate process that 907resources run out, thats just fine (e.g. by using a separate process that
751can crash safely). The size of a JSON text in octets or characters is 908can crash safely). The size of a JSON text in octets or characters is
752usually a good indication of the size of the resources required to decode 909usually a good indication of the size of the resources required to decode
753it into a Perl structure. 910it into a Perl structure. While JSON::XS can check the size of the JSON
911text, it might be too late when you already have it in memory, so you
912might want to check the size before you accept the string.
754 913
755Third, JSON::XS recurses using the C stack when decoding objects and 914Third, JSON::XS recurses using the C stack when decoding objects and
756arrays. The C stack is a limited resource: for instance, on my amd64 915arrays. The C stack is a limited resource: for instance, on my amd64
757machine with 8MB of stack size I can decode around 180k nested arrays but 916machine with 8MB of stack size I can decode around 180k nested arrays but
758only 14k nested JSON objects (due to perl itself recursing deeply on croak 917only 14k nested JSON objects (due to perl itself recursing deeply on croak

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines