… | |
… | |
83 | this module usually compares favourably in terms of speed, too. |
83 | this module usually compares favourably in terms of speed, too. |
84 | |
84 | |
85 | =item * simple to use |
85 | =item * simple to use |
86 | |
86 | |
87 | This module has both a simple functional interface as well as an object |
87 | This module has both a simple functional interface as well as an object |
88 | oriented interface interface. |
88 | oriented interface. |
89 | |
89 | |
90 | =item * reasonably versatile output formats |
90 | =item * reasonably versatile output formats |
91 | |
91 | |
92 | You can choose between the most compact guaranteed-single-line format |
92 | You can choose between the most compact guaranteed-single-line format |
93 | possible (nice for simple line-based protocols), a pure-ASCII format |
93 | possible (nice for simple line-based protocols), a pure-ASCII format |
… | |
… | |
101 | |
101 | |
102 | package JSON::XS; |
102 | package JSON::XS; |
103 | |
103 | |
104 | use common::sense; |
104 | use common::sense; |
105 | |
105 | |
106 | our $VERSION = '2.3'; |
106 | our $VERSION = 2.34; |
107 | our @ISA = qw(Exporter); |
107 | our @ISA = qw(Exporter); |
108 | |
108 | |
109 | our @EXPORT = qw(encode_json decode_json to_json from_json); |
109 | our @EXPORT = qw(encode_json decode_json); |
110 | |
|
|
111 | sub to_json($) { |
|
|
112 | require Carp; |
|
|
113 | Carp::croak ("JSON::XS::to_json has been renamed to encode_json, either downgrade to pre-2.0 versions of JSON::XS or rename the call"); |
|
|
114 | } |
|
|
115 | |
|
|
116 | sub from_json($) { |
|
|
117 | require Carp; |
|
|
118 | Carp::croak ("JSON::XS::from_json has been renamed to decode_json, either downgrade to pre-2.0 versions of JSON::XS or rename the call"); |
|
|
119 | } |
|
|
120 | |
110 | |
121 | use Exporter; |
111 | use Exporter; |
122 | use XSLoader; |
112 | use XSLoader; |
123 | |
113 | |
124 | =head1 FUNCTIONAL INTERFACE |
114 | =head1 FUNCTIONAL INTERFACE |
… | |
… | |
432 | If C<$enable> is true (or missing), then the C<encode> method will output JSON objects |
422 | If C<$enable> is true (or missing), then the C<encode> method will output JSON objects |
433 | by sorting their keys. This adds a comparatively high overhead. |
423 | by sorting their keys. This adds a comparatively high overhead. |
434 | |
424 | |
435 | If C<$enable> is false, then the C<encode> method will output key-value |
425 | If C<$enable> is false, then the C<encode> method will output key-value |
436 | pairs in the order Perl stores them (which will likely change between runs |
426 | pairs in the order Perl stores them (which will likely change between runs |
437 | of the same script). |
427 | of the same script, and can change even within the same run from 5.18 |
|
|
428 | onwards). |
438 | |
429 | |
439 | This option is useful if you want the same data structure to be encoded as |
430 | This option is useful if you want the same data structure to be encoded as |
440 | the same JSON text (given the same overall settings). If it is disabled, |
431 | the same JSON text (given the same overall settings). If it is disabled, |
441 | the same hash might be encoded differently even if it contains the same data, |
432 | the same hash might be encoded differently even if it contains the same data, |
442 | as key-value pairs have no inherent ordering in Perl. |
433 | as key-value pairs have no inherent ordering in Perl. |
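As a minimal sketch of the behaviour described above, the following encodes a small hash with C<canonical> enabled. The hash contents are made up for illustration; only C<new>, C<canonical> and C<encode> are taken from the module's interface.

```perl
use strict;
use warnings;
use JSON::XS;

# Enable key sorting so the same data always yields the same JSON text.
my $json = JSON::XS->new->canonical (1);

# Illustrative data; Perl may store these keys in any order internally.
my $text = $json->encode ({ b => 2, a => 1, c => 3 });

print "$text\n";   # {"a":1,"b":2,"c":3}
```

Without C<canonical>, the three pairs could appear in any order, which matters when the output is compared, hashed or cached.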
… | |
… | |
666 | |
657 | |
667 | See SECURITY CONSIDERATIONS, below, for more info on why this is useful. |
658 | See SECURITY CONSIDERATIONS, below, for more info on why this is useful. |
668 | |
659 | |
669 | =item $json_text = $json->encode ($perl_scalar) |
660 | =item $json_text = $json->encode ($perl_scalar) |
670 | |
661 | |
671 | Converts the given Perl data structure (a simple scalar or a reference |
662 | Converts the given Perl value or data structure to its JSON |
672 | to a hash or array) to its JSON representation. Simple scalars will be |
663 | representation. Croaks on error. |
673 | converted into JSON string or number sequences, while references to arrays |
|
|
674 | become JSON arrays and references to hashes become JSON objects. Undefined |
|
|
675 | Perl values (e.g. C<undef>) become JSON C<null> values. Neither C<true> |
|
|
676 | nor C<false> values will be generated. |
|
|
677 | |
664 | |
678 | =item $perl_scalar = $json->decode ($json_text) |
665 | =item $perl_scalar = $json->decode ($json_text) |
679 | |
666 | |
680 | The opposite of C<encode>: expects a JSON text and tries to parse it, |
667 | The opposite of C<encode>: expects a JSON text and tries to parse it, |
681 | returning the resulting simple scalar or reference. Croaks on error. |
668 | returning the resulting simple scalar or reference. Croaks on error. |
682 | |
|
|
683 | JSON numbers and strings become simple Perl scalars. JSON arrays become |
|
|
684 | Perl arrayrefs and JSON objects become Perl hashrefs. C<true> becomes |
|
|
685 | C<1>, C<false> becomes C<0> and C<null> becomes C<undef>. |
|
|
686 | |
669 | |
687 | =item ($perl_scalar, $characters) = $json->decode_prefix ($json_text) |
670 | =item ($perl_scalar, $characters) = $json->decode_prefix ($json_text) |
688 | |
671 | |
689 | This works like the C<decode> method, but instead of raising an exception |
672 | This works like the C<decode> method, but instead of raising an exception |
690 | when there is trailing garbage after the first JSON object, it will |
673 | when there is trailing garbage after the first JSON object, it will |
… | |
… | |
740 | |
723 | |
741 | If the method is called in scalar context, then it will try to extract |
724 | If the method is called in scalar context, then it will try to extract |
742 | exactly I<one> JSON object. If that is successful, it will return this |
725 | exactly I<one> JSON object. If that is successful, it will return this |
743 | object, otherwise it will return C<undef>. If there is a parse error, |
726 | object, otherwise it will return C<undef>. If there is a parse error, |
744 | this method will croak just as C<decode> would do (one can then use |
727 | this method will croak just as C<decode> would do (one can then use |
745 | C<incr_skip> to skip the errornous part). This is the most common way of |
728 | C<incr_skip> to skip the erroneous part). This is the most common way of |
746 | using the method. |
729 | using the method. |
747 | |
730 | |
748 | And finally, in list context, it will try to extract as many objects |
731 | And finally, in list context, it will try to extract as many objects |
749 | from the stream as it can find and return them, or the empty list |
732 | from the stream as it can find and return them, or the empty list |
750 | otherwise. For this to work, there must be no separators between the JSON |
733 | otherwise. For this to work, there must be no separators between the JSON |
… | |
… | |
779 | C<incr_parse> died, in which case the input buffer and incremental parser |
762 | C<incr_parse> died, in which case the input buffer and incremental parser |
780 | state is left unchanged, to skip the text parsed so far and to reset the |
763 | state is left unchanged, to skip the text parsed so far and to reset the |
781 | parse state. |
764 | parse state. |
782 | |
765 | |
783 | The difference to C<incr_reset> is that only text until the parse error |
766 | The difference to C<incr_reset> is that only text until the parse error |
784 | occured is removed. |
767 | occurred is removed. |
785 | |
768 | |
786 | =item $json->incr_reset |
769 | =item $json->incr_reset |
787 | |
770 | |
788 | This completely resets the incremental parser, that is, after this call, |
771 | This completely resets the incremental parser, that is, after this call, |
789 | it will be as if the parser had never parsed anything. |
772 | it will be as if the parser had never parsed anything. |
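The scalar-context behaviour of C<incr_parse> and the recovery calls described above might be combined roughly as follows. This is only a sketch: the input fragments are invented, and calling C<incr_reset> after C<incr_skip> is a conservative choice made here to start from a clean state, not something the text requires.

```perl
use strict;
use warnings;
use JSON::XS;

my $json = JSON::XS->new;

# Incomplete input: in scalar context incr_parse returns undef until
# one complete JSON text has been accumulated.
my $first  = $json->incr_parse ('[1,2');

# The rest of the text arrives; now the complete array is returned.
my $second = $json->incr_parse (',3]');

# Malformed input makes incr_parse croak, just as decode would.
my $ok = eval { $json->incr_parse ('{"a" 1}'); 1 };

unless ($ok) {
    $json->incr_skip;    # drop the text parsed up to the error
    $json->incr_reset;   # and, in this sketch, discard any remaining state
}

# The parser is usable again afterwards.
my $third = $json->incr_parse ('[4]');
```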
… | |
… | |
987 | If the number consists of digits only, JSON::XS will try to represent |
970 | If the number consists of digits only, JSON::XS will try to represent |
988 | it as an integer value. If that fails, it will try to represent it as |
971 | it as an integer value. If that fails, it will try to represent it as |
989 | a numeric (floating point) value if that is possible without loss of |
972 | a numeric (floating point) value if that is possible without loss of |
990 | precision. Otherwise it will preserve the number as a string value (in |
973 | precision. Otherwise it will preserve the number as a string value (in |
991 | which case you lose roundtripping ability, as the JSON number will be |
974 | which case you lose roundtripping ability, as the JSON number will be |
992 | re-encoded toa JSON string). |
975 | re-encoded to a JSON string). |
993 | |
976 | |
994 | Numbers containing a fractional or exponential part will always be |
977 | Numbers containing a fractional or exponential part will always be |
995 | represented as numeric (floating point) values, possibly at a loss of |
978 | represented as numeric (floating point) values, possibly at a loss of |
996 | precision (in which case you might lose perfect roundtripping ability, but |
979 | precision (in which case you might lose perfect roundtripping ability, but |
997 | the JSON number will still be re-encoded as a JSON number). |
980 | the JSON number will still be re-encoded as a JSON number). |
998 | |
981 | |
999 | Note that precision is not accuracy - binary floating point values cannot |
982 | Note that precision is not accuracy - binary floating point values cannot |
1000 | represent most decimal fractions exactly, and when converting from and to |
983 | represent most decimal fractions exactly, and when converting from and to |
1001 | floating point, JSON::XS only guarantees precision up to but not including |
984 | floating point, JSON::XS only guarantees precision up to but not including |
1002 | the leats significant bit. |
985 | the least significant bit. |
1003 | |
986 | |
1004 | =item true, false |
987 | =item true, false |
1005 | |
988 | |
1006 | These JSON atoms become C<JSON::XS::true> and C<JSON::XS::false>, |
989 | These JSON atoms become C<JSON::XS::true> and C<JSON::XS::false>, |
1007 | respectively. They are overloaded to act almost exactly like the numbers |
990 | respectively. They are overloaded to act almost exactly like the numbers |
… | |
… | |
1137 | =item C<utf8> flag disabled |
1120 | =item C<utf8> flag disabled |
1138 | |
1121 | |
1139 | When C<utf8> is disabled (the default), then C<encode>/C<decode> generate |
1122 | When C<utf8> is disabled (the default), then C<encode>/C<decode> generate |
1140 | and expect Unicode strings, that is, characters with high ordinal Unicode |
1123 | and expect Unicode strings, that is, characters with high ordinal Unicode |
1141 | values (> 255) will be encoded as such characters, and likewise such |
1124 | values (> 255) will be encoded as such characters, and likewise such |
1142 | characters are decoded as-is, no canges to them will be done, except |
1125 | characters are decoded as-is, no changes to them will be done, except |
1143 | "(re-)interpreting" them as Unicode codepoints or Unicode characters, |
1126 | "(re-)interpreting" them as Unicode codepoints or Unicode characters, |
1144 | respectively (to Perl, these are the same thing in strings unless you do |
1127 | respectively (to Perl, these are the same thing in strings unless you do |
1145 | funny/weird/dumb stuff). |
1128 | funny/weird/dumb stuff). |
1146 | |
1129 | |
1147 | This is useful when you want to do the encoding yourself (e.g. when you |
1130 | This is useful when you want to do the encoding yourself (e.g. when you |
… | |
… | |
1263 | output for these property strings, e.g.: |
1246 | output for these property strings, e.g.: |
1264 | |
1247 | |
1265 | $json =~ s/"__proto__"\s*:/"__proto__renamed":/g; |
1248 | $json =~ s/"__proto__"\s*:/"__proto__renamed":/g; |
1266 | |
1249 | |
1267 | This works because C<__proto__> is not valid outside of strings, so every |
1250 | This works because C<__proto__> is not valid outside of strings, so every |
1268 | occurence of C<"__proto__"\s*:> must be a string used as property name. |
1251 | occurrence of C<"__proto__"\s*:> must be a string used as property name. |
1269 | |
1252 | |
1270 | If you know of other incompatibilities, please let me know. |
1253 | If you know of other incompatibilities, please let me know. |
1271 | |
1254 | |
1272 | |
1255 | |
1273 | =head2 JSON and YAML |
1256 | =head2 JSON and YAML |
… | |
… | |
1445 | process simulations - use fork, it's I<much> faster, cheaper, better). |
1428 | process simulations - use fork, it's I<much> faster, cheaper, better). |
1446 | |
1429 | |
1447 | (It might actually work, but you have been warned). |
1430 | (It might actually work, but you have been warned). |
1448 | |
1431 | |
1449 | |
1432 | |
|
|
1433 | =head1 THE PERILS OF SETLOCALE |
|
|
1434 | |
|
|
1435 | Sometimes people avoid the Perl locale support and directly call the |
|
|
1436 | system's setlocale function with C<LC_ALL>. |
|
|
1437 | |
|
|
1438 | This breaks both perl and modules such as JSON::XS, as stringification of |
|
|
1439 | numbers no longer works correctly (e.g. C<$x = 0.1; print "$x"+1> might |
|
|
1440 | print C<1>, and JSON::XS might output illegal JSON as JSON::XS relies on |
|
|
1441 | perl to stringify numbers). |
|
|
1442 | |
|
|
1443 | The solution is simple: don't call C<setlocale>, or use it for only those |
|
|
1444 | categories you need, such as C<LC_MESSAGES> or C<LC_CTYPE>. |
|
|
1445 | |
|
|
1446 | If you need C<LC_NUMERIC>, you should enable it only around the code that |
|
|
1447 | actually needs it (avoiding stringification of numbers), and restore it |
|
|
1448 | afterwards. |
|
|
1449 | |
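One way to follow the advice above is to switch C<LC_NUMERIC> only for the duration of a callback and restore it afterwards. The helper name C<with_numeric_locale> is hypothetical and not part of JSON::XS or POSIX; the sketch assumes the requested locale is installed and that no numbers are stringified for JSON output inside the callback.

```perl
use strict;
use warnings;
use POSIX qw(setlocale LC_NUMERIC);

# Hypothetical helper: run $code with LC_NUMERIC set to $locale,
# then restore the previous setting even if $code dies.
sub with_numeric_locale {
    my ($locale, $code) = @_;

    my $saved = setlocale (LC_NUMERIC);   # query the current setting
    setlocale (LC_NUMERIC, $locale);      # returns undef if the locale is unavailable

    my @result = eval { $code->() };
    my $error  = $@;

    setlocale (LC_NUMERIC, $saved);       # always restore
    die $error if $error;

    return @result;
}

# Usage: the locale name here is only an example.
my $before = setlocale (LC_NUMERIC);
my ($answer) = with_numeric_locale ("C", sub { 6 * 7 });
my $after  = setlocale (LC_NUMERIC);
```

Encoding with JSON::XS should then happen outside the callback, where C<LC_NUMERIC> is back to its previous value.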
|
|
1450 | |
1450 | =head1 BUGS |
1451 | =head1 BUGS |
1451 | |
1452 | |
1452 | While the goal of this module is to be correct, that unfortunately does |
1453 | While the goal of this module is to be correct, that unfortunately does |
1453 | not mean it's bug-free, only that I think its design is bug-free. If you |
1454 | not mean it's bug-free, only that I think its design is bug-free. If you |
1454 | keep reporting bugs they will be fixed swiftly, though. |
1455 | keep reporting bugs they will be fixed swiftly, though. |