… | |
… | |
83 | this module usually compares favourably in terms of speed, too. |
83 | this module usually compares favourably in terms of speed, too. |
84 | |
84 | |
85 | =item * simple to use |
85 | =item * simple to use |
86 | |
86 | |
87 | This module has both a simple functional interface as well as an object |
87 | This module has both a simple functional interface as well as an object |
88 | oriented interface interface. |
88 | oriented interface. |
89 | |
89 | |
90 | =item * reasonably versatile output formats |
90 | =item * reasonably versatile output formats |
91 | |
91 | |
92 | You can choose between the most compact guaranteed-single-line format |
92 | You can choose between the most compact guaranteed-single-line format |
93 | possible (nice for simple line-based protocols), a pure-ASCII format |
93 | possible (nice for simple line-based protocols), a pure-ASCII format |
… | |
… | |
101 | |
101 | |
102 | package JSON::XS; |
102 | package JSON::XS; |
103 | |
103 | |
104 | use common::sense; |
104 | use common::sense; |
105 | |
105 | |
106 | our $VERSION = '2.33'; |
106 | our $VERSION = 2.34; |
107 | our @ISA = qw(Exporter); |
107 | our @ISA = qw(Exporter); |
108 | |
108 | |
109 | our @EXPORT = qw(encode_json decode_json to_json from_json); |
109 | our @EXPORT = qw(encode_json decode_json to_json from_json); |
110 | |
110 | |
111 | sub to_json($) { |
111 | sub to_json($) { |
… | |
… | |
432 | If C<$enable> is true (or missing), then the C<encode> method will output JSON objects |
432 | If C<$enable> is true (or missing), then the C<encode> method will output JSON objects |
433 | by sorting their keys. This is adding a comparatively high overhead. |
433 | by sorting their keys. This is adding a comparatively high overhead. |
434 | |
434 | |
435 | If C<$enable> is false, then the C<encode> method will output key-value |
435 | If C<$enable> is false, then the C<encode> method will output key-value |
436 | pairs in the order Perl stores them (which will likely change between runs |
436 | pairs in the order Perl stores them (which will likely change between runs |
437 | of the same script). |
437 | of the same script, and can change even within the same run from 5.18 |
|
|
438 | onwards). |
438 | |
439 | |
439 | This option is useful if you want the same data structure to be encoded as |
440 | This option is useful if you want the same data structure to be encoded as |
440 | the same JSON text (given the same overall settings). If it is disabled, |
441 | the same JSON text (given the same overall settings). If it is disabled, |
441 | the same hash might be encoded differently even if contains the same data, |
442 | the same hash might be encoded differently even if contains the same data, |
442 | as key-value pairs have no inherent ordering in Perl. |
443 | as key-value pairs have no inherent ordering in Perl. |
… | |
… | |
740 | |
741 | |
741 | If the method is called in scalar context, then it will try to extract |
742 | If the method is called in scalar context, then it will try to extract |
742 | exactly I<one> JSON object. If that is successful, it will return this |
743 | exactly I<one> JSON object. If that is successful, it will return this |
743 | object, otherwise it will return C<undef>. If there is a parse error, |
744 | object, otherwise it will return C<undef>. If there is a parse error, |
744 | this method will croak just as C<decode> would do (one can then use |
745 | this method will croak just as C<decode> would do (one can then use |
745 | C<incr_skip> to skip the errornous part). This is the most common way of |
746 | C<incr_skip> to skip the erroneous part). This is the most common way of |
746 | using the method. |
747 | using the method. |
747 | |
748 | |
748 | And finally, in list context, it will try to extract as many objects |
749 | And finally, in list context, it will try to extract as many objects |
749 | from the stream as it can find and return them, or the empty list |
750 | from the stream as it can find and return them, or the empty list |
750 | otherwise. For this to work, there must be no separators between the JSON |
751 | otherwise. For this to work, there must be no separators between the JSON |
… | |
… | |
779 | C<incr_parse> died, in which case the input buffer and incremental parser |
780 | C<incr_parse> died, in which case the input buffer and incremental parser |
780 | state is left unchanged, to skip the text parsed so far and to reset the |
781 | state is left unchanged, to skip the text parsed so far and to reset the |
781 | parse state. |
782 | parse state. |
782 | |
783 | |
783 | The difference to C<incr_reset> is that only text until the parse error |
784 | The difference to C<incr_reset> is that only text until the parse error |
784 | occured is removed. |
785 | occurred is removed. |
785 | |
786 | |
786 | =item $json->incr_reset |
787 | =item $json->incr_reset |
787 | |
788 | |
788 | This completely resets the incremental parser, that is, after this call, |
789 | This completely resets the incremental parser, that is, after this call, |
789 | it will be as if the parser had never parsed anything. |
790 | it will be as if the parser had never parsed anything. |
… | |
… | |
987 | If the number consists of digits only, JSON::XS will try to represent |
988 | If the number consists of digits only, JSON::XS will try to represent |
988 | it as an integer value. If that fails, it will try to represent it as |
989 | it as an integer value. If that fails, it will try to represent it as |
989 | a numeric (floating point) value if that is possible without loss of |
990 | a numeric (floating point) value if that is possible without loss of |
990 | precision. Otherwise it will preserve the number as a string value (in |
991 | precision. Otherwise it will preserve the number as a string value (in |
991 | which case you lose roundtripping ability, as the JSON number will be |
992 | which case you lose roundtripping ability, as the JSON number will be |
992 | re-encoded toa JSON string). |
993 | re-encoded to a JSON string). |
993 | |
994 | |
994 | Numbers containing a fractional or exponential part will always be |
995 | Numbers containing a fractional or exponential part will always be |
995 | represented as numeric (floating point) values, possibly at a loss of |
996 | represented as numeric (floating point) values, possibly at a loss of |
996 | precision (in which case you might lose perfect roundtripping ability, but |
997 | precision (in which case you might lose perfect roundtripping ability, but |
997 | the JSON number will still be re-encoded as a JSON number). |
998 | the JSON number will still be re-encoded as a JSON number). |
998 | |
999 | |
999 | Note that precision is not accuracy - binary floating point values cannot |
1000 | Note that precision is not accuracy - binary floating point values cannot |
1000 | represent most decimal fractions exactly, and when converting from and to |
1001 | represent most decimal fractions exactly, and when converting from and to |
1001 | floating point, JSON::XS only guarantees precision up to but not including |
1002 | floating point, JSON::XS only guarantees precision up to but not including |
1002 | the leats significant bit. |
1003 | the least significant bit. |
1003 | |
1004 | |
1004 | =item true, false |
1005 | =item true, false |
1005 | |
1006 | |
1006 | These JSON atoms become C<JSON::XS::true> and C<JSON::XS::false>, |
1007 | These JSON atoms become C<JSON::XS::true> and C<JSON::XS::false>, |
1007 | respectively. They are overloaded to act almost exactly like the numbers |
1008 | respectively. They are overloaded to act almost exactly like the numbers |
… | |
… | |
1137 | =item C<utf8> flag disabled |
1138 | =item C<utf8> flag disabled |
1138 | |
1139 | |
1139 | When C<utf8> is disabled (the default), then C<encode>/C<decode> generate |
1140 | When C<utf8> is disabled (the default), then C<encode>/C<decode> generate |
1140 | and expect Unicode strings, that is, characters with high ordinal Unicode |
1141 | and expect Unicode strings, that is, characters with high ordinal Unicode |
1141 | values (> 255) will be encoded as such characters, and likewise such |
1142 | values (> 255) will be encoded as such characters, and likewise such |
1142 | characters are decoded as-is, no canges to them will be done, except |
1143 | characters are decoded as-is, no changes to them will be done, except |
1143 | "(re-)interpreting" them as Unicode codepoints or Unicode characters, |
1144 | "(re-)interpreting" them as Unicode codepoints or Unicode characters, |
1144 | respectively (to Perl, these are the same thing in strings unless you do |
1145 | respectively (to Perl, these are the same thing in strings unless you do |
1145 | funny/weird/dumb stuff). |
1146 | funny/weird/dumb stuff). |
1146 | |
1147 | |
1147 | This is useful when you want to do the encoding yourself (e.g. when you |
1148 | This is useful when you want to do the encoding yourself (e.g. when you |
… | |
… | |
1263 | output for these property strings, e.g.: |
1264 | output for these property strings, e.g.: |
1264 | |
1265 | |
1265 | $json =~ s/"__proto__"\s*:/"__proto__renamed":/g; |
1266 | $json =~ s/"__proto__"\s*:/"__proto__renamed":/g; |
1266 | |
1267 | |
1267 | This works because C<__proto__> is not valid outside of strings, so every |
1268 | This works because C<__proto__> is not valid outside of strings, so every |
1268 | occurence of C<"__proto__"\s*:> must be a string used as property name. |
1269 | occurrence of C<"__proto__"\s*:> must be a string used as property name. |
1269 | |
1270 | |
1270 | If you know of other incompatibilities, please let me know. |
1271 | If you know of other incompatibilities, please let me know. |
1271 | |
1272 | |
1272 | |
1273 | |
1273 | =head2 JSON and YAML |
1274 | =head2 JSON and YAML |
… | |
… | |
1445 | process simulations - use fork, it's I<much> faster, cheaper, better). |
1446 | process simulations - use fork, it's I<much> faster, cheaper, better). |
1446 | |
1447 | |
1447 | (It might actually work, but you have been warned). |
1448 | (It might actually work, but you have been warned). |
1448 | |
1449 | |
1449 | |
1450 | |
|
|
1451 | =head1 THE PERILS OF SETLOCALE |
|
|
1452 | |
|
|
1453 | Sometimes people avoid the Perl locale support and directly call the |
|
|
1454 | system's setlocale function with C<LC_ALL>. |
|
|
1455 | |
|
|
1456 | This breaks both perl and modules such as JSON::XS, as stringification of |
|
|
1457 | numbers no longer works correctly (e.g. C<$x = 0.1; print "$x"+1> might |
|
|
1458 | print C<1>, and JSON::XS might output illegal JSON as JSON::XS relies on |
|
|
1459 | perl to stringify numbers). |
|
|
1460 | |
|
|
1461 | The solution is simple: don't call C<setlocale>, or use it for only those |
|
|
1462 | categories you need, such as C<LC_MESSAGES> or C<LC_CTYPE>. |
|
|
1463 | |
|
|
1464 | If you need C<LC_NUMERIC>, you should enable it only around the code that |
|
|
1465 | actually needs it (avoiding stringification of numbers), and restore it |
|
|
1466 | afterwards. |
|
|
1467 | |
|
|
1468 | |
1450 | =head1 BUGS |
1469 | =head1 BUGS |
1451 | |
1470 | |
1452 | While the goal of this module is to be correct, that unfortunately does |
1471 | While the goal of this module is to be correct, that unfortunately does |
1453 | not mean it's bug-free, only that I think its design is bug-free. If you |
1472 | not mean it's bug-free, only that I think its design is bug-free. If you |
1454 | keep reporting bugs they will be fixed swiftly, though. |
1473 | keep reporting bugs they will be fixed swiftly, though. |