ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/JSON-XS/README
(Generate patch)

Comparing JSON-XS/README (file contents):
Revision 1.28 by root, Mon Sep 29 03:09:27 2008 UTC vs.
Revision 1.35 by root, Thu Mar 11 19:31:37 2010 UTC

20 $perl_scalar = $coder->decode ($unicode_json_text); 20 $perl_scalar = $coder->decode ($unicode_json_text);
21 21
22 # Note that JSON version 2.0 and above will automatically use JSON::XS 22 # Note that JSON version 2.0 and above will automatically use JSON::XS
23 # if available, at virtually no speed overhead either, so you should 23 # if available, at virtually no speed overhead either, so you should
24 # be able to just: 24 # be able to just:
25 25
26 use JSON; 26 use JSON;
27 27
28 # and do the same things, except that you have a pure-perl fallback now. 28 # and do the same things, except that you have a pure-perl fallback now.
29 29
30DESCRIPTION 30DESCRIPTION
31 This module converts Perl data structures to JSON and vice versa. Its 31 This module converts Perl data structures to JSON and vice versa. Its
56 does so, and even documents what "correct" means. 56 does so, and even documents what "correct" means.
57 57
58 * round-trip integrity 58 * round-trip integrity
59 59
60 When you serialise a perl data structure using only data types 60 When you serialise a perl data structure using only data types
61 supported by JSON, the deserialised data structure is identical on 61 supported by JSON and Perl, the deserialised data structure is
62 the Perl level. (e.g. the string "2.0" doesn't suddenly become "2" 62 identical on the Perl level. (e.g. the string "2.0" doesn't suddenly
63 just because it looks like a number). There minor *are* exceptions 63 become "2" just because it looks like a number). There *are* minor
64 to this, read the MAPPING section below to learn about those. 64 exceptions to this, read the MAPPING section below to learn about
65 those.
65 66
66 * strict checking of JSON correctness 67 * strict checking of JSON correctness
67 68
68 There is no guessing, no generating of illegal JSON texts by 69 There is no guessing, no generating of illegal JSON texts by
69 default, and only JSON is accepted as input by default (the latter 70 default, and only JSON is accepted as input by default (the latter
377 it is disabled, the same hash might be encoded differently even if 378 it is disabled, the same hash might be encoded differently even if
378 contains the same data, as key-value pairs have no inherent ordering 379 contains the same data, as key-value pairs have no inherent ordering
379 in Perl. 380 in Perl.
380 381
381 This setting has no effect when decoding JSON texts. 382 This setting has no effect when decoding JSON texts.
383
384 This setting has currently no effect on tied hashes.
382 385
383 $json = $json->allow_nonref ([$enable]) 386 $json = $json->allow_nonref ([$enable])
384 $enabled = $json->get_allow_nonref 387 $enabled = $json->get_allow_nonref
385 If $enable is true (or missing), then the "encode" method can 388 If $enable is true (or missing), then the "encode" method can
386 convert a non-reference into its corresponding string, number or 389 convert a non-reference into its corresponding string, number or
668 JSON objects or arrays, instead they must be concatenated 671 JSON objects or arrays, instead they must be concatenated
669 back-to-back. If an error occurs, an exception will be raised as in 672 back-to-back. If an error occurs, an exception will be raised as in
670 the scalar context case. Note that in this case, any 673 the scalar context case. Note that in this case, any
671 previously-parsed JSON texts will be lost. 674 previously-parsed JSON texts will be lost.
672 675
676 Example: Parse some JSON arrays/objects in a given string and return
677 them.
678
679 my @objs = JSON::XS->new->incr_parse ("[5][7][1,2]");
680
673 $lvalue_string = $json->incr_text 681 $lvalue_string = $json->incr_text
674 This method returns the currently stored JSON fragment as an lvalue, 682 This method returns the currently stored JSON fragment as an lvalue,
675 that is, you can manipulate it. This *only* works when a preceding 683 that is, you can manipulate it. This *only* works when a preceding
676 call to "incr_parse" in *scalar context* successfully returned an 684 call to "incr_parse" in *scalar context* successfully returned an
677 object. Under all other circumstances you must not call this 685 object. Under all other circumstances you must not call this
684 after a JSON object or b) parsing multiple JSON objects separated by 692 after a JSON object or b) parsing multiple JSON objects separated by
685 non-JSON text (such as commas). 693 non-JSON text (such as commas).
686 694
687 $json->incr_skip 695 $json->incr_skip
688 This will reset the state of the incremental parser and will remove 696 This will reset the state of the incremental parser and will remove
689 the parsed text from the input buffer. This is useful after 697 the parsed text from the input buffer so far. This is useful after
690 "incr_parse" died, in which case the input buffer and incremental 698 "incr_parse" died, in which case the input buffer and incremental
691 parser state is left unchanged, to skip the text parsed so far and 699 parser state is left unchanged, to skip the text parsed so far and
692 to reset the parse state. 700 to reset the parse state.
693 701
702 The difference to "incr_reset" is that only text until the parse
703 error occured is removed.
704
694 $json->incr_reset 705 $json->incr_reset
695 This completely resets the incremental parser, that is, after this 706 This completely resets the incremental parser, that is, after this
696 call, it will be as if the parser had never parsed anything. 707 call, it will be as if the parser had never parsed anything.
697 708
698 This is useful if you want ot repeatedly parse JSON objects and want 709 This is useful if you want to repeatedly parse JSON objects and want
699 to ignore any trailing data, which means you have to reset the 710 to ignore any trailing data, which means you have to reset the
700 parser after each successful decode. 711 parser after each successful decode.
701 712
702 LIMITATIONS 713 LIMITATIONS
703 All options that affect decoding are supported, except "allow_nonref". 714 All options that affect decoding are supported, except "allow_nonref".
888 Numbers containing a fractional or exponential part will always be 899 Numbers containing a fractional or exponential part will always be
889 represented as numeric (floating point) values, possibly at a loss 900 represented as numeric (floating point) values, possibly at a loss
890 of precision (in which case you might lose perfect roundtripping 901 of precision (in which case you might lose perfect roundtripping
891 ability, but the JSON number will still be re-encoded as a JSON 902 ability, but the JSON number will still be re-encoded as a JSON
892 number). 903 number).
904
905 Note that precision is not accuracy - binary floating point values
906 cannot represent most decimal fractions exactly, and when converting
907 from and to floating point, JSON::XS only guarantees precision up to
908 but not including the leats significant bit.
893 909
894 true, false 910 true, false
895 These JSON atoms become "JSON::XS::true" and "JSON::XS::false", 911 These JSON atoms become "JSON::XS::true" and "JSON::XS::false",
896 respectively. They are overloaded to act almost exactly like the 912 respectively. They are overloaded to act almost exactly like the
897 numbers 1 and 0. You can check whether a scalar is a JSON boolean by 913 numbers 1 and 0. You can check whether a scalar is a JSON boolean by
974 990
975 You can not currently force the type in other, less obscure, ways. 991 You can not currently force the type in other, less obscure, ways.
976 Tell me if you need this capability (but don't forget to explain why 992 Tell me if you need this capability (but don't forget to explain why
977 it's needed :). 993 it's needed :).
978 994
995 Note that numerical precision has the same meaning as under Perl (so
996 binary to decimal conversion follows the same rules as in Perl,
997 which can differ to other languages). Also, your perl interpreter
998 might expose extensions to the floating point numbers of your
999 platform, such as infinities or NaN's - these cannot be represented
1000 in JSON, and it is an error to pass those in.
1001
979ENCODING/CODESET FLAG NOTES 1002ENCODING/CODESET FLAG NOTES
980 The interested reader might have seen a number of flags that signify 1003 The interested reader might have seen a number of flags that signify
981 encodings or codesets - "utf8", "latin1" and "ascii". There seems to be 1004 encodings or codesets - "utf8", "latin1" and "ascii". There seems to be
982 some confusion on what these do, so here is a short comparison: 1005 some confusion on what these do, so here is a short comparison:
983 1006
1069 structure back. This is useful when your channel for JSON transfer 1092 structure back. This is useful when your channel for JSON transfer
1070 is not 8-bit clean or the encoding might be mangled in between (e.g. 1093 is not 8-bit clean or the encoding might be mangled in between (e.g.
1071 in mail), and works because ASCII is a proper subset of most 8-bit 1094 in mail), and works because ASCII is a proper subset of most 8-bit
1072 and multibyte encodings in use in the world. 1095 and multibyte encodings in use in the world.
1073 1096
1097 JSON and ECMAscript
1098 JSON syntax is based on how literals are represented in javascript (the
1099 not-standardised predecessor of ECMAscript) which is presumably why it
1100 is called "JavaScript Object Notation".
1101
1102 However, JSON is not a subset (and also not a superset of course) of
1103 ECMAscript (the standard) or javascript (whatever browsers actually
1104 implement).
1105
1106 If you want to use javascript's "eval" function to "parse" JSON, you
1107 might run into parse errors for valid JSON texts, or the resulting data
1108 structure might not be queryable:
1109
1110 One of the problems is that U+2028 and U+2029 are valid characters
1111 inside JSON strings, but are not allowed in ECMAscript string literals,
1112 so the following Perl fragment will not output something that can be
1113 guaranteed to be parsable by javascript's "eval":
1114
1115 use JSON::XS;
1116
1117 print encode_json [chr 0x2028];
1118
1119 The right fix for this is to use a proper JSON parser in your javascript
1120 programs, and not rely on "eval" (see for example Douglas Crockford's
1121 json2.js parser).
1122
1123 If this is not an option, you can, as a stop-gap measure, simply encode
1124 to ASCII-only JSON:
1125
1126 use JSON::XS;
1127
1128 print JSON::XS->new->ascii->encode ([chr 0x2028]);
1129
1130 Note that this will enlarge the resulting JSON text quite a bit if you
1131 have many non-ASCII characters. You might be tempted to run some regexes
1132 to only escape U+2028 and U+2029, e.g.:
1133
1134 # DO NOT USE THIS!
1135 my $json = JSON::XS->new->utf8->encode ([chr 0x2028]);
1136 $json =~ s/\xe2\x80\xa8/\\u2028/g; # escape U+2028
1137 $json =~ s/\xe2\x80\xa9/\\u2029/g; # escape U+2029
1138 print $json;
1139
1140 Note that *this is a bad idea*: the above only works for U+2028 and
1141 U+2029 and thus only for fully ECMAscript-compliant parsers. Many
1142 existing javascript implementations, however, have issues with other
1143 characters as well - using "eval" naively simply *will* cause problems.
1144
1145 Another problem is that some javascript implementations reserve some
1146 property names for their own purposes (which probably makes them
1147 non-ECMAscript-compliant). For example, Iceweasel reserves the
1148 "__proto__" property name for it's own purposes.
1149
1150 If that is a problem, you could parse try to filter the resulting JSON
1151 output for these property strings, e.g.:
1152
1153 $json =~ s/"__proto__"\s*:/"__proto__renamed":/g;
1154
1155 This works because "__proto__" is not valid outside of strings, so every
1156 occurence of ""__proto__"\s*:" must be a string used as property name.
1157
1158 If you know of other incompatibilities, please let me know.
1159
1074 JSON and YAML 1160 JSON and YAML
1075 You often hear that JSON is a subset of YAML. This is, however, a mass 1161 You often hear that JSON is a subset of YAML. This is, however, a mass
1076 hysteria(*) and very far from the truth (as of the time of this 1162 hysteria(*) and very far from the truth (as of the time of this
1077 writing), so let me state it clearly: *in general, there is no way to 1163 writing), so let me state it clearly: *in general, there is no way to
1078 configure JSON::XS to output a data structure as valid YAML* that works 1164 configure JSON::XS to output a data structure as valid YAML* that works
1085 my $yaml = $to_yaml->encode ($ref) . "\n"; 1171 my $yaml = $to_yaml->encode ($ref) . "\n";
1086 1172
1087 This will *usually* generate JSON texts that also parse as valid YAML. 1173 This will *usually* generate JSON texts that also parse as valid YAML.
1088 Please note that YAML has hardcoded limits on (simple) object key 1174 Please note that YAML has hardcoded limits on (simple) object key
1089 lengths that JSON doesn't have and also has different and incompatible 1175 lengths that JSON doesn't have and also has different and incompatible
1090 unicode handling, so you should make sure that your hash keys are 1176 unicode character escape syntax, so you should make sure that your hash
1091 noticeably shorter than the 1024 "stream characters" YAML allows and 1177 keys are noticeably shorter than the 1024 "stream characters" YAML
1092 that you do not have characters with codepoint values outside the 1178 allows and that you do not have characters with codepoint values outside
1093 Unicode BMP (basic multilingual page). YAML also does not allow "\/" 1179 the Unicode BMP (basic multilingual page). YAML also does not allow "\/"
1094 sequences in strings (which JSON::XS does not *currently* generate, but 1180 sequences in strings (which JSON::XS does not *currently* generate, but
1095 other JSON generators might). 1181 other JSON generators might).
1096 1182
1097 There might be other incompatibilities that I am not aware of (or the 1183 There might be other incompatibilities that I am not aware of (or the
1098 YAML specification has been changed yet again - it does so quite often). 1184 YAML specification has been changed yet again - it does so quite often).
1115 (which is not that difficult or long) and finally make YAML 1201 (which is not that difficult or long) and finally make YAML
1116 compatible to it, and educating users about the changes, instead of 1202 compatible to it, and educating users about the changes, instead of
1117 spreading lies about the real compatibility for many *years* and 1203 spreading lies about the real compatibility for many *years* and
1118 trying to silence people who point out that it isn't true. 1204 trying to silence people who point out that it isn't true.
1119 1205
1206 Addendum/2009: the YAML 1.2 spec is still incomaptible with JSON,
1207 even though the incompatibilities have been documented (and are
1208 known to Brian) for many years and the spec makes explicit claims
1209 that YAML is a superset of JSON. It would be so easy to fix, but
1210 apparently, bullying and corrupting userdata is so much easier.
1211
1120 SPEED 1212 SPEED
1121 It seems that JSON::XS is surprisingly fast, as shown in the following 1213 It seems that JSON::XS is surprisingly fast, as shown in the following
1122 tables. They have been generated with the help of the "eg/bench" program 1214 tables. They have been generated with the help of the "eg/bench" program
1123 in the JSON::XS distribution, to make it easy to compare on your own 1215 in the JSON::XS distribution, to make it easy to compare on your own
1124 system. 1216 system.
1127 single-line JSON string (also available at 1219 single-line JSON string (also available at
1128 <http://dist.schmorp.de/misc/json/short.json>). 1220 <http://dist.schmorp.de/misc/json/short.json>).
1129 1221
1130 {"method": "handleMessage", "params": ["user1", 1222 {"method": "handleMessage", "params": ["user1",
1131 "we were just talking"], "id": null, "array":[1,11,234,-5,1e5,1e7, 1223 "we were just talking"], "id": null, "array":[1,11,234,-5,1e5,1e7,
1132 true, false]} 1224 1, 0]}
1133 1225
1134 It shows the number of encodes/decodes per second (JSON::XS uses the 1226 It shows the number of encodes/decodes per second (JSON::XS uses the
1135 functional interface, while JSON::XS/2 uses the OO interface with 1227 functional interface, while JSON::XS/2 uses the OO interface with
1136 pretty-printing and hashkey sorting enabled, JSON::XS/3 enables shrink). 1228 pretty-printing and hashkey sorting enabled, JSON::XS/3 enables shrink.
1137 Higher is better: 1229 JSON::DWIW/DS uses the deserialise function, while JSON::DWIW::FJ uses
1230 the from_json method). Higher is better:
1138 1231
1139 module | encode | decode | 1232 module | encode | decode |
1140 -----------|------------|------------| 1233 --------------|------------|------------|
1141 JSON 1.x | 4990.842 | 4088.813 | 1234 JSON::DWIW/DS | 86302.551 | 102300.098 |
1142 JSON::DWIW | 51653.990 | 71575.154 | 1235 JSON::DWIW/FJ | 86302.551 | 75983.768 |
1143 JSON::PC | 65948.176 | 74631.744 | 1236 JSON::PP | 15827.562 | 6638.658 |
1144 JSON::PP | 8931.652 | 3817.168 | 1237 JSON::Syck | 63358.066 | 47662.545 |
1145 JSON::Syck | 24877.248 | 27776.848 | 1238 JSON::XS | 511500.488 | 511500.488 |
1146 JSON::XS | 388361.481 | 227951.304 | 1239 JSON::XS/2 | 291271.111 | 388361.481 |
1147 JSON::XS/2 | 227951.304 | 218453.333 | 1240 JSON::XS/3 | 361577.931 | 361577.931 |
1148 JSON::XS/3 | 338250.323 | 218453.333 | 1241 Storable | 66788.280 | 265462.278 |
1149 Storable | 16500.016 | 135300.129 |
1150 -----------+------------+------------+ 1242 --------------+------------+------------+
1151 1243
1152 That is, JSON::XS is about five times faster than JSON::DWIW on 1244 That is, JSON::XS is almost six times faster than JSON::DWIW on
1153 encoding, about three times faster on decoding, and over forty times 1245 encoding, about five times faster on decoding, and over thirty to
1154 faster than JSON, even with pretty-printing and key sorting. It also 1246 seventy times faster than JSON's pure perl implementation. It also
1155 compares favourably to Storable for small amounts of data. 1247 compares favourably to Storable for small amounts of data.
1156 1248
1157 Using a longer test string (roughly 18KB, generated from Yahoo! Locals 1249 Using a longer test string (roughly 18KB, generated from Yahoo! Locals
1158 search API (<http://dist.schmorp.de/misc/json/long.json>). 1250 search API (<http://dist.schmorp.de/misc/json/long.json>).
1159 1251
1160 module | encode | decode | 1252 module | encode | decode |
1161 -----------|------------|------------| 1253 --------------|------------|------------|
1162 JSON 1.x | 55.260 | 34.971 | 1254 JSON::DWIW/DS | 1647.927 | 2673.916 |
1163 JSON::DWIW | 825.228 | 1082.513 | 1255 JSON::DWIW/FJ | 1630.249 | 2596.128 |
1164 JSON::PC | 3571.444 | 2394.829 |
1165 JSON::PP | 210.987 | 32.574 | 1256 JSON::PP | 400.640 | 62.311 |
1166 JSON::Syck | 552.551 | 787.544 | 1257 JSON::Syck | 1481.040 | 1524.869 |
1167 JSON::XS | 5780.463 | 4854.519 | 1258 JSON::XS | 20661.596 | 9541.183 |
1168 JSON::XS/2 | 3869.998 | 4798.975 | 1259 JSON::XS/2 | 10683.403 | 9416.938 |
1169 JSON::XS/3 | 5862.880 | 4798.975 | 1260 JSON::XS/3 | 20661.596 | 9400.054 |
1170 Storable | 4445.002 | 5235.027 | 1261 Storable | 19765.806 | 10000.725 |
1171 -----------+------------+------------+ 1262 --------------+------------+------------+
1172 1263
1173 Again, JSON::XS leads by far (except for Storable which non-surprisingly 1264 Again, JSON::XS leads by far (except for Storable which non-surprisingly
1174 decodes faster). 1265 decodes a bit faster).
1175 1266
1176 On large strings containing lots of high Unicode characters, some 1267 On large strings containing lots of high Unicode characters, some
1177 modules (such as JSON::PC) seem to decode faster than JSON::XS, but the 1268 modules (such as JSON::PC) seem to decode faster than JSON::XS, but the
1178 result will be broken due to missing (or wrong) Unicode handling. Others 1269 result will be broken due to missing (or wrong) Unicode handling. Others
1179 refuse to decode or encode properly, so it was impossible to prepare a 1270 refuse to decode or encode properly, so it was impossible to prepare a
1214 information you might want to make sure that exceptions thrown by 1305 information you might want to make sure that exceptions thrown by
1215 JSON::XS will not end up in front of untrusted eyes. 1306 JSON::XS will not end up in front of untrusted eyes.
1216 1307
1217 If you are using JSON::XS to return packets to consumption by JavaScript 1308 If you are using JSON::XS to return packets to consumption by JavaScript
1218 scripts in a browser you should have a look at 1309 scripts in a browser you should have a look at
1219 <http://jpsykes.com/47/practical-csrf-and-json-security> to see whether 1310 <http://blog.archive.jpsykes.com/47/practical-csrf-and-json-security/>
1220 you are vulnerable to some common attack vectors (which really are 1311 to see whether you are vulnerable to some common attack vectors (which
1221 browser design bugs, but it is still you who will have to deal with it, 1312 really are browser design bugs, but it is still you who will have to
1222 as major browser developers care only for features, not about getting 1313 deal with it, as major browser developers care only for features, not
1223 security right). 1314 about getting security right).
1224 1315
1225THREADS 1316THREADS
1226 This module is *not* guaranteed to be thread safe and there are no plans 1317 This module is *not* guaranteed to be thread safe and there are no plans
1227 to change this until Perl gets thread support (as opposed to the 1318 to change this until Perl gets thread support (as opposed to the
1228 horribly slow so-called "threads" which are simply slow and bloated 1319 horribly slow so-called "threads" which are simply slow and bloated

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines