ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/JSON-XS/README
(Generate patch)

Comparing JSON-XS/README (file contents):
Revision 1.31 by root, Sat Aug 8 10:06:02 2009 UTC vs.
Revision 1.36 by root, Wed Jul 27 15:53:40 2011 UTC

56 does so, and even documents what "correct" means. 56 does so, and even documents what "correct" means.
57 57
58 * round-trip integrity 58 * round-trip integrity
59 59
60 When you serialise a perl data structure using only data types 60 When you serialise a perl data structure using only data types
61 supported by JSON, the deserialised data structure is identical on 61 supported by JSON and Perl, the deserialised data structure is
62 the Perl level. (e.g. the string "2.0" doesn't suddenly become "2" 62 identical on the Perl level. (e.g. the string "2.0" doesn't suddenly
63 just because it looks like a number). There minor *are* exceptions 63 become "2" just because it looks like a number). There *are* minor
64 to this, read the MAPPING section below to learn about those. 64 exceptions to this, read the MAPPING section below to learn about
65 those.
65 66
66 * strict checking of JSON correctness 67 * strict checking of JSON correctness
67 68
68 There is no guessing, no generating of illegal JSON texts by 69 There is no guessing, no generating of illegal JSON texts by
69 default, and only JSON is accepted as input by default (the latter 70 default, and only JSON is accepted as input by default (the latter
635 calls). 636 calls).
636 637
637 JSON::XS will only attempt to parse the JSON text once it is sure it has 638 JSON::XS will only attempt to parse the JSON text once it is sure it has
638 enough text to get a decisive result, using a very simple but truly 639 enough text to get a decisive result, using a very simple but truly
639 incremental parser. This means that it sometimes won't stop as early as 640 incremental parser. This means that it sometimes won't stop as early as
640 the full parser, for example, it doesn't detect parenthese mismatches. 641 the full parser, for example, it doesn't detect mismatched parentheses.
641 The only thing it guarantees is that it starts decoding as soon as a 642 The only thing it guarantees is that it starts decoding as soon as a
642 syntactically valid JSON text has been seen. This means you need to set 643 syntactically valid JSON text has been seen. This means you need to set
643 resource limits (e.g. "max_size") to ensure the parser will stop parsing 644 resource limits (e.g. "max_size") to ensure the parser will stop parsing
644 in the presence if syntax errors. 645 in the presence if syntax errors.
645 646
669 otherwise. For this to work, there must be no separators between the 670 otherwise. For this to work, there must be no separators between the
670 JSON objects or arrays, instead they must be concatenated 671 JSON objects or arrays, instead they must be concatenated
671 back-to-back. If an error occurs, an exception will be raised as in 672 back-to-back. If an error occurs, an exception will be raised as in
672 the scalar context case. Note that in this case, any 673 the scalar context case. Note that in this case, any
673 previously-parsed JSON texts will be lost. 674 previously-parsed JSON texts will be lost.
675
676 Example: Parse some JSON arrays/objects in a given string and return
677 them.
678
679 my @objs = JSON::XS->new->incr_parse ("[5][7][1,2]");
674 680
675 $lvalue_string = $json->incr_text 681 $lvalue_string = $json->incr_text
676 This method returns the currently stored JSON fragment as an lvalue, 682 This method returns the currently stored JSON fragment as an lvalue,
677 that is, you can manipulate it. This *only* works when a preceding 683 that is, you can manipulate it. This *only* works when a preceding
678 call to "incr_parse" in *scalar context* successfully returned an 684 call to "incr_parse" in *scalar context* successfully returned an
893 Numbers containing a fractional or exponential part will always be 899 Numbers containing a fractional or exponential part will always be
894 represented as numeric (floating point) values, possibly at a loss 900 represented as numeric (floating point) values, possibly at a loss
895 of precision (in which case you might lose perfect roundtripping 901 of precision (in which case you might lose perfect roundtripping
896 ability, but the JSON number will still be re-encoded as a JSON 902 ability, but the JSON number will still be re-encoded as a JSON
897 number). 903 number).
904
905 Note that precision is not accuracy - binary floating point values
906 cannot represent most decimal fractions exactly, and when converting
907 from and to floating point, JSON::XS only guarantees precision up to
908 but not including the leats significant bit.
898 909
899 true, false 910 true, false
900 These JSON atoms become "JSON::XS::true" and "JSON::XS::false", 911 These JSON atoms become "JSON::XS::true" and "JSON::XS::false",
901 respectively. They are overloaded to act almost exactly like the 912 respectively. They are overloaded to act almost exactly like the
902 numbers 1 and 0. You can check whether a scalar is a JSON boolean by 913 numbers 1 and 0. You can check whether a scalar is a JSON boolean by
979 990
980 You can not currently force the type in other, less obscure, ways. 991 You can not currently force the type in other, less obscure, ways.
981 Tell me if you need this capability (but don't forget to explain why 992 Tell me if you need this capability (but don't forget to explain why
982 it's needed :). 993 it's needed :).
983 994
995 Note that numerical precision has the same meaning as under Perl (so
996 binary to decimal conversion follows the same rules as in Perl,
997 which can differ to other languages). Also, your perl interpreter
998 might expose extensions to the floating point numbers of your
999 platform, such as infinities or NaN's - these cannot be represented
1000 in JSON, and it is an error to pass those in.
1001
984ENCODING/CODESET FLAG NOTES 1002ENCODING/CODESET FLAG NOTES
985 The interested reader might have seen a number of flags that signify 1003 The interested reader might have seen a number of flags that signify
986 encodings or codesets - "utf8", "latin1" and "ascii". There seems to be 1004 encodings or codesets - "utf8", "latin1" and "ascii". There seems to be
987 some confusion on what these do, so here is a short comparison: 1005 some confusion on what these do, so here is a short comparison:
988 1006
1125 characters as well - using "eval" naively simply *will* cause problems. 1143 characters as well - using "eval" naively simply *will* cause problems.
1126 1144
1127 Another problem is that some javascript implementations reserve some 1145 Another problem is that some javascript implementations reserve some
1128 property names for their own purposes (which probably makes them 1146 property names for their own purposes (which probably makes them
1129 non-ECMAscript-compliant). For example, Iceweasel reserves the 1147 non-ECMAscript-compliant). For example, Iceweasel reserves the
1130 "__proto__" property name for it's own purposes. 1148 "__proto__" property name for its own purposes.
1131 1149
1132 If that is a problem, you could parse try to filter the resulting JSON 1150 If that is a problem, you could parse try to filter the resulting JSON
1133 output for these property strings, e.g.: 1151 output for these property strings, e.g.:
1134 1152
1135 $json =~ s/"__proto__"\s*:/"__proto__renamed":/g; 1153 $json =~ s/"__proto__"\s*:/"__proto__renamed":/g;
1153 my $yaml = $to_yaml->encode ($ref) . "\n"; 1171 my $yaml = $to_yaml->encode ($ref) . "\n";
1154 1172
1155 This will *usually* generate JSON texts that also parse as valid YAML. 1173 This will *usually* generate JSON texts that also parse as valid YAML.
1156 Please note that YAML has hardcoded limits on (simple) object key 1174 Please note that YAML has hardcoded limits on (simple) object key
1157 lengths that JSON doesn't have and also has different and incompatible 1175 lengths that JSON doesn't have and also has different and incompatible
1158 unicode handling, so you should make sure that your hash keys are 1176 unicode character escape syntax, so you should make sure that your hash
1159 noticeably shorter than the 1024 "stream characters" YAML allows and 1177 keys are noticeably shorter than the 1024 "stream characters" YAML
1160 that you do not have characters with codepoint values outside the 1178 allows and that you do not have characters with codepoint values outside
1161 Unicode BMP (basic multilingual page). YAML also does not allow "\/" 1179 the Unicode BMP (basic multilingual page). YAML also does not allow "\/"
1162 sequences in strings (which JSON::XS does not *currently* generate, but 1180 sequences in strings (which JSON::XS does not *currently* generate, but
1163 other JSON generators might). 1181 other JSON generators might).
1164 1182
1165 There might be other incompatibilities that I am not aware of (or the 1183 There might be other incompatibilities that I am not aware of (or the
1166 YAML specification has been changed yet again - it does so quite often). 1184 YAML specification has been changed yet again - it does so quite often).
1183 (which is not that difficult or long) and finally make YAML 1201 (which is not that difficult or long) and finally make YAML
1184 compatible to it, and educating users about the changes, instead of 1202 compatible to it, and educating users about the changes, instead of
1185 spreading lies about the real compatibility for many *years* and 1203 spreading lies about the real compatibility for many *years* and
1186 trying to silence people who point out that it isn't true. 1204 trying to silence people who point out that it isn't true.
1187 1205
1206 Addendum/2009: the YAML 1.2 spec is still incompatible with JSON,
1207 even though the incompatibilities have been documented (and are
1208 known to Brian) for many years and the spec makes explicit claims
1209 that YAML is a superset of JSON. It would be so easy to fix, but
1210 apparently, bullying people and corrupting userdata is so much
1211 easier.
1212
1188 SPEED 1213 SPEED
1189 It seems that JSON::XS is surprisingly fast, as shown in the following 1214 It seems that JSON::XS is surprisingly fast, as shown in the following
1190 tables. They have been generated with the help of the "eg/bench" program 1215 tables. They have been generated with the help of the "eg/bench" program
1191 in the JSON::XS distribution, to make it easy to compare on your own 1216 in the JSON::XS distribution, to make it easy to compare on your own
1192 system. 1217 system.
1195 single-line JSON string (also available at 1220 single-line JSON string (also available at
1196 <http://dist.schmorp.de/misc/json/short.json>). 1221 <http://dist.schmorp.de/misc/json/short.json>).
1197 1222
1198 {"method": "handleMessage", "params": ["user1", 1223 {"method": "handleMessage", "params": ["user1",
1199 "we were just talking"], "id": null, "array":[1,11,234,-5,1e5,1e7, 1224 "we were just talking"], "id": null, "array":[1,11,234,-5,1e5,1e7,
1200 true, false]} 1225 1, 0]}
1201 1226
1202 It shows the number of encodes/decodes per second (JSON::XS uses the 1227 It shows the number of encodes/decodes per second (JSON::XS uses the
1203 functional interface, while JSON::XS/2 uses the OO interface with 1228 functional interface, while JSON::XS/2 uses the OO interface with
1204 pretty-printing and hashkey sorting enabled, JSON::XS/3 enables shrink). 1229 pretty-printing and hashkey sorting enabled, JSON::XS/3 enables shrink.
1205 Higher is better: 1230 JSON::DWIW/DS uses the deserialise function, while JSON::DWIW::FJ uses
1231 the from_json method). Higher is better:
1206 1232
1207 module | encode | decode | 1233 module | encode | decode |
1208 -----------|------------|------------| 1234 --------------|------------|------------|
1209 JSON 1.x | 4990.842 | 4088.813 | 1235 JSON::DWIW/DS | 86302.551 | 102300.098 |
1210 JSON::DWIW | 51653.990 | 71575.154 | 1236 JSON::DWIW/FJ | 86302.551 | 75983.768 |
1211 JSON::PC | 65948.176 | 74631.744 | 1237 JSON::PP | 15827.562 | 6638.658 |
1212 JSON::PP | 8931.652 | 3817.168 | 1238 JSON::Syck | 63358.066 | 47662.545 |
1213 JSON::Syck | 24877.248 | 27776.848 | 1239 JSON::XS | 511500.488 | 511500.488 |
1214 JSON::XS | 388361.481 | 227951.304 | 1240 JSON::XS/2 | 291271.111 | 388361.481 |
1215 JSON::XS/2 | 227951.304 | 218453.333 | 1241 JSON::XS/3 | 361577.931 | 361577.931 |
1216 JSON::XS/3 | 338250.323 | 218453.333 | 1242 Storable | 66788.280 | 265462.278 |
1217 Storable | 16500.016 | 135300.129 |
1218 -----------+------------+------------+ 1243 --------------+------------+------------+
1219 1244
1220 That is, JSON::XS is about five times faster than JSON::DWIW on 1245 That is, JSON::XS is almost six times faster than JSON::DWIW on
1221 encoding, about three times faster on decoding, and over forty times 1246 encoding, about five times faster on decoding, and over thirty to
1222 faster than JSON, even with pretty-printing and key sorting. It also 1247 seventy times faster than JSON's pure perl implementation. It also
1223 compares favourably to Storable for small amounts of data. 1248 compares favourably to Storable for small amounts of data.
1224 1249
1225 Using a longer test string (roughly 18KB, generated from Yahoo! Locals 1250 Using a longer test string (roughly 18KB, generated from Yahoo! Locals
1226 search API (<http://dist.schmorp.de/misc/json/long.json>). 1251 search API (<http://dist.schmorp.de/misc/json/long.json>).
1227 1252
1228 module | encode | decode | 1253 module | encode | decode |
1229 -----------|------------|------------| 1254 --------------|------------|------------|
1230 JSON 1.x | 55.260 | 34.971 | 1255 JSON::DWIW/DS | 1647.927 | 2673.916 |
1231 JSON::DWIW | 825.228 | 1082.513 | 1256 JSON::DWIW/FJ | 1630.249 | 2596.128 |
1232 JSON::PC | 3571.444 | 2394.829 |
1233 JSON::PP | 210.987 | 32.574 | 1257 JSON::PP | 400.640 | 62.311 |
1234 JSON::Syck | 552.551 | 787.544 | 1258 JSON::Syck | 1481.040 | 1524.869 |
1235 JSON::XS | 5780.463 | 4854.519 | 1259 JSON::XS | 20661.596 | 9541.183 |
1236 JSON::XS/2 | 3869.998 | 4798.975 | 1260 JSON::XS/2 | 10683.403 | 9416.938 |
1237 JSON::XS/3 | 5862.880 | 4798.975 | 1261 JSON::XS/3 | 20661.596 | 9400.054 |
1238 Storable | 4445.002 | 5235.027 | 1262 Storable | 19765.806 | 10000.725 |
1239 -----------+------------+------------+ 1263 --------------+------------+------------+
1240 1264
1241 Again, JSON::XS leads by far (except for Storable which non-surprisingly 1265 Again, JSON::XS leads by far (except for Storable which non-surprisingly
1242 decodes faster). 1266 decodes a bit faster).
1243 1267
1244 On large strings containing lots of high Unicode characters, some 1268 On large strings containing lots of high Unicode characters, some
1245 modules (such as JSON::PC) seem to decode faster than JSON::XS, but the 1269 modules (such as JSON::PC) seem to decode faster than JSON::XS, but the
1246 result will be broken due to missing (or wrong) Unicode handling. Others 1270 result will be broken due to missing (or wrong) Unicode handling. Others
1247 refuse to decode or encode properly, so it was impossible to prepare a 1271 refuse to decode or encode properly, so it was impossible to prepare a
1282 information you might want to make sure that exceptions thrown by 1306 information you might want to make sure that exceptions thrown by
1283 JSON::XS will not end up in front of untrusted eyes. 1307 JSON::XS will not end up in front of untrusted eyes.
1284 1308
1285 If you are using JSON::XS to return packets to consumption by JavaScript 1309 If you are using JSON::XS to return packets to consumption by JavaScript
1286 scripts in a browser you should have a look at 1310 scripts in a browser you should have a look at
1287 <http://jpsykes.com/47/practical-csrf-and-json-security> to see whether 1311 <http://blog.archive.jpsykes.com/47/practical-csrf-and-json-security/>
1288 you are vulnerable to some common attack vectors (which really are 1312 to see whether you are vulnerable to some common attack vectors (which
1289 browser design bugs, but it is still you who will have to deal with it, 1313 really are browser design bugs, but it is still you who will have to
1290 as major browser developers care only for features, not about getting 1314 deal with it, as major browser developers care only for features, not
1291 security right). 1315 about getting security right).
1292 1316
1293THREADS 1317THREADS
1294 This module is *not* guaranteed to be thread safe and there are no plans 1318 This module is *not* guaranteed to be thread safe and there are no plans
1295 to change this until Perl gets thread support (as opposed to the 1319 to change this until Perl gets thread support (as opposed to the
1296 horribly slow so-called "threads" which are simply slow and bloated 1320 horribly slow so-called "threads" which are simply slow and bloated

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines