[ViewVC] Diff of: cvs/Geo-LatLon2Place/bin/geo-latlon2place-makedb

Comparing Geo-LatLon2Place/bin/geo-latlon2place-makedb (file contents):
Revision 1.1 by root, Mon Mar 14 02:41:52 2022 UTC vs.
Revision 1.2 by root, Mon Mar 14 03:26:20 2022 UTC

 The extraction method: the default is C<geonames>, which expects a
 geonames database (L<https://download.geonames.org/export/dump/>, for
 example F<DE.txt>, F<cities500.txt> or F<allCountries.txt>) and extracts
 I<placename, countrycode> strings from it.
-The method C<geonames-postalcodes> (not yet implemented)
+The method C<geonames-postalcodes> does the same, but for a geonames
-does the same, but for a geonames postal code database
-L<https://download.geonames.org/export/zip>.
+postal code database L<https://download.geonames.org/export/zip>, and
+extracts C<zip name, countrycopde> strings.
 Lastly, you can specify a perl fragment that implements your own filtering
 and extraction.
 =back
 in C<$_>. The file is opened using the C<:perlio> layer, so if your input
 file is in UTF-8, so will be C<$_>.
 For example, the following would expect an input file with space separated
 latitude, longitude, weight and name, where name can contain spaces, which
-is useful when you wat to provide your own input data:
+is useful when you want to provide your own input data:
    geo-latlon2place-makedb --extract 'chomp; split / /, 4' input output
 A slighly more verbose example expecting only latitude, longitude and a
 name would be:
 weight, these should be self-explaining. The weight is used during search
 and will be multiplied to the square of the distance, and is used to make
 larger cities win over small ones when the coordinate is somewhere between
 them.
+The standard extractors (C<geonames> and C<geonames-postalcodes>) provide
+a UTF-8-encoded string as blob, but any binary data will do, for example,
+if you want to associate your coordinate pairs with some short-ish
+integer codes, you could do this:
+   geo-latlon2place-makedb --extract '
+      chomp;
+      my ($lat, $lon, $id) = split / /, 4;
+      ($lat, $lon, 1, pack "w", $id)
+   ' input output
+And later use C<unpack "w"> on the data returned by C<lookup>.
-The C<geonames> filter looks similar to this fragment, which shows off
+The C<geonames> filter looks similar to the following fragment, which
-more possibilities:
+shows off some more filtering possibilities:
    my ($id, $name, undef, undef, $lat, $lon, $t1, $t2, $cc, undef, $a1, $s2, $a3, $a4, $pop, undef) = split /\t/;
    return if $t1 ne "P"; # only places
    # actually place names, so ignore very long names
    60 > length $name
       or return;
    # we estimate a weight by dividing 25 by the radius of the place,
-   # which we get by assuming a fixed population density of 5000 people/km²,
+   # which we get by assuming a fixed population density of 5000 # people
-   # which is almost always a considerable over-estimate.
+   # per square km, # which is almost always a considerable over-estimate.
    # 25 and 5000 are pretty much made-up, feel free to improve and
    # send me the results.
    my $w = 25 / (1 + sqrt $pop / 5000);
    # administrative centers get a fixed low weight

Diff Legend

-–
+Removed lines
-+
+Added lines
-<
+Changed lines
->
+Changed lines

Comparing Geo-LatLon2Place/bin/geo-latlon2place-makedb (file contents): Revision 1.1 by root, Mon Mar 14 02:41:52 2022 UTC vs. Revision 1.2 by root, Mon Mar 14 03:26:20 2022 UTC

Diff Legend

Comparing Geo-LatLon2Place/bin/geo-latlon2place-makedb (file contents):
Revision 1.1 by root, Mon Mar 14 02:41:52 2022 UTC vs.
Revision 1.2 by root, Mon Mar 14 03:26:20 2022 UTC