Opened 13 years ago
Last modified 7 years ago
#1461 new defect
Tiger Geocoder doesn't anticipate irregular spacing in road name
Reported by: | arencambre | Owned by: | robe |
---|---|---|---|
Priority: | medium | Milestone: | PostGIS Fund Me |
Component: | pagc_address_parser | Version: | master |
Keywords: | Cc: | aren@…, woodbri |
Description
The tx_edges table uses I- 635 instead of the more consistent I-635.
This looks weird but works:
SELECT ST_AsEWKT(geomout) FROM geocode_intersection('N. Belt Line', 'I- 635', 'TX', 'Coppell') ORDER BY rating ASC LIMIT 1;
This looks correct but doesn't work:
SELECT ST_AsEWKT(geomout) FROM geocode_intersection('N. Belt Line', 'I-635', 'TX', 'Coppell') ORDER BY rating ASC LIMIT 1;
It seems the geocoder needs to work around these kinds of spacing inconsistencies in the Tiger data.
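To illustrate the kind of workaround meant here, the following is a minimal sketch of comparing road names on a spacing-insensitive key. It is written in Python purely for illustration; the helper name `spacing_key` is hypothetical and is not part of the Tiger geocoder or PAGC.

```python
import re


def spacing_key(name: str) -> str:
    """Hypothetical helper: build a comparison key that ignores irregular spacing.

    Uppercases the name, collapses runs of whitespace to one space,
    and removes any whitespace around hyphens.
    """
    collapsed = re.sub(r"\s+", " ", name.strip().upper())
    return re.sub(r"\s*-\s*", "-", collapsed)


# Both spellings from the report reduce to the same key.
print(spacing_key("I- 635") == spacing_key("I-635"))
```

Applying such a key to both the input and the stored name would let either spelling match, at the cost of also merging names where the spacing around a hyphen is actually significant.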
Change History (10)
comment:1 by , 13 years ago
Milestone: | PostGIS 2.0.0 → PostGIS 2.1.0 |
---|
comment:2 by , 13 years ago
Milestone: | PostGIS 2.1.0 → PostGIS 2.0.1 |
---|
comment:3 by , 12 years ago
Milestone: | PostGIS 2.0.1 → PostGIS 2.1.0 |
---|
comment:5 by , 12 years ago
Component: | tiger geocoder → pagc_address_parser |
---|
comment:6 by , 12 years ago
I should correct my last comment: PAGC still parses based on tokens, so a name like "SUN VALLEY" will parse as two tokens and "SUNVALLEY" will parse as one token. In my geocoder, I handle this under the fuzzy search by joining all the name tokens, then picking the best match to the input by scoring the results.
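The join-then-score idea can be sketched roughly as follows. This is not the commenter's implementation: the function names `joined_key` and `best_match` are hypothetical, the scoring uses Python's standard `difflib.SequenceMatcher` as a stand-in for whatever scoring the real geocoder applies, and it joins every whitespace-separated token rather than only tokens classified as part of the name.

```python
from difflib import SequenceMatcher


def joined_key(name: str) -> str:
    """Hypothetical helper: uppercase the name and join its tokens with no spaces."""
    return "".join(name.upper().split())


def best_match(query: str, candidates: list[str]):
    """Return (score, candidate) for the candidate whose joined key is most
    similar to the query's joined key, or None if there are no candidates.

    The similarity ratio here is an assumed stand-in scoring function.
    """
    q = joined_key(query)
    scored = [
        (SequenceMatcher(None, q, joined_key(c)).ratio(), c)
        for c in candidates
    ]
    return max(scored, key=lambda pair: pair[0]) if scored else None


# "SUNVALLEY" and "SUN VALLEY" share the same joined key, so they score 1.0.
print(best_match("SUNVALLEY", ["SUN VALLEY", "SUNSET", "VALLEY VIEW"]))
```

Because the comparison happens on the joined form, the number of tokens the user typed no longer matters, but every candidate has to be scored, which is where the cost of this approach comes from.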
comment:7 by , 12 years ago
Also, I have found at least one very pathological case where the name is "MAINSTREET": if it is entered as "MAIN STREET", it is impossible to match, because "STREET" is classified as a SUFFIX_TYPE token and "MAIN" is too short to match the fuzzy key of "MAINSTREET".
There are ways to find this, but they tend to make everything else slower and return too many unwanted results. -- Good Times!
comment:8 by , 12 years ago
Milestone: | PostGIS 2.1.0 → PostGIS 2.2.0 |
---|
comment:9 by , 9 years ago
Milestone: | PostGIS 2.2.0 → PostGIS Future |
---|
Yeah, this may not be that trivial: spacing is used to designate separation of elements, so putting logic like this in is liable to break something else without some extensive testing. I'll push to 2.1.0 but may get to it before then.