parse-genbank-location
should warn about region/locality mix ups
#1578
Labels
enhancement
New feature or request
Currently,
parse-genbank-location
strictly follows GenBank's documented pattern for geo_loc_name:augur/augur/curate/parse_genbank_location.py
Lines 19 to 23 in 66e903a
However, the GenBank records don't always follow this pattern as shown in nextstrain/rabies#10.
We've previously done this in ncov-ingest specifically for USA locations by checking for US state codes but we can do a more generalized check with something like pycountry. If there is a region/locality mix-up, the command should emit a warning with instructions on how to fix this with
apply-geolocation-rules
.The text was updated successfully, but these errors were encountered: