Generate reporting data on select quality-related aspects of the Pleiades gazetteer dataset.
This code expects to be run against Pleiades JSON data such as that found in https://github.com/isawnyu/pleiades.datasets/tree/main/data/json and produces report output such as that found in https://github.com/isawnyu/pleiades.datasets/tree/main/data/data_quality
python scripts/report.py ../pleiades.datasets/data/json/ ../pleiades.datasets/data/data_quality/
python scripts/issues2csv.py ../pleiades.datasets/data/data_quality/issues.json
To get an overview of how many issues there are in any given category, see the "summary" key in issues.json.