Skip to content

isawnyu/pleiades_data_quality

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pleiades_data_quality

Generate reporting data on select quality-related aspects of the Pleiades gazetteer dataset.

This code expects to be run against Pleiades JSON data such as that found in https://github.com/isawnyu/pleiades.datasets/tree/main/data/json and produces report output such as that found in https://github.com/isawnyu/pleiades.datasets/tree/main/data/data_quality

how to run it:

generate reporting data, all of which is saved to issues.json

python scripts/report.py ../pleiades.datasets/data/json/ ../pleiades.datasets/data/data_quality/

create CSV files containing issue-specific reporting data pulled from issues.json

python scripts/issues2csv.py ../pleiades.datasets/data/data_quality/issues.json

quick tip

To get an overview of how many issues there are in any given category, see the "summary" key in issues.json.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages