Distinguish facility IDs from building IDs and start indexing both #5

Open
Abbe98 opened this issue Nov 28, 2016 · 0 comments

Probably happening in late December (required for Kyrkosok/web-client#23).

  • Figure out the exact value of the BBR ID change break point. See Template:BBR-länk for estimated values, then loop over HTTP requests to find the exact one. Once done, update the template too.

  • Run the kulturarvsdata-prefer-rdf.py bot.

  • Check for duplicate statements (there should be none or very few). I have seen a tool for this task over at Tool Labs.

  • Start indexing the WLM lists on sv.wikipedia.org to a CSV or SQLite file (index only WP articles and BBR URIs?).

  • Check this list against existing data in Wikidata. Look for conflicts and for data that exists only in Wikidata (which should not be the case).

  • Fix any data that needs fixing.

  • Add Wikipedia articles for all the WLM BBR items missing one (if Geonames can be a source for bot-created articles, anything can be a source).

  • Index a new CSV or SQLite file from the WLM tables.

  • Import all the missing data to Wikidata.

  • Start indexing both facility and building IDs (this breaks the API). Use the "BBR ID change break point"; if it is a fuzzy one, create a buffer where all IDs get verified using HTTP requests (the way all IDs are currently validated).

  • Add all the Wikidata IDs to the WLM lists on sv.wikipedia.org and notify the folks over at Phabricator. Research how to parse and process wikitext tables <-- new to me
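
The break point search and the "buffer" idea from the list above could look roughly like the sketch below. It assumes, purely for illustration, that every ID below the break point is a facility and every ID from it onwards is a building, so a binary search needs far fewer HTTP requests than a linear loop. `find_break_point`, `classify`, and the `is_building` callback are hypothetical names; the callback stands in for whatever HTTP check is actually used, which is not shown here.

```python
from typing import Callable


def find_break_point(lo: int, hi: int, is_building: Callable[[int], bool]) -> int:
    """Return the smallest ID in [lo, hi] for which is_building(id) is True.

    Assumption: is_building is False for every ID below the break point,
    True for every ID from it onwards, and True for `hi`.
    In practice is_building would wrap an HTTP request (not shown).
    """
    while lo < hi:
        mid = (lo + hi) // 2
        if is_building(mid):
            hi = mid        # break point is at mid or below
        else:
            lo = mid + 1    # break point is above mid
    return lo


def classify(bbr_id: int, break_point: int, buffer: int = 0) -> str:
    """Classify an ID, sending anything inside the buffer to manual verification.

    Assumption: IDs below the break point are facilities, the rest buildings.
    """
    if break_point - buffer <= bbr_id < break_point + buffer:
        return "verify"  # fuzzy zone: validate with an HTTP request instead
    return "building" if bbr_id >= break_point else "facility"
```

If the break point turns out to be fuzzy, the binary search result is only an estimate, and `buffer` should be chosen wide enough to cover the mixed range.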
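
For the wikitext table research, here is a minimal sketch of reading a plain wikitext table into rows and writing them out as CSV, using only the standard library. It assumes the simplest table syntax (`{|`, `|-`, `||`, `!!`, `|}`) with no per-cell attributes and no nested templates. The actual WLM lists may be built from row templates instead, in which case a dedicated parser such as mwparserfromhell is probably a better fit. `parse_wikitext_table` and `rows_to_csv` are hypothetical helper names.

```python
import csv
import io
from typing import List


def parse_wikitext_table(text: str) -> List[List[str]]:
    """Parse the first simple wikitext table in `text` into a list of rows."""
    rows: List[List[str]] = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("{|") or line.startswith("|+"):
            continue                      # table start or caption
        if line.startswith("|}"):
            break                         # table end
        if line.startswith("|-"):
            if current:
                rows.append(current)      # close the previous row
            current = []
            continue
        if line.startswith("!"):
            cells = line[1:].split("!!")  # header cells
        elif line.startswith("|"):
            cells = line[1:].split("||")  # data cells
        else:
            continue                      # ignore anything else
        if current is None:
            current = []
        current.extend(cell.strip() for cell in cells)
    if current:
        rows.append(current)
    return rows


def rows_to_csv(rows: List[List[str]]) -> str:
    """Serialize rows to CSV text."""
    buf = io.StringIO()
    csv.writer(buf).writerows(rows)
    return buf.getvalue()


# Made-up sample data, not taken from a real WLM list.
SAMPLE = """{| class="wikitable"
! Name !! BBR
|-
| [[Foo kyrka]] || 123
|-
| Bar || 456
|}"""
```

Calling `rows_to_csv(parse_wikitext_table(SAMPLE))` gives CSV text that can be saved to a file or loaded into SQLite for comparison against Wikidata.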
