CKAN URL Identifier #107

Draft: wants to merge 6 commits into base: main

EvilDrPurple
Contributor

Description

Testing

  1. Check out the branch
  2. In the source_collectors/ckan folder, create a CSV file with a column labeled "url" and populate it with URLs. An example CSV file that may be used for testing is given below under "Sample CSV"
  3. In the root project directory, create a virtual environment and activate it
    python -m venv venv
    source venv/bin/activate
  4. Install the requirements
    pip install -r requirements.txt
  5. (Optional) Move to the ckan folder
    cd source_collectors/ckan
  6. Run the program, passing the CSV file as its argument
    python ckan_identifier.py urls.csv
  7. Output will be located in output.csv

Sample CSV:

url
https://catalog.data.gov/dataset/operation-aquila
https://github.com/Police-Data-Accessibility-Project/data-source-identification/compare/ckan-identifier?expand=1
https://docs.ckan.org/en/2.9/api/
https://data.gov/open-gov/
https://data.boston.gov/dataset/police-districts
https://open.jacksonms.gov/dataset/police-precinct-city-of-jackson-and-wards
https://duckduckgo.com/
https://docs.pola.rs/
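
For reviewers who would rather generate the test input than type it by hand, here is a minimal sketch that writes a CSV with the required "url" header and reads it back to confirm the layout. The helper names `write_url_csv` and `read_url_csv` are hypothetical and are not part of this PR. How ckan_identifier.py itself parses its input is not shown here, so this only illustrates the file format described in step 2.

```python
import csv

# A few URLs taken from the sample CSV above.
SAMPLE_URLS = [
    "https://catalog.data.gov/dataset/operation-aquila",
    "https://docs.ckan.org/en/2.9/api/",
    "https://data.boston.gov/dataset/police-districts",
    "https://duckduckgo.com/",
]


def write_url_csv(path, urls):
    """Write a one-column CSV whose header is "url" (hypothetical helper)."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["url"])
        writer.writerows([u] for u in urls)


def read_url_csv(path):
    """Return the non-empty values of the "url" column (hypothetical helper)."""
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        if reader.fieldnames is None or "url" not in reader.fieldnames:
            raise ValueError('CSV file must have a column labeled "url"')
        return [row["url"] for row in reader if row["url"]]


if __name__ == "__main__":
    write_url_csv("urls.csv", SAMPLE_URLS)
    print(f"Wrote {len(read_url_csv('urls.csv'))} URLs to urls.csv")
```

Running this from the source_collectors/ckan folder produces a urls.csv that can be passed to the program as in step 6.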
