This repository has been archived by the owner on Jan 5, 2023. It is now read-only.

Manual Script Needed to Clear DOI Source When There are Multiple #497

Open
thejuliekramer opened this issue Oct 22, 2020 · 0 comments

thejuliekramer commented Oct 22, 2020

A manual script is needed to clear the data associated with the doi-open-data harvest source after the source is deleted using the UI and a different doi-open-data source is created and harvested. We want to figure out why and avoid this.

How to reproduce

  1. Import harvest source doi-open-data - harvest all the datasets
  2. Delete the harvest source
  3. Import harvest source doi-open-data again and harvest all the datasets again
  4. Observe errors while harvesting the datasets

[Screenshot: Screen Shot 2020-10-22 at 8 16 56 AM]

Expected behavior

You should be able to (1) delete a harvest source, (2) re-import it, and (3) harvest it without any issue: the deleted harvest source's datasets should not become mismatched in the database or need to be cleared manually.

https://github.com/ckan/ckanext-harvest/blob/master/ckanext/harvest/logic/action/update.py#L175

Actual behavior

Datasets become mismatched. Old harvest sources and their associated objects need to be cleared using this script.

Thoughts

The manual part of this script is already run when a source is cleared using the Clear button in the UI or the clearsource command-line command from ckanext-harvest, so it may just be that we need to clear the source in the UI before deleting it.
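
If clearing before deleting does turn out to be the fix, the ordering could be sketched roughly as below. This is only an illustration, not the script referred to above. It assumes the ckanext-harvest action names `harvest_source_clear` and `harvest_source_delete` are available on the instance (check them against the installed version), and `clear_then_delete_source` and `call_action` are hypothetical names for a helper and for whatever function invokes a CKAN action.

```python
from typing import Any, Callable, Dict, List

# Hypothetical type: any callable that invokes a CKAN action by name with a
# data dict, e.g. a thin wrapper around the CKAN action API.
ActionCaller = Callable[[str, Dict[str, Any]], Any]


def clear_then_delete_source(call_action: ActionCaller, source_id: str) -> List[str]:
    """Clear a harvest source's datasets and objects, then delete the source.

    Assumption: "harvest_source_clear" and "harvest_source_delete" are the
    ckanext-harvest action names; verify them for your installed version.
    """
    steps = ["harvest_source_clear", "harvest_source_delete"]
    for action in steps:
        # Clearing runs first so no harvested objects are left behind
        # pointing at a source that no longer exists.
        call_action(action, {"id": source_id})
    return steps


if __name__ == "__main__":
    # Dry run with a stand-in caller that only prints what would be invoked.
    def print_only(action: str, data: Dict[str, Any]) -> None:
        print(action, data)

    clear_then_delete_source(print_only, "doi-open-data")
```

In real use, `call_action` could be the `call_action` method of a `ckanapi` client pointed at the CKAN instance; that wiring is left out here because it depends on the deployment.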

@thejuliekramer changed the title from "Manual Script Needed to Clear DOI Source" to "Manual Script Needed to Clear DOI Source When There are Multiple" on Oct 22, 2020
@mogul moved this to Icebox in data.gov team board on Mar 23, 2022