This harvester uses the OAI-PMH interface of openBis to harvest metadata.
Use pip
to install this plugin. This example installs it in /vagrant
source /home/www-data/pyenv/bin/activate
pip install -e git+https://github.com/openresearchdata/ckanext-openbis.git#egg=ckanext-openbis --src /vagrant
cd /vagrant/ckanext-openbis
pip install -r requirements.txt
python setup.py develop
Make sure the ckanext-oaipmh and ckanext-harvest extension are installed as well.
- add
openbis_harvester
tockan.plugins
indevelopment.ini
(orproduction.ini
) - restart your webserver
- with the web browser go to
<your ckan url>/harvest/new
- as URL fill in the base URL of an OAI-PMH conforming openBis instance
- select Source type
openBis
- if your OAI-PMH needs credentials, add the following to the "Configuration" section:
{"username": "foo", "password": "bar" }
- if you only want to harvest a specific set, add the following to the "Configuration" section:
{"set": "baz"}
- Save
- on the harvest admin click Reharvest
On the command line do this:
- activate the python environment
cd
to the ckan directory, e.g./usr/lib/ckan/default/src/ckan
- start the consumers (NOTE: only run 1 gather and 1 fetch consumer per server):
paster --plugin=ckanext-openbis harvester gather_consumer &
paster --plugin=ckanext-openbis harvester fetch_consumer &
-
run the job:
paster --plugin=ckanext-openbis harvester run
The harvester should now start and import the OAI-PMH metadata.
To make it easier to develop, tests are setup that allow to do that:
. ~/default/bin/activate
cd /vagrant/ckanext-openbis
In this example the logging filter is used to only show messages of the harvester.