Skip to content

napattack/md-ingestion

 
 

Repository files navigation

B2FIND Metadata Integration Tool

License

Description

Python library to harvest, map and upload research community specific metadata into B2FIND CKAN portal. Currently, supported metadata schemas are DublinCore, DataCite and ISO 19139.

Installation

Create conda env:

$ conda env create -f environment.yml
$ conda activate b2f

Install mdingestion:

$ python setup.py develop

Example with Darus Community

List available communities:

$ b2f list

Harvest:

$ b2f harvest -c darus

Files are written to oaidata/darus/raw.

Map:

$ b2f map -c darus

Files are written to oaidata/darus/ckan.

Check the validation result:

$ less summary/darus/2020-10-16_darus_summary.json

Upload:

$ b2f upload -c darus -i CKAN_HOST --auth AUTH_KEY

Run tests

Install pytest:

$ conda install pytest

Run all tests:

$ pytest tests/

Run single test:

$ pytest tests/community/test_darus.py

About

Ingestion (including OAI harvesting, semantic mapping (see repo 'md-mapping') and uploading to CKAN ) of metadata in the B2FIND portal

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 98.1%
  • Jupyter Notebook 1.9%