scoregraph

This branch was created from the original sources: https://github.com/behas/scoregraph

A collection of scripts for transforming ONB music score metadata into a semantically enriched knowledge graph of music scores.

Data aggregation

Starting from the ONB's raw music score dataset (Aleph export), data aggregation involves the following steps:

Data normalization: extract relevant fields from the raw Aleph data (example) and transform them to JSON (example)
Data enrichment (example):
- follow GND links in the raw/normalized data and collect additional URIs (e.g., DBpedia, VIAF)
- find related Europeana items by (i) searching via the Europeana search API and (ii) keeping only those items that share at least one URI with the raw/normalized data
Statistics computation: (example)
- id: aleph document id
- links_artwork: number of artwork links
- persons: number of persons mentioned in metadata record
- links_person_gnd: number of persons linked to GND
- links_person_dbpedia: number of persons linked to DBpedia
- links_person_viaf: number of persons linked to VIAF
- related_europeana_items: number of possibly related Europeana items
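
As a rough illustration of the statistics step, the sketch below counts the listed fields for a single enriched record. The input layout (keys such as `artwork_links`, `persons`, `gnd`, `dbpedia`, `viaf`) and the function name `compute_stats` are assumptions made for this example; the actual JSON produced by the scripts may be structured differently.

```python
def compute_stats(record):
    """Compute per-record link statistics.

    The input keys used here are hypothetical; adapt them to the
    structure of the enriched JSON files.
    """
    persons = record.get("persons", [])

    def linked(key):
        # Count persons that carry a non-empty link of the given kind.
        return sum(1 for person in persons if person.get(key))

    return {
        "id": record.get("id"),
        "links_artwork": len(record.get("artwork_links", [])),
        "persons": len(persons),
        "links_person_gnd": linked("gnd"),
        "links_person_dbpedia": linked("dbpedia"),
        "links_person_viaf": linked("viaf"),
        "related_europeana_items": len(record.get("related_europeana_items", [])),
    }


# Placeholder record with made-up identifiers, for illustration only.
record = {
    "id": "AL0001",
    "artwork_links": ["http://example.org/artwork/1"],
    "persons": [
        {
            "name": "Person A",
            "gnd": "http://d-nb.info/gnd/0",
            "dbpedia": "http://dbpedia.org/resource/Person_A",
            "viaf": None,
        },
        {"name": "Person B"},
    ],
    "related_europeana_items": [],
}

stats = compute_stats(record)
print(stats)
```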

Install dependencies:

pip install -r requirements.txt

Enable script execution

chmod u+x *.py

Place the dataset, in XML format, in the folder 'data/raw'.

Start workflow analysis

python analyze.py data

Run data normalization script

./normalize.py -o data/normalized data/raw/*.xml
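
To give an idea of what normalization means here, the following minimal sketch extracts a few fields from an XML record and emits JSON. The XML layout (`record`, `field`, the `name` and `gnd` attributes), the sample values, and the function name `normalize` are all hypothetical; the real Aleph export and the output of the normalization script will differ.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical input; the real Aleph export uses a different schema.
SAMPLE_XML = """<record id="AL0001">
  <field name="title">Example Score</field>
  <field name="person" gnd="http://d-nb.info/gnd/0">Example Composer</field>
</record>"""


def normalize(xml_text):
    """Extract the id, title, and persons from one XML record."""
    root = ET.fromstring(xml_text)
    doc = {"id": root.get("id"), "title": None, "persons": []}
    for field in root.iter("field"):
        name = field.get("name")
        if name == "title":
            doc["title"] = field.text
        elif name == "person":
            # Keep the GND link (if any) so it can be followed during enrichment.
            doc["persons"].append({"name": field.text, "gnd": field.get("gnd")})
    return doc


normalized = normalize(SAMPLE_XML)
print(json.dumps(normalized, indent=2))
```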

Run data enrichment script

./enrich.py -o data/enriched -e YOUR_EUROPEANA_API_KEY data/normalized/*.json
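
The filtering part of enrichment (keeping only Europeana items that share at least one URI with the record) can be sketched as a set intersection. The candidate layout with a `uris` key and the function name `filter_related` are assumptions for illustration, not the script's actual interface, and no Europeana API call is made here.

```python
def filter_related(record_uris, candidates):
    """Keep candidates that share at least one URI with the record.

    `candidates` is assumed to be a list of dicts with an optional
    "uris" list; this layout is hypothetical.
    """
    known = set(record_uris)
    return [item for item in candidates if known & set(item.get("uris", []))]


# Placeholder URIs, for illustration only.
record_uris = ["http://d-nb.info/gnd/0", "http://viaf.org/viaf/0"]
candidates = [
    {"id": "item-1", "uris": ["http://viaf.org/viaf/0"]},
    {"id": "item-2", "uris": ["http://example.org/other"]},
    {"id": "item-3"},  # no URIs at all, so it cannot match
]

related = filter_related(record_uris, candidates)
print([item["id"] for item in related])
```

Using sets keeps the overlap check cheap even when a record has accumulated many URIs from GND, DBpedia, and VIAF.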
