GitHub - JTFouquier/crowdflower_relation_verification: Verification of EU-ADR drug-disease relationships using crowdsourcing.


Data and Code for "Exposing ambiguities in a relation-extraction gold standard with crowdsourcing"

Last updated 2015-04-14 Tong Shu Li

This repository contains the code and data used to produce the paper "Exposing ambiguities in a relation-extraction gold standard with crowdsourcing".

Questions can be sent to tongli@scripps.edu.

Contents

crowdflower/: this directory contains all of the instructions and markup for CrowdFlower job 710587, which was used to gather the data analyzed in the paper.
data/: this directory contains the CrowdFlower output data as well as other data.
src/: this directory contains all of the source code referenced by the IPython notebooks.
create_work_units.ipynb: Code for randomly selecting some drug-disease relationships to show to the crowd.
demographic_analysis.ipynb: An analysis of the countries of origin of the task participants.
fig2_expert_crowd_confidence.ipynb: Contains all of the code used to generate figure 2.
fig3_raw_annotation_agreement.ipynb: Contains all of the code used to generate figure 3.
identify_published_drug_disease_relationships.ipynb: Assigns unique identifiers to the published EU-ADR drug-disease relationships.
identify_raw_annotations.ipynb: Assigns unique identifiers to the raw annotations in the raw EU-ADR.
map_euadr_pub_to_raw.ipynb: Maps the identifiers for the published drug-disease relationships back to the identifiers for the raw annotations.
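
The random selection performed in create_work_units.ipynb can be illustrated with a minimal sketch. This is not the notebook's actual code: the function name `select_work_units`, the identifier format, the sample size, and the seed are all assumptions made for illustration. The sketch only shows one way to draw a reproducible random subset of relationship identifiers.

```python
import random


def select_work_units(relation_ids, sample_size, seed=0):
    """Return a reproducible random sample of relationship identifiers.

    Hypothetical helper: the name, arguments, and seeding strategy are
    assumptions, not taken from the repository.
    """
    rng = random.Random(seed)
    # Sort first so the sample does not depend on the input order.
    return rng.sample(sorted(relation_ids), sample_size)


# Made-up identifiers standing in for drug-disease relationships.
relation_ids = [f"rel_{i:03d}" for i in range(20)]
sample = select_work_units(relation_ids, sample_size=5, seed=42)
print(sample)
```

Fixing the seed means the same relationships are selected on every run, which makes it possible to regenerate the same work units later.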

Workflow

Assign unique identifiers to the published EU-ADR drug-disease relationships.
Assign unique identifiers to the raw EU-ADR relationship annotations.
Map the published and raw identifiers.
Create the work units for CrowdFlower.
Aggregate the CrowdFlower data for analysis.
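
The steps above can be sketched end to end on toy data. This is a hedged illustration rather than the repository's implementation: the record layout (PMID, drug, disease, annotator), the identifier prefixes, the matching key, and the use of a simple majority vote for aggregation are all assumptions, and every function name below is hypothetical.

```python
from collections import Counter, defaultdict


def assign_ids(records, prefix):
    """Give each distinct record an identifier such as 'pub_0' (assumed format)."""
    ids = {}
    for record in records:
        if record not in ids:
            ids[record] = f"{prefix}_{len(ids)}"
    return ids


def map_published_to_raw(published_ids, raw_ids, key):
    """Link each published identifier to the raw identifiers sharing the same key."""
    raw_by_key = defaultdict(list)
    for record, raw_id in raw_ids.items():
        raw_by_key[key(record)].append(raw_id)
    return {
        pub_id: raw_by_key.get(key(record), [])
        for record, pub_id in published_ids.items()
    }


def majority_vote(judgments):
    """Aggregate (unit_id, answer) pairs into the most common answer per unit.

    Majority voting is an assumed aggregation rule; ties go to the answer
    that was seen first.
    """
    votes = defaultdict(Counter)
    for unit_id, answer in judgments:
        votes[unit_id][answer] += 1
    return {unit_id: counts.most_common(1)[0][0] for unit_id, counts in votes.items()}


# Toy records: (pmid, drug, disease) and (pmid, drug, disease, annotator).
published = [("123", "aspirin", "headache"), ("456", "ibuprofen", "ulcer")]
raw = [
    ("123", "aspirin", "headache", "annotator_1"),
    ("123", "aspirin", "headache", "annotator_2"),
    ("456", "ibuprofen", "ulcer", "annotator_1"),
]

published_ids = assign_ids(published, "pub")   # steps 1 and 2
raw_ids = assign_ids(raw, "raw")
mapping = map_published_to_raw(                # step 3
    published_ids, raw_ids, key=lambda record: record[:3]
)

# Made-up crowd judgments for the selected work units (step 5).
judgments = [("pub_0", "yes"), ("pub_0", "yes"), ("pub_0", "no"), ("pub_1", "no")]
consensus = majority_vote(judgments)

print(mapping)
print(consensus)
```

Here each published relationship is matched to the raw annotations that share its first three fields. The real data may need a different matching key.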
