GitHub - tbmihailov/conll16st-hd-sdp: Code for the paper "Discourse Relation Sense Classification Using Cross-argument Semantic Similarity Based on Word Embeddings" - participation in the CoNLL 2016 shared task on Discourse Relation Sense Classification http://www.cs.brandeis.edu/~clp/conll16st/

Repo for CoNLL 2016 Shared Task on Shallow Discourse Parsing

Repository for participation in the CoNLL 2016 shared task of Shallow Discourse Parsing (http://www.cs.brandeis.edu/~clp/conll16st/)

Contains code used for the paper: Discourse Relation Sense Classification Using Cross-argument Semantic Similarity Based on Word Embeddings

@inproceedings{mihaylovfrank:2016a,
  author = {Todor Mihaylov and Anette Frank},
  title = {{Discourse Relation Sense Classification Using Cross-argument Semantic Similarity Based on Word Embeddings}},
  year = {2016},
  publisher = {Association for Computational Linguistics},
  booktitle = {Proceedings of the Twentieth Conference on Computational Natural Language Learning - Shared Task},
  pages = {100--107},
  address = {Berlin, Germany},
  url = {https://aclweb.org/anthology/K/K16/K16-2014.pdf}
}

Setup environment

Create virtual environment

virtualenv venv

Activate the environment:

cd venv
source bin/activate

Install everything from requirements.txt

pip install -r requirements.txt

Get the data!

In the current repository we have only the public dev set. To get the training data, you need to obtain the shared task data from here: http://www.cs.brandeis.edu/~clp/conll16st/dataset.html

Run experiments with the Feature-based model

Update the paths to the training and evaluation data in a copy of scripts/sup_parser_v2_hierarchy_ext_run_dev2016_dev2016.sh
Run

bash scripts/sup_parser_v2_hierarchy_ext_run_dev2016_dev2016.sh

CoNLL 2016 Shared Task - how to from the official documentation

Validator

The validator is provided to make sure the discourse parser output is in the right format. In this version of the task, language must be specified when validating the output. Sample usage:

python2.7 scorer/validator.py en tutorial/output.json

If you would like to see what the error messages look like, try running:

python2.7 scorer/validator.py en tutorial/faulty_output.json

Scorer

The official scorer for the final evaluation is used to calculate evaluation metrics for argument labeler, connective detection, sense classification, and overall parsing performance. The scorer gives quite detailed scores for analytical purposes. We also provide scoring based on partial matching of argument. The detail on how partial matching criteria are computed can be found in the task blog.

The output should be validated without any error. The scorer will fail silently if the output is not in the right format or validated.

Sample usage:

python2.7 scorer/scorer.py tutorial/conll16st-en-01-12-16-trial/relations.json tutorial/output.json

TIRA scorer

This is the scorer that is used in the TIRA evaluation platform. You should check this out and try to run this offline and see if your parser outputs the right kind of format.

Sample usage:

First, run your parser on the dataset. Your parser should load the parses from path/to/data_dir, use path/to/model_dir/ for model and other resources and produce the output file in path/to/outputdir/output.json. We will use the sample parser as an example:

python2.7 scorer/sample_parser.py path/to/data_dir path/to/model_dir path/to/output_dir
ls -l path/to/output_dir/output.json

Next, run the TIRA scorer on it:

python2.7 scorer/tira_eval.py path/to/data_dir path/to/output_dir path/to/result_dir

The evaluation will be done on the gold standard in path/to/data_dir and the predictions in path/to/output_dir and the results will be put in path/to/result_dir/evaluation.prototext, which looks like:

measure {
 key: "Parser precision"
 value: "0.0373"
}
measure {
 key: "Parser recall"
 value: "0.0418"
}
measure {
 key: "Parser f1"
 value: "0.0394"
}
...

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
scorer		scorer
scripts		scripts
sdp		sdp
tutorial		tutorial
.gitignore		.gitignore
README.md		README.md
___commands.txt		___commands.txt
__init__.py		__init__.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scorer

scorer

scripts

scripts

sdp

sdp

tutorial

tutorial

.gitignore

.gitignore

README.md

README.md

___commands.txt

___commands.txt

init.py

init.py

requirements.txt

requirements.txt

Repository files navigation

Repo for CoNLL 2016 Shared Task on Shallow Discourse Parsing

Setup environment

Create virtual environment

Install everything from requirements.txt

Get the data!

Run experiments with the Feature-based model

CoNLL 2016 Shared Task - how to from the official documentation

Validator

Scorer

TIRA scorer

About

Releases

Packages

Contributors 3

Languages

tbmihailov/conll16st-hd-sdp

Folders and files

Latest commit

History

Repository files navigation

Repo for CoNLL 2016 Shared Task on Shallow Discourse Parsing

Setup environment

Create virtual environment

Install everything from requirements.txt

Get the data!

Run experiments with the Feature-based model

CoNLL 2016 Shared Task - how to from the official documentation

Validator

Scorer

TIRA scorer

About

Resources

Stars

Watchers

Forks

Languages