
OBQA

Open book question answering.

Data Set

OBQA Link to Dataset

Parsing and Tagging of Wh Questions

The parser and tagger are taken from https://github.com/tdozat/Parser-v3.
Both are trained on the CoNLL 2018 dataset:
- Setup: Clone the repository and create the data directory data/CoNLL18/UD_English-EWT. Save both the datasets and the embeddings there.
- Training data: http://universaldependencies.org/conll18/
- Word2Vec embeddings: https://lindat.mff.cuni.cz/repository/xmlui/handle/11234/1-1989. Only the English Word2Vec embeddings are needed.
- Environment: TensorFlow=1.4, Scipy, Matplotlib, Psutil, Python=3.6, Pandas, Conllu
- Training command: python main.py train ParserNetwork (or TaggerNetwork for the tagger)
- Run model command: python main.py --save_dir=$PATH_TO_NETWORK run $INPUTFILE --output_dir=$OUTPUTDIR
- Key point to note: the CoNLL-U format must be followed strictly, with tabs between columns.
- Trained models will be pushed to: [DropboxLocation]
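Because the input file must be strictly tab-separated CoNLL-U, it can help to generate and check it programmatically. The following is a minimal sketch, not part of this repository: the helper names to_conllu and check_conllu are hypothetical, and it assumes unannotated fields are written as underscores in the standard ten-column CoNLL-U layout. Whether Parser-v3 accepts fully unannotated input has not been verified here.

```python
# The ten tab-separated columns of a CoNLL-U token line.
CONLLU_COLUMNS = [
    "ID", "FORM", "LEMMA", "UPOS", "XPOS",
    "FEATS", "HEAD", "DEPREL", "DEPS", "MISC",
]


def to_conllu(tokens):
    """Render one tokenized sentence as a CoNLL-U block (hypothetical helper).

    Only ID and FORM are filled in; every other field is set to "_".
    """
    lines = []
    for index, form in enumerate(tokens, start=1):
        row = [str(index), form] + ["_"] * (len(CONLLU_COLUMNS) - 2)
        lines.append("\t".join(row))
    # A blank line terminates each sentence.
    return "\n".join(lines) + "\n\n"


def check_conllu(text):
    """Return the 1-based numbers of token lines without exactly ten tab-separated fields."""
    bad = []
    for number, line in enumerate(text.split("\n"), start=1):
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        if len(line.split("\t")) != len(CONLLU_COLUMNS):
            bad.append(number)
    return bad


question = ["Which", "animal", "lays", "eggs", "?"]
block = to_conllu(question)
print(block, end="")
print("Malformed lines:", check_conllu(block))
```

Running check_conllu on a file before passing it as $INPUTFILE catches the most common mistake, which is spaces in place of tabs.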

Knowledge selection from OpenBook facts

See the ir folder, which has its own README.

Word Intersection, Union and Seq2Seq Abductive IR

Implemented in the repository's notebooks and folders (for example WordUnionDataGet.ipynb and seq2seq).
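To illustrate the general idea of word intersection and union between a question and candidate facts, here is a minimal sketch. It is not the code from this repository: the function names, the whitespace tokenizer, and the use of the intersection-over-union ratio (Jaccard similarity) as a ranking score are all assumptions made for illustration.

```python
def tokenize(text):
    """Lowercase the text and split on whitespace, dropping '?' and '.' (assumed tokenizer)."""
    cleaned = text.lower().replace("?", " ").replace(".", " ")
    return set(cleaned.split())


def overlap_scores(question, fact):
    """Return (intersection size, union size, intersection / union) for two texts."""
    q_words, f_words = tokenize(question), tokenize(fact)
    intersection = q_words & f_words
    union = q_words | f_words
    ratio = len(intersection) / len(union) if union else 0.0
    return len(intersection), len(union), ratio


def rank_facts(question, facts, top_k=2):
    """Rank facts by the overlap ratio with the question and keep the top_k (assumed scoring)."""
    ranked = sorted(
        facts,
        key=lambda fact: overlap_scores(question, fact)[2],
        reverse=True,
    )
    return ranked[:top_k]


question = "Which animal lays eggs?"
facts = [
    "A bird lays eggs.",
    "The sun is a star.",
    "An animal needs food to survive.",
]
for fact in rank_facts(question, facts):
    print(overlap_scores(question, fact), fact)
```

A set-based score like this ignores word order and word frequency, which keeps it cheap enough to run over every fact in the open book before any heavier re-ranking step.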

Re-Ranking using spaCy

See the filterir folder.

BERT QA models

A runner and a scorer are provided in the bert folder.


Apache-2.0 license
