vgpprasad91/Seq2Graph: This project explores how to extract relations from text data automatically, without human annotations, and convert them to graph triplets.

This project is inspired by the paper "Matching the Blanks: Distributional Similarity for Relation Learning", published at ACL 2019.

Before settling on this approach, I considered two other approaches for this project. They are listed below, with their papers:

Python: 3.6+
spaCy: 2.1.8+
PyTorch: 1.7.0+

The following steps are involved in creating the graphs:

Masking the entities extracted from the Reddit dumps, using spaCy's dependency parsing and POS tagging linguistic features.
Fine-tuning on the SemEval 2010 Task 8 relation extraction dataset.
Using spaCy's linguistic features, automatically annotating the extracted entities and inferring the relationship between them with the pretrained model.
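The masking step can be illustrated with a minimal sketch. It is not the code from this repository: it assumes the two entity spans have already been found (for example from spaCy's parse) and are given as token index pairs. The marker tokens [E1], [/E1], [E2], [/E2] and [BLANK], the function name mask_entities, and the default alpha value are illustrative assumptions modelled on the "Matching the Blanks" setup.

```python
import random

def mask_entities(tokens, e1_span, e2_span, alpha=0.7, rng=None):
    """Wrap two entity spans in marker tokens.

    Each entity is replaced by the placeholder [BLANK] with probability
    alpha (an assumed default). Spans are (start, end) token indices,
    end exclusive, and must not overlap.
    """
    rng = rng or random.Random()
    # Process spans left to right, remembering which entity each one is.
    spans = sorted([(e1_span, "E1"), (e2_span, "E2")])
    out, cursor = [], 0
    for (start, end), tag in spans:
        out.extend(tokens[cursor:start])  # text before the entity
        inner = ["[BLANK]"] if rng.random() < alpha else tokens[start:end]
        out.extend([f"[{tag}]", *inner, f"[/{tag}]"])
        cursor = end
    out.extend(tokens[cursor:])  # text after the last entity
    return out

tokens = "Smoking causes lung cancer".split()
print(" ".join(mask_entities(tokens, (0, 1), (2, 4), alpha=1.0)))
```

With alpha=1.0 both entities are always blanked, and with alpha=0.0 the original entity text is always kept between the markers.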

Training Stages:

The objective here is, given a pair of entities in a sentence, to predict a relation type from a fixed dictionary of relation types. For example, "Cause-Effect" is one of the relation types in the fixed dictionary from SemEval 2010 Task 8.
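As a rough sketch of this objective, the fixed dictionary can be written out as a label list and a prediction taken as the highest-scoring entry. The nine relation names plus "Other" are the SemEval 2010 Task 8 relation types; the directional suffixes, the names LABELS, LABEL2ID and predict_relation, and the example logits are assumptions for illustration. In the project, the scores would come from the fine-tuned model rather than being written by hand.

```python
import math

# The nine directed relation types of SemEval 2010 Task 8, plus "Other".
RELATIONS = [
    "Cause-Effect", "Component-Whole", "Content-Container",
    "Entity-Destination", "Entity-Origin", "Instrument-Agency",
    "Member-Collection", "Message-Topic", "Product-Producer",
]
# Each relation can hold in either direction, giving 2 * 9 + 1 = 19 labels.
LABELS = [f"{r}({d})" for r in RELATIONS for d in ("e1,e2", "e2,e1")] + ["Other"]
LABEL2ID = {label: i for i, label in enumerate(LABELS)}

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predict_relation(logits):
    """Return (label, probability) for the highest-scoring relation type."""
    if len(logits) != len(LABELS):
        raise ValueError("expected one logit per relation label")
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return LABELS[best], probs[best]

# Hypothetical classifier scores favouring Cause-Effect(e1,e2).
logits = [0.0] * len(LABELS)
logits[LABEL2ID["Cause-Effect(e1,e2)"]] = 5.0
print(predict_relation(logits))
```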

--contd...

Repository contents:

.idea
_data
amrlib
d3rdf
transition_words
README.md
connected_components.py
database.py
graph_data.csv
main.py
matcher.py
preprocess.py
process_triples.py
relationship_extraction.py
requirements.txt
spacy_parser.py
triplets.csv
triplets.py
