Skip to content

This project is to understand how to extract relations from text data automatically without human annotations and convert it to graph triplets.

Notifications You must be signed in to change notification settings

vgpprasad91/Seq2Graph

Repository files navigation

This project is based on the inspiration from the paper, "Matching the Blanks: Distributional Similarity for Relation Learning" published in ACL 2019.

Before zeroing on this approach, I had considered two other approaches for this project. Below are them, with their papers:

  1. LEVERAGING SEMANTIC PARSING FOR RELATION LINKING OVER KNOWLEDGE BASES
  2. Transition words based link prediction

Download all the models from this link

  1. Run Python main.py

Requirements:

Python: 3.6+ Spacy: 2.1.8+ Pytorch: 1.7.0+

The following steps are involved in the process of creation of graphs:

  1. Masking the entities extracted based on Spacy's dependency Parsing and POS tagging linguistic features from the reddit dumps.
  2. Fine tuning on the Semeval 2010 relation extraction paper.
  3. Based on spacy's linguistic features, we can automatically annotate and infer the relationship between the extracted entities using the pretrained model.

Training Stages:

  1. The objective here is that given a relation pair, predict a relation type from a fixed dictionary of relation types. For ex: "Cause-Effect" is one among the fixed dictionary of relation types from the SemEval 2010 Task 8.

--contd...

About

This project is to understand how to extract relations from text data automatically without human annotations and convert it to graph triplets.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published