GitHub - abhinavanshuman15/Question-Answering-System-using-Hashed-Memory-Networks

Description:- Implementation of a question-answering system using hashed memory networks for the Facebook bAbI dataset.

Requirements:-

Python 2.7
numpy
tensorflow==0.12.1
pandas
Flask

Dataset:-

Download the dataset from the page of Jason Weston (one of the authors of the dataset): http://www.thespermwhale.com/jaseweston/babi/tasks_1-20_v1-2.tar.gz
Extract the archive into a data directory inside the project's home directory so that all extracted directories (en, en-10k, hn) appear under the data/tasks_1-20_v1-2 path.
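A quick way to confirm the layout before training is to check for the three directories named above. This is a minimal sketch, not part of the repository; the helper name missing_dataset_dirs is hypothetical, and it only checks that the directories exist.

```python
import os

# Sub-directories this README expects under data/tasks_1-20_v1-2.
EXPECTED_SUBDIRS = ("en", "en-10k", "hn")


def missing_dataset_dirs(project_root="."):
    """Return the expected dataset directories that are not present.

    Hypothetical helper: it does not inspect the files inside each directory.
    """
    base = os.path.join(project_root, "data", "tasks_1-20_v1-2")
    return [name for name in EXPECTED_SUBDIRS
            if not os.path.isdir(os.path.join(base, name))]
```

Calling missing_dataset_dirs() from the project home directory returns an empty list when all three directories are in place.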

Training and testing are done on both the en and en-10k datasets.

Training:-

There are two training scripts: train_single.py trains each task individually, and train_combinedly.py trains all tasks jointly.

Single Training:-

Run $python train_single.py (by default it trains task 1 on the 1k dataset). Pass arguments to work on a different task or dataset:
$python train_single.py --task_id <any task ID between 1-20> --data_dir <data/tasks_1-20_v1-2/en-10k for 10k dataset> --reader <bow, simple_gru>
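To train all 20 tasks one after another, the command above can be wrapped in a small driver script. The sketch below is an assumption about how one might do that, not part of the repository; build_train_command and train_all_tasks are hypothetical names, and only the three flags documented above are used.

```python
import subprocess


def build_train_command(task_id, data_dir=None, reader=None):
    """Build the argument list for one train_single.py run."""
    if not 1 <= task_id <= 20:
        raise ValueError("task_id must be between 1 and 20")
    cmd = ["python", "train_single.py", "--task_id", str(task_id)]
    # Omitted flags fall back to the script's own defaults.
    if data_dir is not None:
        cmd += ["--data_dir", data_dir]
    if reader is not None:
        cmd += ["--reader", reader]
    return cmd


def train_all_tasks(data_dir=None, reader=None):
    """Run train_single.py for tasks 1 to 20, stopping on the first failure."""
    for task_id in range(1, 21):
        subprocess.check_call(build_train_command(task_id, data_dir, reader))


# Example call, run from the project home directory:
# train_all_tasks(data_dir="data/tasks_1-20_v1-2/en-10k", reader="bow")
```

subprocess.check_call raises an exception if a run exits with a non-zero status, so a failing task stops the loop instead of being skipped silently.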

For the 1k dataset, the models are written to the models/models-1k/task- folder for each task. To store models for the 10k dataset under models/models-10k/task-, change the path on line 179 of train_single.py to "./models/models-10k/task-{}/model.ckpt".format(FLAGS.task_id).
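The format string above shows how the checkpoint path is derived from the task ID. The sketch below illustrates the same pattern with the dataset size as a second parameter; checkpoint_path is a hypothetical helper, and the 1k path is assumed to follow the same shape as the 10k path quoted above.

```python
def checkpoint_path(task_id, dataset="1k"):
    """Return the checkpoint path for a task, e.g. for task 7 on the 10k data.

    Assumption: the 1k layout mirrors the 10k path given in this README.
    """
    if dataset not in ("1k", "10k"):
        raise ValueError("dataset must be '1k' or '10k'")
    return "./models/models-{}/task-{}/model.ckpt".format(dataset, task_id)
```

With a helper like this, switching between datasets would not require editing the path by hand for each run.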

Combined Training:-

$python train_combinedly.py trains all the tasks jointly on the 1k dataset.
$python train_combinedly.py --data_dir data/tasks_1-20_v1-2/en-10k trains on the 10k dataset.

For the 1k dataset, the model is written to the models/models-1k/joint folder. To store the model for the 10k dataset under models/models-10k/joint, change the path on line 179 of single.py to "./models/models-10k/joint/model.ckpt".

All log files are generated in the logs directory.

The final accuracy scores for the 1k dataset are written to the project home directory as the "single_scores.csv" file. The final accuracy scores for the 10k dataset are written to the logs directory as a csv file.

Webapp:-

For demonstration purposes we have created a simple webapp that shows a randomly selected story from the dataset along with an initial related question. The user can switch to another question related to the given story and predict the answer, or load a different story by clicking the Get new story button. Questions are evaluated on the given story using a pre-trained model. Before running the command below, make sure the "./models/joint/" folder contains a pre-trained model.

$python webapp.py
Go to browser and open 127.0.0.1:5000
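Since the webapp depends on a pre-trained model in "./models/joint/", a small check before launching can save a confusing startup error. This is a sketch under an assumption: it treats any file whose name starts with model.ckpt as evidence of a saved model, based on the checkpoint paths mentioned in the Training section. has_pretrained_model is a hypothetical helper, not part of the repository.

```python
import os


def has_pretrained_model(model_dir="./models/joint/"):
    """Return True if model_dir holds at least one model.ckpt* file.

    Assumption: checkpoints are saved with the model.ckpt prefix.
    """
    if not os.path.isdir(model_dir):
        return False
    return any(name.startswith("model.ckpt") for name in os.listdir(model_dir))
```

If this returns False, run train_combinedly.py first or copy an existing joint model into the folder.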

