GitHub

Usage

Currently use train_nlc.py or train_engine.py

Call evaluation by:

CUDA_VISIBLE_DEVICES=0 python train_logic.py --train_dir sandbox/rnn_logic_seq_256_d15 --size 256 --dev True --best_epoch 13 --restore_checkpoint sandbox/rnn_logic_seq_256_d15/best.ckpt-13

Model

We build four types of models:

Seq2Seq model (RNN(S1) -> Output) or (RNN(S1) -> RNN(S2) -> Output)
Normal Attention model (Attention goes from S1 to S2, and encoded S2 to Output)
Coattention model
Concatenated Attention Decoder Model (without multihead attention)
Concatenated Multi-head Attention Decoder Model (Transformer) (not yet implemented)

Task: RNN_Logic

We generate the logical form conditioned not just on the input query, but on the context as well. Q2L means "Query to Logical parse"

Model Type	EM	F1	param_size
no context (Q2L)	55.90	92.81	1.84M
seq	53.89	92.28	2.63M
attn	6.74	69.61	1.97M
concat-attn	49.47	91.88	2.63M
co-attn	51.48	92.08	3.42M

All models report their best EM/F1 under optimal settings.

no context (Q2L): size 256, 20 epochs
Seq: size 256, 15 epochs
Attn: size 256, 20 epochs
concat-attn: 256, 25 epochs
co-attn: 256, 35 epochs

Task: RNN_Engine

We directly predict the output of a query from the context.

Model Type	EM	F1	param_size
null hypothesis (no query)	21.76	82.36	1.84M
seq	59.91	94.27	2.63M
attn	2.65	24.42	1.97M
concat-attn	64.17	93.99	2.63M
co-attn	55.74	92.26	3.41M

All models report their best EM/F1 under optimal settings.

Null hypothesis: size 256, 20 epochs
Seq: size 256, 20 epochs
Attn: size 256, 20 epochs
concat-attn: size 256, 20 epochs
co-attn: size 256, 20 epochs

(note that concat-attn and seq have the same amount of parameters, and share basic architecture)

(note that co-attn could be under-trained because the parameter size, but size=256 outperforms size=128, could try size=175)

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
data/shrdlurn		data/shrdlurn
queries		queries
.gitignore		.gitignore
analysis.ipynb		analysis.ipynb
data_util.py		data_util.py
decode.py		decode.py
epoch10_log.txt		epoch10_log.txt
f1_em_plot.png		f1_em_plot.png
plot.py		plot.py
readme.md		readme.md
rnn.py		rnn.py
rnn_core.py		rnn_core.py
rnn_engine.png		rnn_engine.png
rnn_engine.py		rnn_engine.py
rnn_logic.py		rnn_logic.py
rnn_logic_fig.png		rnn_logic_fig.png
rnn_nlc.py		rnn_nlc.py
rnn_torch.py		rnn_torch.py
rnn_transformer.py		rnn_transformer.py
shrdlurn_data.ipynb		shrdlurn_data.ipynb
shrdlurn_data_util.py		shrdlurn_data_util.py
train.py		train.py
train_core.py		train_core.py
train_engine.py		train_engine.py
train_logic.py		train_logic.py
train_nlc.py		train_nlc.py
util.py		util.py
viz.ipynb		viz.ipynb

windweller/Sempar

Folders and files

Latest commit

History

Repository files navigation

Usage

Model

Task: RNN_Logic

Task: RNN_Engine

Error Analysis

About

Resources

Stars

Watchers

Forks

Languages