
HRED VHRED VHCR for Multi-Turn Dialogue Systems

Preprocess data

Place train.txt, dev.txt, and test.txt under ./data.

Each line contains the dialogue context, with utterances separated by </s>, followed by a tab and the response:

u1 </s> u2 </s> u3 \t response

Example:

w11 w12 w13 </s> w21 w22 </s> w31 w32 w33 w34 \t w1 w2 w3

Then run:

python prepare_data.py
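For reference, each line can be parsed by splitting on the tab and on the </s> separators. A minimal sketch (prepare_data.py performs the actual tokenization and vocabulary building; this only illustrates the format):

def parse_line(line):
    # "u1 </s> u2 </s> u3 \t response" -> (context utterances, response tokens)
    context, response = line.rstrip("\n").split("\t")
    utterances = [u.split() for u in context.split("</s>")]
    return utterances, response.split()

with open("data/train.txt") as f:
    for line in f:
        context, response = parse_line(line)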

Training

Go to the model directory and set save_dir in configs.py; this is where model checkpoints will be saved.
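For example (the variable name save_dir comes from configs.py; the path here is just a placeholder):

save_dir = './checkpoints'  # model checkpoints will be written here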

We provide our implementation of VHCR, as well as reference implementations of HRED and VHRED.

To run training:

python train.py --model=<model> --batch_size=<batch_size>

For example:

  1. Train HRED:
python train.py --model=HRED
  2. Train VHRED with a word-drop ratio of 0.25 and 250,000 KL annealing iterations:
python train.py --model=VHRED --batch_size=40 --word_drop=0.25 --kl_annealing_iter=250000
  3. Train VHCR with an utterance-drop ratio of 0.25:
python train.py --model=VHCR --batch_size=40 --sentence_drop=0.25 --kl_annealing_iter=250000
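The --kl_annealing_iter flag sets how many steps the KL term of the variational objective is annealed over. A common linear schedule looks like the sketch below (an assumption for illustration, not necessarily this repo's exact implementation):

def kl_weight(step, kl_annealing_iter=250000):
    # linearly ramp the KL coefficient from 0 to 1 over kl_annealing_iter steps
    return min(1.0, step / kl_annealing_iter)

# loss = reconstruction_loss + kl_weight(step) * kl_divergence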

Evaluation

To evaluate word perplexity:

python eval.py --model=<model> --checkpoint=<path_to_your_checkpoint>
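Word perplexity is the exponential of the average per-token negative log-likelihood over the test set. Conceptually (a sketch; eval.py computes the actual numbers):

import math

def word_perplexity(total_nll, num_tokens):
    # total_nll: summed negative log-likelihood (in nats) over all target tokens
    return math.exp(total_nll / num_tokens)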

For embedding-based metrics, download the Google News word vectors, unzip the file, and put it under the datasets folder. Then run:

python eval_embed.py --model=<model> --checkpoint=<path_to_your_checkpoint>
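Embedding-based metrics compare generated and reference responses in word-vector space. A minimal sketch of the embedding-average variant using gensim (for illustration only; eval_embed.py may differ in details such as stopword handling):

import numpy as np
from gensim.models import KeyedVectors

# assumes the unzipped GoogleNews-vectors-negative300.bin sits under datasets/
w2v = KeyedVectors.load_word2vec_format(
    "datasets/GoogleNews-vectors-negative300.bin", binary=True)

def embedding_average(tokens):
    vecs = [w2v[t] for t in tokens if t in w2v]
    return np.mean(vecs, axis=0) if vecs else np.zeros(w2v.vector_size)

def cosine(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8))

score = cosine(embedding_average("i am fine".split()),
               embedding_average("i am doing well".split()))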

Generation

To generate responses for the test set:

python test.py --model=<model> --checkpoint=<path_to_your_checkpoint>

BLEU and DIST

To compute BLEU and DIST scores on the generated responses:

python metrics.py
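DIST-n is the ratio of unique n-grams to total n-grams across all generated responses, and BLEU compares each response to its reference. Roughly (a sketch using NLTK; metrics.py may compute these differently):

from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

def distinct_n(sentences, n):
    # DIST-n: unique n-grams divided by total n-grams over all responses
    total, unique = 0, set()
    for toks in sentences:
        for i in range(len(toks) - n + 1):
            unique.add(tuple(toks[i:i + n]))
            total += 1
    return len(unique) / max(total, 1)

hyp = "i am fine thank you".split()
ref = "i am fine thanks".split()
bleu = sentence_bleu([ref], hyp, smoothing_function=SmoothingFunction().method1)
dist1 = distinct_n([hyp], 1)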

