Cruz Control

GPT2 Models

We currently provide training scripts for the following models:

  • GPT2 Baseline Text + Fact
  • Knowledge Dependent Policy Driven Neural Response Generator using Mezza Tags

Contact

For any clarification related to the above code, please reach out to Rishi Rajasekaran (rrajasek@ucsc.edu)

DSTC9 Baseline Code (untested)

Response Generation

Scripts to train Seq2Seq and Transformer models on the Amazon Topical-Chat Corpus. This code serves as the baseline for DSTC9 Track 3.

To train: python3 train.py --use_knowledge --transformer --save_path transformer/

To test: python3 test.py --use_knowledge --transformer --save_path transformer/

To serve interactive model with TF-IDF based fact selection: python3 dynamic.py --use_knowledge --transformer --save_path transformer/
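As a rough illustration of how TF-IDF based fact selection works, here is a minimal, self-contained sketch: score each candidate fact against the dialog context by cosine similarity of TF-IDF vectors and return the best match. All names here are illustrative, and the whitespace tokenizer is a stand-in; the actual dynamic.py may tokenize and weight terms differently.

```python
import math
from collections import Counter

def tokenize(text):
    # Naive whitespace tokenizer; the real pipeline presumably uses spacy/nltk.
    return text.lower().split()

def cosine(a, b):
    # Cosine similarity between two sparse {term: weight} vectors.
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def select_fact(context, facts):
    """Return the fact whose TF-IDF vector is most similar to the context."""
    tokenized = [tokenize(f) for f in facts]
    n = len(tokenized)
    df = Counter()                      # document frequency over the fact set
    for doc in tokenized:
        df.update(set(doc))

    def vectorize(tokens):
        tf = Counter(tokens)
        # TF * IDF; context terms unseen in any fact are dropped.
        return {t: (c / len(tokens)) * math.log(1 + n / df[t])
                for t, c in tf.items() if t in df}

    ctx = vectorize(tokenize(context))
    scores = [cosine(ctx, vectorize(doc)) for doc in tokenized]
    return facts[max(range(n), key=scores.__getitem__)]
```

The key design point is that IDF is fit over the candidate fact set, so terms that appear in every fact contribute little to the ranking.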

Data

The pre-processed data can be found in data.zip. If you would like to use a different pre-processing strategy, please download the original data from here.

The dataset preparation code is split between the utils.py and tc_dataset.py files. Data loading and tokenization are done in utils.py, while preparing the tokenized data to feed into the model is done in tc_dataset.py.
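That split can be sketched as follows. All names here are hypothetical stand-ins (they are not the actual functions in this repo): one function plays the utils.py role of loading and tokenizing, and a small dataset class plays the tc_dataset.py role of turning tokenized dialogs into (history, response) training pairs.

```python
def load_and_tokenize(dialogs, tokenizer):
    """utils.py role: tokenize every turn of every dialog."""
    return [[tokenizer(turn) for turn in dialog] for dialog in dialogs]

class TopicalChatDataset:
    """tc_dataset.py role: build (history, response) pairs for the model."""

    def __init__(self, tokenized_dialogs, max_history=2):
        self.examples = []
        for dialog in tokenized_dialogs:
            for i in range(1, len(dialog)):
                # Each turn becomes a response; the preceding turns
                # (up to max_history) form its context.
                history = dialog[max(0, i - max_history):i]
                self.examples.append((history, dialog[i]))

    def __len__(self):
        return len(self.examples)

    def __getitem__(self, idx):
        return self.examples[idx]
```

In the actual code the pairs would additionally carry the selected knowledge fact and be converted to token IDs, but the history-windowing shape is the same.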

Contact

If you experience any issues with this code, please contact me at mehrishikib@gmail.com

Setup

  • pip install spacy nltk
  • python -m spacy download en_core_web_lg
  • python -c "import nltk; nltk.download('punkt')"

About

Project repo for the DSTC9 dialog evaluation challenge
