A pretrained BERT from HuggingFace can easily be fine-tuned into a sequence classifier. This repo uses a pretrained BERT to train a 'yes, and' classifier, which determines whether a given dialogue pair constitutes a "Yes, and...".
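For orientation, here is a minimal sketch of loading the pretrained model and tokenizer with `pytorch_transformers` (the library this repo targets after migration); the repo's actual setup may differ:

```python
# Minimal sketch (not the repo's exact code): load a pretrained BERT
# with a two-label classification head, as used for the 'yes, and' task.
from pytorch_transformers import BertConfig, BertForSequenceClassification, BertTokenizer

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
config = BertConfig.from_pretrained('bert-base-uncased', num_labels=2)
model = BertForSequenceClassification.from_pretrained('bert-base-uncased', config=config)
```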
Requirements can be found in `requirements.txt`.
In practice, this code can be adjusted with minimal effort to train any downstream task that takes two sentences as input, such as textual entailment (NLI). The only modifications required are to `get_data` and `build_bert_input` in `utils.py`, which format the input data appropriately; they are marked with `#TODO` in the code.
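As an illustration, here is a hypothetical sketch of what these two functions might look like for a generic two-sentence task; the real signatures in `utils.py` may differ:

```python
# Hypothetical sketch only: the actual signatures of get_data and
# build_bert_input in utils.py may differ. Shown here for a generic
# two-sentence task with tab-separated "sent1<TAB>sent2<TAB>label" rows.
def get_data(data_path):
    """Read the raw dataset and return (sentence1, sentence2, label) triples."""
    examples = []
    with open(data_path) as f:
        for line in f:
            sent1, sent2, label = line.rstrip("\n").split("\t")
            examples.append((sent1, sent2, int(label)))
    return examples

def build_bert_input(tokenizer, sent1, sent2):
    """Format a sentence pair the way BERT expects:
    [CLS] sent1 [SEP] sent2 [SEP], with segment id 0 for the first
    segment and 1 for the second."""
    first = ["[CLS]"] + tokenizer.tokenize(sent1) + ["[SEP]"]
    second = tokenizer.tokenize(sent2) + ["[SEP]"]
    input_ids = tokenizer.convert_tokens_to_ids(first + second)
    segment_ids = [0] * len(first) + [1] * len(second)
    return input_ids, segment_ids
```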
Once the data-reformatting code has been modified:

- Run `python train.py`. Check `train()` for default training parameters.
- Model checkpoints will be saved in `runs/`.
- To make predictions on a held-out dataset, run `python predict.py --model_checkpoint runs/<checkpoint directory> --data_path <datapath to held-out data>`. Check `predict()` for default parameters.
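For example (all paths below are hypothetical placeholders):

```bash
# Substitute the checkpoint directory that train.py actually creates
# under runs/ and the path to your own held-out data file.
python train.py
python predict.py --model_checkpoint runs/Jul09_15-30-00_mymachine --data_path data/heldout.tsv
```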
- Code was based on a very useful blog post by Chris McCormick. Most of the code comes directly from his tutorial, but it was refactored with `pytorch-ignite` and adjusted to incorporate the migration changes from `pytorch_pretrained_bert` to `pytorch_transformers`. The migration notes can be found here.
- The use of `pytorch-ignite` to refactor some of the code follows HuggingFace's ConvAI chatbot implementation.
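For readers unfamiliar with `pytorch-ignite`, here is a minimal, self-contained sketch of the training-loop pattern it provides. It is illustrative only (a toy linear model stands in for BERT) and is not the repo's actual training code:

```python
# Illustrative pytorch-ignite pattern only; the repo's actual training
# loop differs (it trains BERT, not this toy model).
import torch
from torch.utils.data import DataLoader, TensorDataset
from ignite.engine import Engine, Events

# Toy stand-ins so the sketch is runnable end to end.
model = torch.nn.Linear(4, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = torch.nn.CrossEntropyLoss()
loader = DataLoader(TensorDataset(torch.randn(32, 4), torch.randint(0, 2, (32,))), batch_size=8)

def update(engine, batch):
    """One training step; ignite calls this for every batch."""
    model.train()
    optimizer.zero_grad()
    inputs, labels = batch
    loss = loss_fn(model(inputs), labels)
    loss.backward()
    optimizer.step()
    return loss.item()

trainer = Engine(update)

@trainer.on(Events.EPOCH_COMPLETED)
def log_epoch(engine):
    print("epoch {}: last batch loss {:.4f}".format(engine.state.epoch, engine.state.output))

trainer.run(loader, max_epochs=3)
```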