Seq2Seq in PyTorch

This is a complete suite for training sequence-to-sequence models in PyTorch. It consists of several models and code to both train and infer using them.

Using this code you can train:

Neural-machine-translation (NMT) models
Language models
Image to caption generation
Skip-thought sentence representations
And more...

Models

Models currently available:

Simple Seq2Seq recurrent model
Recurrent Seq2Seq with attentional decoder
Google neural machine translation (GNMT) recurrent model
Transformer - attention-only model from "Attention Is All You Need"
ByteNet - convolution based encoder+decoder

Datasets

Datasets currently available:

WMT16
OpenSubtitles 2016
COCO image captions

All datasets can be tokenized using 3 available segmentation methods:

Character based segmentation
Word based segmentation
Byte-pair-encoding (BPE) as suggested by bpe with selectable number of tokens.

After choosing a tokenization method, a vocabulary will be generated and saved for future inference.

Training methods

The models can be trained using several methods:

Basic Seq2Seq - given encoded sequence, generate (decode) output sequence. Training is done with teacher-forcing.
Multi Seq2Seq - where several tasks (such as multiple languages) are trained simultaneously by using the data sequences as both input to the encoder and output for decoder.
Image2Seq - used to train image to caption generators.

Usage

Example training scripts are available in scripts folder. Inference examples are available in examples folder.

Name		Name	Last commit message	Last commit date
Latest commit History 120 Commits
examples		examples
scripts		scripts
seq2seq		seq2seq
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
eval.py		eval.py
main.py		main.py
setup.py		setup.py
translate.py		translate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

scripts

scripts

seq2seq

seq2seq

.gitignore

.gitignore

.gitmodules

.gitmodules

LICENSE

LICENSE

README.md

README.md

eval.py

eval.py

main.py

main.py

setup.py

setup.py

translate.py

translate.py

Repository files navigation

Seq2Seq in PyTorch

Models

Datasets

Training methods

Usage

About

Releases

Packages

Languages

License

yangkexin/seq2seq.pytorch

Folders and files

Latest commit

History

Repository files navigation

Seq2Seq in PyTorch

Models

Datasets

Training methods

Usage

About

Resources

License

Stars

Watchers

Forks

Languages