sustain-seq2seq

This repo is a playground for seq2seq models with PyTorch.


  • Tokenization that covers BPE and GPT-2 (from pytorch_transformers) in a single Lookup object. Full tests are required here, as many problems came from mismatched maps and out-of-range ints in the lookup.

    Encoding for BPE:

    • X and y are bordered with <BOS> and <EOS> and padded with <PAD> for the rest

    Encoding for GPT2:

    • X and y are bordered with the <|endoftext|> id on both ends (both bos and eos map to this string) and padded with <PAD> for the rest; the decoder should stop if <|endoftext|> is generated at index > 1. See the encoding sketch below.
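    A minimal sketch of the two conventions above, assuming integer token ids. The function and parameter names are illustrative, not the repo's actual Lookup API:

    ```python
    def encode_bpe(ids, max_len, bos_id, eos_id, pad_id):
        # BPE convention: border with <BOS>/<EOS>, then pad with <PAD> up to max_len.
        seq = [bos_id] + ids[: max_len - 2] + [eos_id]
        return seq + [pad_id] * (max_len - len(seq))

    def encode_gpt2(ids, max_len, endoftext_id, pad_id):
        # GPT-2 convention: <|endoftext|> serves as both bos and eos, then pad with <PAD>.
        seq = [endoftext_id] + ids[: max_len - 2] + [endoftext_id]
        return seq + [pad_id] * (max_len - len(seq))

    # encode_bpe([5, 6, 7], max_len=6, bos_id=1, eos_id=2, pad_id=0) -> [1, 5, 6, 7, 2, 0]
    ```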

Models that need to work:

  • LSTMEncoder + LSTMDecoder with Attention (an attention step is sketched after this list)
  • GPT2Encoder + LSTMDecoder with Attention
  • LSTMEncoder + LSTMDecoder with Attention, Pointer Generator & Coverage
  • GPT2Encoder + LSTMDecoder with Attention, Pointer Generator & Coverage
  • GPT2Encoder + GPT2Decoder with Pointer Generator & Coverage
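As an illustration only (not the repo's code), the attention step shared by the LSTM-decoder models could look roughly like this, assuming batch-first tensors and dot-product scoring:

```python
import torch
import torch.nn.functional as F

def attention_step(decoder_hidden, encoder_outputs, encoder_mask):
    """decoder_hidden: (batch, hidden); encoder_outputs: (batch, src_len, hidden);
    encoder_mask: (batch, src_len) bool, True for non-<PAD> positions."""
    # Dot-product scores between the decoder state and every encoder position.
    scores = torch.bmm(encoder_outputs, decoder_hidden.unsqueeze(2)).squeeze(2)
    scores = scores.masked_fill(~encoder_mask, float("-inf"))  # ignore <PAD> positions
    weights = F.softmax(scores, dim=1)                         # (batch, src_len)
    # Weighted sum of encoder outputs -> context vector fed to the decoder.
    context = torch.bmm(weights.unsqueeze(1), encoder_outputs).squeeze(1)
    return context, weights
```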

Other stuff that needs to be done:

  • Look at validation measures again (BLEU, METEOR, ROUGE)
  • Implement all attention types (low priority)
  • Experiment with multihead attention for RNNs
  • Beam search and/or top-k/top-p sampling as in pytorch_transformers (a filtering sketch follows this list)
  • Check attention masks are working everywhere
  • Optimizer: learning rate scheduling, superconvergence, warm restarts, and cyclical LR. Implement the scheduler (partially done, needs more testing).
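
For the beam search / sampling item, here is a minimal top-k / top-p filtering sketch over next-token logits, in the spirit of the generation utilities that ship with pytorch_transformers but not the library's code; names and defaults are illustrative:

```python
import torch
import torch.nn.functional as F

def filter_logits(logits, top_k=0, top_p=0.0, filter_value=float("-inf")):
    # logits: (vocab,) scores for the next token; returns filtered logits.
    if top_k > 0:
        # Keep only the top_k highest-scoring tokens.
        kth_best = torch.topk(logits, top_k)[0][-1]
        logits = torch.where(logits < kth_best, torch.full_like(logits, filter_value), logits)
    if top_p > 0.0:
        # Keep the smallest set of tokens whose cumulative probability exceeds top_p.
        sorted_logits, sorted_idx = torch.sort(logits, descending=True)
        cum_probs = torch.cumsum(F.softmax(sorted_logits, dim=-1), dim=-1)
        remove = cum_probs > top_p
        remove[1:] = remove[:-1].clone()  # shift right so the first token crossing the threshold is kept
        remove[0] = False
        logits[sorted_idx[remove]] = filter_value
    return logits
```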
