byteNet-tensorflow

Join the chat at https://gitter.im/byteNet-tensorflow/Lobby

This is a TensorFlow implementation of the ByteNet model from DeepMind's paper Neural Machine Translation in Linear Time.

From the abstract:

The ByteNet decoder attains state-of-the-art performance on character-level language modeling and outperforms the previous best results obtained with recurrent neural networks. The ByteNet also achieves a performance on raw character-level machine translation that approaches that of the best neural translation models that run in quadratic time. The implicit structure learnt by the ByteNet mirrors the expected alignments between the sequences.

ByteNet Encoder-Decoder Model:

[Figure: ByteNet encoder-decoder model architecture. Image source: the Neural Machine Translation in Linear Time paper.]

The model applies dilated 1D convolutions to the sequential data, layer by layer, to obtain the source encoding. The decoder then applies masked 1D convolutions to the target sequence (conditioned on the encoder output) to predict the next character in the target sequence. The character generation model is just the ByteNet decoder, while the machine translation model is the combined encoder and decoder.
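To make the masking concrete, here is a minimal NumPy sketch (not the repo's TensorFlow code; the function name and shapes are illustrative) of a causal dilated 1D convolution: the output at position t only sees inputs at t, t-d, t-2d, ..., so the decoder never conditions on future characters.

import numpy as np

def causal_dilated_conv1d(x, w, dilation=1):
    """x: [time, in_channels], w: [filter_width, in_channels, out_channels]."""
    filter_width, in_ch, out_ch = w.shape
    # Left-pad so no output position can see future inputs (the "mask").
    pad = (filter_width - 1) * dilation
    x_padded = np.concatenate([np.zeros((pad, in_ch)), x], axis=0)
    y = np.zeros((x.shape[0], out_ch))
    for t in range(x.shape[0]):
        for k in range(filter_width):
            # Tap k looks back k * dilation steps from position t.
            y[t] += x_padded[t + pad - k * dilation] @ w[filter_width - 1 - k]
    return y

# Stacking such layers with dilations 1, 2, 4, ... grows the receptive
# field exponentially with depth while keeping computation linear in
# sequence length, which is the key idea behind ByteNet.
out = causal_dilated_conv1d(np.random.randn(16, 4), np.random.randn(3, 4, 8), dilation=2)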

Implementation Notes

  1. The model is defined in ByteNet/model.py. ByteNet/ops.py contains the dilated convolution implementation (adapted from tensorflow-wavenet).
  2. The model can be configured by editing model_config.py (see the sketch after this list).
  3. Sub-batch normalisation has not been implemented.
  4. Bags of n-grams have not been used.
  5. The number of residual channels is 512 (configurable in model_config.py).
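For orientation, a hypothetical model_config.py might look like the sketch below. The actual field names and values in this repository may differ; the doubling dilation rates follow the paper.

model_config = {
    'n_source_quant': 256,     # source byte/character vocabulary size
    'n_target_quant': 256,     # target byte/character vocabulary size
    'residual_channels': 512,  # note 5 above
    'filter_width': 3,         # convolution filter width
    # Repeated blocks of exponentially increasing dilations, as in the paper.
    'dilations': [1, 2, 4, 8, 16] * 5,
}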

Requirements

  • Python 2.7.6
  • TensorFlow >= 0.10.0rc0

Datasets

  • The character generation model has been trained on Shakespeare text; the text file is included in the repository at Data/shakespeare.txt.
  • The machine translation model has been trained for German-to-English translation. You can download the News Commentary dataset from http://www.statmt.org/wmt16/translation-task.html.

Training

  1. Text Generation - Configure the model by editing model_config.py, then train on a text corpus with:

python train_generator.py --data_dir=PATH_TO_FOLDER_CONTAINING_TXT_FILES

Run python train_generator.py --help for more options.

  2. Machine Translation - Configure the model by editing model_config.py, then train a translation model from source to target with:

python train_translator.py --source_file=SOURCE_FILE_PATH --target_file=TARGET_FILE_PATH

Run python train_translator.py --help for more options.
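For example, to train the generator on the bundled Shakespeare corpus from the Datasets section (assuming --data_dir accepts the repository's Data folder):

python train_generator.py --data_dir=Data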

Results

Text Generation: Generate new samples using python generate.py --seed="SOME_TEXT_TO_START_WITH".

Machine Translation: translate.py is still in development. You can test sample translations from the dataset using python translate.py.

Sample Generations

ANTONIO:
What say you to this part of this to thee?

KING PHILIP:
What say these faith, madam?

First Citizen:
The king of England, the will of the state,
That thou dost speak to me, and the thing that shall
In this the son of this devil to the storm,
That thou dost speak to thee to the world,
That thou dost see the bear that was the foot,

Translation results: to be updated.

TODO

  • Implement Sub-batch Normalization.
  • Check whether bag-of-n-grams character encoding makes a difference.

References

  • Neural Machine Translation in Linear Time - https://arxiv.org/abs/1610.10099
  • tensorflow-wavenet - https://github.com/ibab/tensorflow-wavenet
