tacotron

Implementation of tacotron, a text to speech deep learning model.

Paper can be found here: tacotron.

Getting Started

Dataset can be retrieved from here. When extracted, file will contain LJSpeech-1.1 as a folder. Move that folder into the root directory beside where train.py is.
Run preprocess.py to preprocess the audio files. Audio files will be generated in a directory called training for the default parameters.
After running the preprocessing steps, we can start training the model.

python train.py

todo.

TODO:

Reimplementation of this for education purposes.

Got lots of reference from: https://github.com/keithito/tacotron Really grateful and appreciate the work of Keith Ito.

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
audio		audio
postprocess		postprocess
preprocess		preprocess
.DS_Store		.DS_Store
.gitignore		.gitignore
Hyperparameters.py		Hyperparameters.py
README.md		README.md
RNNHelper.py		RNNHelper.py
dataset.py		dataset.py
decoder.py		decoder.py
encoder.py		encoder.py
eval.py		eval.py
initializer_util.py		initializer_util.py
network_module.py		network_module.py
requirements.txt		requirements.txt
tacotron.py		tacotron.py
testing.txt		testing.txt
train.py		train.py