Skip to content

maozhiqiang/tacotron2-1

 
 

Repository files navigation

Tacotron2

im

NATURAL TTS SYNTHESIS BY CONDITIONING WAVENET ON MEL SPECTROGRAM PREDICTIONS https://arxiv.org/pdf/1712.05884.pdf

WaveNet: A Generative Model for Raw Audio

Contents

  • Simple LJ Speech DataLoader
  • Mel Spectrogram Prediction network (text to Spectrogram)
  • [TODO] WaveNet Vocoder (Spectrogram to raw audio)

https://arxiv.org/abs/1609.03499

Setup

  1. install pytorch and torchvision:
conda install pytorch -c pytorch
  1. install tensorflow and tensorboardX for logging.
pip install tensorboard
pip install tensorboardX

Usage

train Spectrogram Prediction Network

python train.py

view logs in Tensorboard

tensorboard --logdir runs

im

im

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 92.0%
  • Python 8.0%