This work is the implemention of SiD-Waveflow.
Visit our website for audio samples.
-
Clone our repo and initialize submodule
git clone https://github.com/NVIDIA/waveglow.git cd waveglow git submodule init git submodule update
-
Install requirements
pip3 install -r requirements.txt
-
Install Apex
-
Download CSMSC. In this example it's in
~/BBdata/
-
Train
mkdir checkpoints python train.py -c config.json
For mixed precision training set
"fp16_run": true
onconfig.json
. -
Make test set mel-spectrograms
python mel2samp.py -f traintestset_chn/test_files_copy.txt -o ./inferaudio/chn_mel -c config.json
-
Do inference with your network
ls inferaudio/chn_mel/*.pt > mel_files.txt python3 inference.py -f mel_files.txt -w checkpoints/test1_chn_model -o ./inferaudio --is_fp16 -s 0.6