WGANSing: A Multi-Voice Singing Voice Synthesizer Based on the Wasserstein-GAN

Pritish Chandna, Merlijn Blaauw, Jordi Bonada, Emilia Gómez

Music Technology Group, Universitat Pompeu Fabra, Barcelona

This repository contains the source code for multi-voice singing voice synthesis

Installation

To install, clone the repository and use

pip install -r requirements.txt

to install the packages required.

The main code is in the main.py file.

Training and inference

To use the WGANSing, you will have to download the model weights and place it in the log_dir directory, defined in config.py.

Once setup, you can run the following commands. To train the model:

python main.py -t

.

To synthesize a .lab file: Use

python main.py -s filename alternate_singer_name

If no alternate singer is given then the original singer will be used for synthesis. A list of valid singer names will be displayed if an invalid singer is entered.

You will also be prompted on wether plots showed be displayed or not, press y or Y to view plots.

Evaluation

We will further update the repository in the coming months.

Acknowledgments

The TITANX used for this research was donated by the NVIDIA Corporation. This work is partially supported by the Towards Richer Online Music Public-domain Archives (TROMPA) (H2020 770376) European project.

[1] Duan, Zhiyan, et al. "The NUS sung and spoken lyrics corpus: A quantitative comparison of singing and speech." 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference. IEEE, 2013.

[2] Blaauw, Merlijn, and Jordi Bonada. "A Neural Parametric Singing Synthesizer Modeling Timbre and Expression from Natural Songs." Applied Sciences 7.12 (2017): 1313.

[3] Blaauw, Merlijn, et al. “Data efficient voice cloning forneural singing synthesis,” in2019 IEEE International Conference onAcoustics, Speech and Signal Processing (ICASSP), 2019.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
stats		stats
README.md		README.md
config.py		config.py
data_pipeline.py		data_pipeline.py
main.py		main.py
modules_tf.py		modules_tf.py
prep_data_nus.py		prep_data_nus.py
reduce.py		reduce.py
requirements.txt		requirements.txt
utils.py		utils.py
vocoder.py		vocoder.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stats

stats

README.md

README.md

config.py

config.py

data_pipeline.py

data_pipeline.py

main.py

main.py

modules_tf.py

modules_tf.py

prep_data_nus.py

prep_data_nus.py

reduce.py

reduce.py

requirements.txt

requirements.txt

utils.py

utils.py

vocoder.py

vocoder.py

Repository files navigation

WGANSing: A Multi-Voice Singing Voice Synthesizer Based on the Wasserstein-GAN

Pritish Chandna, Merlijn Blaauw, Jordi Bonada, Emilia Gómez

Music Technology Group, Universitat Pompeu Fabra, Barcelona

Installation

Training and inference

Evaluation

Acknowledgments

About

Releases

Packages

Languages

entn-at/Multi_Voice_Sing_Speak_Sing

Folders and files

Latest commit

History

Repository files navigation

WGANSing: A Multi-Voice Singing Voice Synthesizer Based on the Wasserstein-GAN

Pritish Chandna, Merlijn Blaauw, Jordi Bonada, Emilia Gómez

Music Technology Group, Universitat Pompeu Fabra, Barcelona

Installation

Training and inference

Evaluation

Acknowledgments

About

Resources

Stars

Watchers

Forks

Languages