ZHAW deep voice

Customized by Daniel Neururer

The ZHAW deep voice is a package of multiple neural networks that try resolving the speaker clustering task. The goal is to provide a uniform way of data-access, -preprocession and analysis fo the results.

Note that the suite needs the TIMIT Dataset to function at this point. This is a paid product from the LDC and can be obtained here. This data also needs to be processed using the sph2pipe tool and be put in the folder common/data/training/TIMIT

Using deep voice

If you simply want to use it, you can let docker do the work for you and let it import all needed packages.

In any way, whether you fork and pull the source code or let docker handle it for you, the whole suite is controllable over a one file interface, controller.py.

To configure the suite, there is a config file located at common/config.cfg. All neural networks can be configured in this file. There are some general flag that can be set:

setup Create all
network specifiy which network should be used. Available: 'pairwise_lstm', 'pairwise_kldiv', 'flow_me', 'luvo' and 'all' (without the single quotes)
train Specify to train the chosen network
test Specify to test the chosen network
plot Specify to plot the results of the chosen network. If network is 'all', all results will be displayed in one single plot
clear Clear the folder in experiments
debug Set the logging level of Tensorflow to Debug
best Just the best results of the networks will be used in -plot
val_number specify which speaker number you want to use (40, 60, 80) to test the networks

There are further parameters to configure for every neural network in this suite.

As an example, you want to train, and test but not plot the network pairwise_lstm. you would call:

controller.py While using the following configuration in the config-file:

[common] setup = False network = pairwise_lst train = True test = True clear = False debug = False plot = False best = False

...

General remarks

Before you start with your training you should run the controller once with the setup flag. This can take a while, approximately around 10 minutes.

Name		Name	Last commit message	Last commit date
Latest commit History 193 Commits
common		common
networks		networks
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
Dockerfile-cpu		Dockerfile-cpu
Dockerfile-cpu_ivector		Dockerfile-cpu_ivector
Dockerfile_ivector		Dockerfile_ivector
README.md		README.md
build_docker.bat		build_docker.bat
build_docker_cpu.bat		build_docker_cpu.bat
clear_all_containers.bat		clear_all_containers.bat
container_shell.sh		container_shell.sh
controller.py		controller.py
create_container.sh		create_container.sh
keras_pairwise_kldiv.py		keras_pairwise_kldiv.py
run_and_enter_container_shell_docker.bat		run_and_enter_container_shell_docker.bat
setup_suite.sh		setup_suite.sh
stop_and_remove_docker.bat		stop_and_remove_docker.bat

neurudan/ZHAW_deep_voice

Folders and files

Latest commit

History

Repository files navigation

ZHAW deep voice

Using deep voice

General remarks

About

Resources

Stars

Watchers

Forks

Languages