world_merlin

To build a new voice with the Merlin toolkit using CLUSTERGEN's question set:

###Simple Steps:

####0. Copy this folder into $FESTVOXDIR/src/world_merlin

####1. Set up environment variables:

export ESTDIR=/path/to/speech_tools
export FESTVOXDIR=/path/to/festvox
export SPTKDIR=/path/to/SPTK
THEANO_FLAGS="floatX=float32"
export THEANO_FLAGS
PYTHONPATH=:/usr/lib/python2.7/dist-packages
export PYTHONPATH
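
Before continuing, it can help to confirm that the three toolkit paths are actually set. The following is a minimal sketch, not part of world_merlin: `check_env` is a hypothetical helper name, and it only checks that each variable is non-empty and points to an existing directory.

```shell
# Hypothetical helper (not shipped with world_merlin): report every required
# variable that is unset or does not point to an existing directory.
check_env() {
    status=0
    for name in ESTDIR FESTVOXDIR SPTKDIR; do
        # Look up the value of the variable whose name is stored in $name.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "$name is not set"
            status=1
        elif [ ! -d "$value" ]; then
            echo "$name points to a missing directory: $value"
            status=1
        fi
    done
    return $status
}
```

Calling `check_env` right after the exports prints one line per problem and returns a non-zero status if anything is missing.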

####2. Make a new voice directory and set up the initial directory structure

mkdir <institute>_<lexicon>_<voicename>

For example:

mkdir cmu_us_pnb
cd cmu_us_pnb
$FESTVOXDIR/src/world_merlin/setup_world_merlin cmu us pnb

For Indic languages, run:

$FESTVOXDIR/src/world_merlin/setup_world_merlin_indic cmu indic <lang> pnb

where <lang> is one of: asm, ben, guj, hin, kan, mar, pan, raj, tam, tel.
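
A mistyped language code is easy to miss, so a wrapper script might validate it first. This is a small sketch under that assumption; `is_supported_indic_lang` is a hypothetical name, and the list of codes is copied from the line above.

```shell
# Hypothetical helper: succeed only if $1 is one of the Indic language
# codes listed in this README.
is_supported_indic_lang() {
    case "$1" in
        asm|ben|guj|hin|kan|mar|pan|raj|tam|tel) return 0 ;;
        *) return 1 ;;
    esac
}
```

For example, `is_supported_indic_lang hin && $FESTVOXDIR/src/world_merlin/setup_world_merlin_indic cmu indic hin pnb` would run the setup only for a recognised code.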

####3. Copy the transcript in the Festival format:

cp <TRANSCRIPT_DIR>/txt.done.data etc/txt.done.data

The file must be named txt.done.data, and each entry must have the format:

( wavfile_name "Transcription of wavefile." )

For example:

( arctic_a0001 "Author of the danger trail, Philip Steels, etc." )
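
A quick format check can catch malformed entries before the build starts. The sketch below is an assumption-laden illustration, not part of the toolkit: `check_txt_done_data` is a hypothetical helper, and its pattern only accepts one entry per line with an ID followed by a double-quoted transcription inside parentheses.

```shell
# Hypothetical helper: print (with line numbers) every line of file $1 that
# does not look like   ( utterance_id "text" )   and fail if any is found.
check_txt_done_data() {
    # grep -v selects the lines that do NOT match the expected pattern.
    if grep -n -v -E '^\( *[^ "()]+ +".*" *\)[[:space:]]*$' "$1"; then
        return 1    # at least one malformed line was printed
    fi
    return 0
}
```

Running `check_txt_done_data etc/txt.done.data` prints nothing when every line matches the pattern.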

####4. Copy wav files from your directory and power-normalize them:

./bin/get_wavs <WAVDIR>/*.wav
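
Since each transcript entry names a wav file, it may be worth confirming that every utterance has matching audio. The following sketch assumes the copied files end up as `wav/<wavfile_name>.wav` (the `wav/` directory is the one used in step 5); `missing_wavs` is a hypothetical helper, not a world_merlin script.

```shell
# Hypothetical helper: list utterance IDs from a txt.done.data file ($1)
# that have no matching <id>.wav file in the wav directory ($2).
missing_wavs() {
    # Extract the first token after the opening parenthesis of each entry.
    sed -n 's/^( *\([^ ]*\) .*/\1/p' "$1" | while read -r id; do
        [ -f "$2/$id.wav" ] || echo "$id"
    done
}
```

For example, `missing_wavs etc/txt.done.data wav` prints one ID per missing file and nothing when the two sets agree.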

####5. Remove extra silences (optional)

Remove leading and trailing silences:

./bin/prune_silence wav/*.wav

Remove middle silences:

./bin/prune_middle_silences wav/*.wav

####6. Run the voice building script.

./bin/build_merlin_world_voice

#####Note: the last step in the above script assumes the default location and name of the trained neural network model.

Steps in build_merlin_world_voice:

To run them one at a time, continue after step 5 above.

####6. Dump aligned WORLD feats with CLUSTERGEN's features:

./bin/dump_world_feats

####7. Make train/test/val splits.

./bin/make_file_id_list.sh `pwd`
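
This README does not describe how make_file_id_list.sh chooses its splits. Purely as an illustration of what a train/val/test split of utterance IDs involves, the sketch below holds out the last IDs of a list; `split_ids` and the output file names `train.txt`, `val.txt`, and `test.txt` are hypothetical and may differ from what the script produces.

```shell
# Illustrative only, not the logic of make_file_id_list.sh.
# Split a list of utterance IDs (file $1, one ID per line) into three files
# in directory $2: the last $3 IDs go to test, the $3 before them to val,
# and everything else to train.
split_ids() {
    ids=$1; out=$2; n=$3
    total=$(wc -l < "$ids")
    train=$((total - 2 * n))
    if [ "$train" -le 0 ]; then
        echo "not enough utterances for a held-out size of $n" >&2
        return 1
    fi
    head -n "$train" "$ids" > "$out/train.txt"
    tail -n "$((2 * n))" "$ids" | head -n "$n" > "$out/val.txt"
    tail -n "$n" "$ids" > "$out/test.txt"
}
```

Holding out a fixed tail of the list keeps the split reproducible between runs, which makes results from different training runs easier to compare.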

####8. Set up the configuration file

./bin/setup_conf.sh ss_dnn/feed_forward_dnn_WORLD_template.conf

####9. Train DNN

python ss_dnn/merlin_scripts/src/run_dnn.py ss_dnn/feed_forward_dnn_WORLD.conf

####10. Resynthesize wavefiles

MODEL_NAME=`cat etc/gen_model_file_name`
./bin/merlin_resynthesis.sh ss_dnn/gen/$MODEL_NAME
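
If training did not finish, etc/gen_model_file_name may be missing or empty, and the resynthesis command would then receive a bare `ss_dnn/gen/` path. A minimal guard, assuming the file holds the model name on its first line, could look like this; `read_model_name` is a hypothetical helper.

```shell
# Hypothetical helper: print the model name stored in file $1, or fail
# with a message on stderr if the file is missing or empty.
read_model_name() {
    if [ ! -s "$1" ]; then
        echo "missing or empty: $1" >&2
        return 1
    fi
    head -n 1 "$1"
}
```

It can then gate the resynthesis step: `MODEL_NAME=$(read_model_name etc/gen_model_file_name) && ./bin/merlin_resynthesis.sh "ss_dnn/gen/$MODEL_NAME"`.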
