Exploring Machine Speech Chain for Domain Adaptation

This is an implementation of the paper, based on ESPnet. If you have any questions, please email me (11930381@mail.sustech.edu.cn).

Requirements

Follow the ESPnet installation instructions.
You should use torch==1.7.1.
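For example, a minimal setup might look like the following (the clone URL and the use of ESPnet's standard tools/ installer are assumptions based on a typical ESPnet layout; adapt them to your environment):

git clone https://github.com/fengpeng-yue/ASRTTS.git
cd ASRTTS/tools
make                        # build Kaldi and the other tool dependencies, as in the ESPnet installation guide
pip install torch==1.7.1    # pin the PyTorch version required by this repository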

Pretraining

You should download LibriSpeech and LibriTTS manually.
LibriSpeech: run ./pretrain_asr.sh under egs/librispeech/asr (the recipe trains the ASR model on LibriSpeech train-clean-460).
LibriTTS: run ./pretrain_tts.sh under egs/libritts/tts (the recipe trains the TTS model on LibriTTS train-clean-460).
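Concretely, the two pretraining runs can be launched like this (starting from the repository root):

# ASR pretraining on LibriSpeech train-clean-460
cd egs/librispeech/asr
./pretrain_asr.sh

# TTS pretraining on LibriTTS train-clean-460
cd ../../libritts/tts
./pretrain_tts.sh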

Adaptation training

You should download TED-LIUM-1 manually. We provide the punctuated TED-LIUM text under the egs/tedlium/data path.
Execution directory (egs/tedlium/asrtts):
Run ./prepare_data.sh to prepare the JSON files for training, then run ./joint_training.sh for joint training.
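A typical run from the repository root would be:

cd egs/tedlium/asrtts
./prepare_data.sh       # prepare the JSON files used for joint training
./joint_training.sh     # run the joint ASR/TTS adaptation training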

Experimental options in joint_training.sh for the three-stage training (a sketch of a full stage configuration follows the settings below):

Stage 1:

update_asr=true
update_tts=false
update_tts2asr=true
filter_data=true
filter_thre=0.58
unpaired_aug=true

Stage 2:

asrexpdir= # change this from the ASR baseline path to the adapted ASR model from Stage 1
update_asr=false
update_tts=true
update_tts2asr=true
filter_data=false
unpaired_aug=false
tts_loss_weight=0.005

Stage 3:

ttsexpdir= # change this from the TTS baseline path to the adapted TTS model from Stage 2
update_asr=false
update_tts=true
update_tts2asr=true
filter_data=true
filter_thre=0.58
unpaired_aug=true
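As a concrete illustration, the Stage 2 settings above correspond to a variable block like the following inside joint_training.sh (a sketch only; the exact layout of the script is not shown here, the experiment directory name is a hypothetical placeholder, and asrexpdir must point to your own Stage 1 output):

# Stage 2 settings: ASR frozen, TTS updated, data filtering and unpaired augmentation disabled
asrexpdir=exp/<your_stage1_asr_adaptation_dir>   # hypothetical path; use the ASR model adapted in Stage 1
update_asr=false
update_tts=true
update_tts2asr=true
filter_data=false
unpaired_aug=false
tts_loss_weight=0.005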
