Skip to content

HMM-based ASR systems trained on CommonVoice(zh-TW) using Kaldi.

Notifications You must be signed in to change notification settings

jerrykuo7727/ASR-common-voice-zh-tw

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ASR-common-voice-zh-tw

Automatic Speech Recognition (ASR) system trained on CommonVoice (zh-TW) dataset with Kaldi toolkit.

Simply run shrun.sh to train and test the three models below:

  1. Monophone
  2. Triphone (1st pass): Delta + Delta-Delta
  3. Triphone (2nd pass): LDA + MLLT

If you're familiar with the recipe of TIMIT, you just need to read these codes :)

scripts/prepare_data.py : Preprocess CommonVoice (zh-TW) for usage of Kaldi.
scripts/prepare_data.ipynb : Full details and explanations of the prepare_data.py.

See requirements.txt to check if any required package is not installed.

About

HMM-based ASR systems trained on CommonVoice(zh-TW) using Kaldi.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published