GitHub

koreanVoiceSample

This project is based on https://github.com/GSByeon/multi-speaker-tacotron-tensorflow

To make Korean Voice Sample data like ljspeech (https://keithito.com/LJ-Speech-Dataset/)

Generate custom datasets

The datasets directory should look like:

datasets
├── default
│   ├── metadata.csv
│   └── wavs
│       ├── 1.wav
│       ├── 2.wav
│       ├── 3.wav
│       └── ...

Metadata.csv contains: wav-filesname|text|text-normalization

Install

python pip install -r requirements.txt

python -c "import nltk; nltk.download('punkt')"

if you use windows, ffmpeg is needed

http://adaptivesamples.com/how-to-install-ffmpeg-on-windows/

if you have problem while install hangulize, check the link below

https://github.com/sublee/hangulize

Usage

Each script execute below commands. (explain with son dataset)

To automate an alignment between sounds and texts, prepare GOOGLE_APPLICATION_CREDENTIALS to use Google Speech Recognition API. To get credentials, read this.

export GOOGLE_APPLICATION_CREDENTIALS="YOUR-GOOGLE.CREDENTIALS.json"

Download speech(or video) and text.

python -m dataproc.download

Segment all audios on silence.

python -m audio.silence --audio_pattern "./datasets/default/wavs/*.wav" --method=pydub

By using Google Speech Recognition API, we predict sentences for all segmented audios.

python -m recognition.google --audio_pattern "./datasets/default/wavs/..wav"

Normailize korean text (ex, number )

python -m recognition.normalize --recognition_path "./datasets/default/recognition.json"

End

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
audio		audio
dataproc		dataproc
recognition		recognition
text		text
utils		utils
.gitignore		.gitignore
JPype1-0.6.3-cp36-cp36m-win_amd64.whl		JPype1-0.6.3-cp36-cp36m-win_amd64.whl
README.md		README.md
hparams.py		hparams.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

audio

audio

dataproc

dataproc

recognition

recognition

text

text

utils

utils

.gitignore

.gitignore

JPype1-0.6.3-cp36-cp36m-win_amd64.whl

JPype1-0.6.3-cp36-cp36m-win_amd64.whl

README.md

README.md

hparams.py

hparams.py

requirements.txt

requirements.txt

Repository files navigation

koreanVoiceSample

Generate custom datasets

Install

if you use windows, ffmpeg is needed

if you have problem while install hangulize, check the link below

Usage

About

Releases

Packages

Languages

bookendus/koreanVoiceSample

Folders and files

Latest commit

History

Repository files navigation

koreanVoiceSample

Generate custom datasets

Install

if you use windows, ffmpeg is needed

if you have problem while install hangulize, check the link below

Usage

About

Resources

Stars

Watchers

Forks

Languages