database-construction

Tools for the construction of an articulatory speech database.

Starting from a plain text corpus, such as a cleaned wikipedia dump, these tools are designed to help the user create a phonetically balanced list of sentences to use as prompts for data collection. The tools were developed for Italian, using the PAISA plain text corpus as the starting point http://www.corpusitaliano.it/en/index.html

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
README.md		README.md
all_trans.tar.gz		all_trans.tar.gz
text2phones		text2phones
text2phonesEN		text2phonesEN
transcribe.py		transcribe.py
transcribe_brown_nltk.py		transcribe_brown_nltk.py
triphones.py		triphones.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

all_trans.tar.gz

all_trans.tar.gz

text2phones

text2phones

text2phonesEN

text2phonesEN

transcribe.py

transcribe.py

transcribe_brown_nltk.py

transcribe_brown_nltk.py

triphones.py

triphones.py

Repository files navigation

database-construction

Contents

About

Releases

Packages

Languages

jjberry/database-construction

Folders and files

Latest commit

History

Repository files navigation

database-construction

Contents

About

Resources

Stars

Watchers

Forks

Languages