Say hello Odessa!

Odessa is a basic speech recognition system that identifies a small set of specific phrases in my own speech and responds accordingly, with... spunk.

The speech detection and signal processing were all written from scratch in Python, using the numpy, scipy, matplotlib, sounddevice and soundfile libraries.

All audio training samples for the phrases can be found here.
All generated HMM binaries for the audio samples can be found here.
Speech features are generated from a sound signal by asr_feature_builder.py.
HMM binaries are trained from the audio samples by em.py.
The program itself is run by speech_recognizer.py, which funnels identified live speech through recognition.
Live speech segments are detected and sampled to disk by speech_sampler.py.
The hot word "odessa" and the phrase recognitions that follow it are tracked by speech_state_machine.py.
Audio samples are funneled into training with different HMM parameters by trainer.py, which looks for an optimal configuration and generates "Training Results.xlsx".
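
To give a feel for the hot word flow described above, here is a minimal sketch of a two-state machine: it ignores everything until it hears the hot word, then passes along the next recognized phrase and goes back to waiting. This is not the code in speech_state_machine.py. The class name, the method name and the example phrases are all hypothetical, and the recognizer output is assumed to arrive as plain strings.

```python
from enum import Enum, auto


class State(Enum):
    WAITING_FOR_HOT_WORD = auto()
    WAITING_FOR_PHRASE = auto()


class HotWordStateMachine:
    """Hypothetical sketch, not the implementation in speech_state_machine.py."""

    def __init__(self, hot_word="odessa", phrases=()):
        self.hot_word = hot_word
        self.phrases = set(phrases)  # phrases the system knows how to answer
        self.state = State.WAITING_FOR_HOT_WORD

    def feed(self, recognized):
        """Advance on one recognized utterance; return a phrase to act on, or None."""
        if self.state is State.WAITING_FOR_HOT_WORD:
            # Everything is ignored until the hot word is heard.
            if recognized == self.hot_word:
                self.state = State.WAITING_FOR_PHRASE
            return None
        # A phrase was expected: whatever was heard, go back to waiting.
        self.state = State.WAITING_FOR_HOT_WORD
        return recognized if recognized in self.phrases else None


# Example phrases are made up for illustration.
machine = HotWordStateMachine(phrases={"what time is it", "play music"})
for heard in ["play music", "odessa", "play music"]:
    action = machine.feed(heard)
    if action is not None:
        print("respond to:", action)
```

The first "play music" is dropped because the hot word has not been spoken yet; only the one that follows "odessa" is acted on. Resetting after every attempt, recognized or not, is one possible design choice that keeps a stray phrase from triggering a response later.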

The final application looks like this!


samuraijourney/odessa
