KerasBasedSpeechClassifier

Not an active project.

Using Keras to build a classifier for speech. Not necessarily a speaker identifier

Setup

Python 2.7

numpy is required

Wav files excluded from repo due to possible legal issues.

Configure boot.py to use the branch and input that you want.

Or use rnn_mfcc.py and rnn_raw.py directly.

This branch is currently not working. (Results similar to random guessing.)

Contains 2 branches, neither finished:

rnn_mfcc.py

Uses a MFCC feature set. MFCC being created via CMU Sphinx 4.

rnn_raw.py

Uses data from uncompressed mono wav files. Because that's the corpus we have.

License: MIT

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
boot.py		boot.py
findBestModel.py		findBestModel.py
fuzzyHelper.py		fuzzyHelper.py
infoAdultChild.py		infoAdultChild.py
infoMaleFemale.py		infoMaleFemale.py
infoSKW.py		infoSKW.py
infoUID.py		infoUID.py
mfcPreprocessor.py		mfcPreprocessor.py
modelPlayer.py		modelPlayer.py
outputVisualizer.py		outputVisualizer.py
overlappingSamples.py		overlappingSamples.py
preprocessor.py		preprocessor.py
rnn_mfcc.py		rnn_mfcc.py
rnn_raw.py		rnn_raw.py
speakerInfo.py		speakerInfo.py
unpackMFC.py		unpackMFC.py
wavOpener.py		wavOpener.py