Speaker-Recognition

Automatic Speaker Recognition algorithms in Python

This repository contains Python programs that can be used for Automatic Speaker Recognition. ASR is done by extracting MFCCs and LPCs from each speaker and then forming a speaker-specific codebook of the same by using Vector Quantization (I like to think of it as a fancy name for NN-clustering). After that, the system is trained and tested for 8 different speakers.

To test the algorithm, run test.py from the terminal. Certain parameters are open to be changed, such as the order of LPC coefficients, the number of Mel filterbanks and the number of centroids in each codebook. Everything is included in the repository, including .wav files for testing and training, hence cloning it and running test.py should work.

A PDF has been included that explains the theory and provides links to relevant websites and projects.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
src		src
test		test
train		train
.gitignore		.gitignore
ASR_report.pdf		ASR_report.pdf
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src

src

test

test

train

train

.gitignore

.gitignore

ASR_report.pdf

ASR_report.pdf

LICENSE

LICENSE

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

Speaker-Recognition

About

Releases

Packages

Contributors 2

Languages

License

orchidas/Speaker-Recognition

Folders and files

Latest commit

History

Repository files navigation

Speaker-Recognition

About

Resources

License

Stars

Watchers

Forks

Languages