orchidas/Speaker-Recognition

Speaker-Recognition

Automatic Speaker Recognition algorithms in Python

This repository contains Python programs for Automatic Speaker Recognition (ASR). Recognition works by extracting Mel-frequency cepstral coefficients (MFCCs) and linear predictive coding (LPC) coefficients from each speaker's recordings, then building a speaker-specific codebook from those features using Vector Quantization (I like to think of it as a fancy name for nearest-neighbour clustering). The system is then trained and tested on 8 different speakers.
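The codebook idea can be sketched in a few lines of plain Python. This is a minimal illustration and not the code from this repository: it assumes plain k-means as the clustering step (the repository may use a different VQ training procedure), and the names `train_codebook`, `distortion`, `identify` and `fake_features` are hypothetical. Random 3-dimensional vectors stand in for real MFCC/LPC feature frames.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(codebook, vector):
    """Return (index, squared distance) of the centroid closest to vector."""
    distances = [squared_distance(c, vector) for c in codebook]
    index = min(range(len(codebook)), key=distances.__getitem__)
    return index, distances[index]


def train_codebook(features, n_centroids, n_iter=20, seed=0):
    """Build a codebook with plain k-means (an assumption, see lead-in)."""
    rng = random.Random(seed)
    # Start from randomly chosen feature vectors.
    codebook = [list(v) for v in rng.sample(features, n_centroids)]
    for _ in range(n_iter):
        clusters = [[] for _ in codebook]
        for v in features:
            idx, _ = nearest(codebook, v)
            clusters[idx].append(v)
        for i, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster is empty
                codebook[i] = [sum(col) / len(members) for col in zip(*members)]
    return codebook


def distortion(codebook, features):
    """Average squared distance from each vector to its nearest centroid."""
    return sum(nearest(codebook, v)[1] for v in features) / len(features)


def identify(codebooks, features):
    """Pick the speaker whose codebook gives the lowest distortion."""
    return min(codebooks, key=lambda name: distortion(codebooks[name], features))


def fake_features(center, n, rng):
    """Synthetic stand-in for per-frame features of one speaker."""
    return [[c + rng.gauss(0, 0.3) for c in center] for _ in range(n)]


rng = random.Random(1)
train = {
    "speaker_a": fake_features([0.0, 0.0, 0.0], 200, rng),
    "speaker_b": fake_features([3.0, 3.0, 3.0], 200, rng),
}
codebooks = {name: train_codebook(f, n_centroids=4) for name, f in train.items()}

test_utterance = fake_features([3.0, 3.0, 3.0], 50, rng)
predicted = identify(codebooks, test_utterance)
print(predicted)
```

Training produces one small codebook per speaker. At test time an unknown utterance is assigned to the speaker whose codebook represents its feature vectors with the least average distortion.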

To test the algorithm, run test.py from the terminal. Several parameters can be changed, such as the LPC order, the number of Mel filterbank channels, and the number of centroids in each codebook. Everything is included in the repository, including the .wav files for training and testing, so cloning it and running test.py should work.
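To give a feel for what two of these parameters control, here is a rough sketch, again not taken from this repository. It assumes the autocorrelation method with the Levinson-Durbin recursion for LPC and the common `2595 * log10(1 + f / 700)` Mel formula. The function names, the 8000 Hz upper frequency and the choice of 12 filters are all hypothetical.

```python
import math


def autocorrelation(frame, max_lag):
    """Autocorrelation of one frame for lags 0..max_lag."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc(frame, order):
    """Levinson-Durbin recursion.

    Returns (a, error) where a[k-1] is the coefficient a_k in
    frame[n] ~ sum_k a_k * frame[n - k], k = 1..order.
    Assumes the frame is not all zeros (otherwise r[0] is 0).
    """
    r = autocorrelation(frame, order)
    a = [0.0] * (order + 1)  # a[0] is unused
    error = r[0]
    for i in range(1, order + 1):
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / error  # reflection coefficient
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        error *= 1.0 - k * k
    return a[1:], error


def hz_to_mel(hz):
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_center_frequencies(n_filters, f_min, f_max):
    """Centre frequencies (Hz) of n_filters filters equally spaced in Mel."""
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_filters + 1)
    return [mel_to_hz(lo + step * (i + 1)) for i in range(n_filters)]


# A decaying exponential obeys x[n] = 0.9 * x[n - 1], so an order-2
# predictor should recover roughly [0.9, 0.0].
frame = [0.9 ** n for n in range(200)]
coeffs, residual = lpc(frame, order=2)
print(coeffs)

centers = mel_center_frequencies(n_filters=12, f_min=0.0, f_max=8000.0)
print([round(c) for c in centers])
```

A higher LPC order gives more coefficients per frame and a finer model of the spectral envelope. More Mel filters give finer frequency resolution before the cepstral step. Both increase the dimensionality of the vectors that go into the codebook.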

A PDF has been included that explains the theory and provides links to relevant websites and projects.
