koyo922/leap-scd

Speaker Change Detection

Repository for the project done at LEAP Lab, Indian Institute of Science, under the guidance of Neeraj Sharma and Prof. Sriram Ganapathy. The aim was to develop a real-time, speaker-independent model that detects speaker changes in a given WAV file.

Dataset

The TIMIT dataset was used. Three disjoint sets of speakers were created, and only the SX and SI sentences of each speaker were used when creating the files. The SA sentences, which every TIMIT speaker reads, were therefore left out.
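The split described above can be sketched as follows. This is a minimal illustration, not the code in this repository: the function names, the split fractions, and the seed are all hypothetical, and it assumes the standard TIMIT layout in which the speaker ID is the parent directory of each utterance (for example TRAIN/DR1/FCJF0/SX127.WAV).

```python
import random
from collections import defaultdict
from pathlib import PurePosixPath


def is_sx_or_si(path):
    """Return True for SX/SI utterances; anything else (e.g. SA) is rejected."""
    return PurePosixPath(path).name.upper().startswith(("SX", "SI"))


def split_speakers(paths, fractions=(0.8, 0.1, 0.1), seed=0):
    """Group SX/SI utterances by speaker, then split the speakers into three disjoint sets.

    The fractions and seed are illustrative assumptions, not the values
    used in this project. Each returned dict maps speaker ID -> file list.
    """
    by_speaker = defaultdict(list)
    for p in paths:
        if is_sx_or_si(p):
            # Assumption: the parent directory name is the speaker ID.
            by_speaker[PurePosixPath(p).parent.name].append(p)

    # Sort before shuffling so the result depends only on the seed.
    speakers = sorted(by_speaker)
    random.Random(seed).shuffle(speakers)

    n_first = int(len(speakers) * fractions[0])
    n_second = int(len(speakers) * fractions[1])
    groups = (
        speakers[:n_first],
        speakers[n_first:n_first + n_second],
        speakers[n_first + n_second:],  # remainder goes to the third set
    )
    return [{s: sorted(by_speaker[s]) for s in g} for g in groups]
```

Splitting by speaker rather than by utterance is what keeps the sets disjoint: no speaker's voice appears in more than one set, which matters for a speaker-independent model.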

Directory contents

  • lists contains the file lists used for training, testing, and validation, along with the scripts that generate them.
  • models contains the classifiers, such as DNNs and CNNs, for the different features.
  • scripts contains the data-processing code, such as the scripts that generate gammatone, fbank, and other features, as well as the scripts that combine the generated features according to context.
  • resources contains plots used to visualize the features and some generic diagrams used in the presentation.
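Combining features according to context most likely means stacking each frame with its neighbouring frames before it is fed to a classifier. The sketch below shows that idea under this assumption. The function name, the window sizes, and the edge-padding strategy are hypothetical and are not taken from the scripts in this repository.

```python
def splice_context(frames, left=2, right=2):
    """Concatenate each frame with `left` previous and `right` following frames.

    `frames` is a list of per-frame feature vectors (lists of floats).
    At the edges, the first or last frame is repeated as padding
    (an assumed strategy, chosen here for simplicity).
    """
    if not frames:
        return []
    last = len(frames) - 1
    spliced = []
    for t in range(len(frames)):
        window = []
        for offset in range(-left, right + 1):
            # Clamp the index so edge frames reuse the nearest valid frame.
            idx = min(max(t + offset, 0), last)
            window.extend(frames[idx])
        spliced.append(window)
    return spliced


# Three 1-dimensional frames with one frame of context on each side.
print(splice_context([[1.0], [2.0], [3.0]], left=1, right=1))
# prints [[1.0, 1.0, 2.0], [1.0, 2.0, 3.0], [2.0, 3.0, 3.0]]
```

Each output vector has (left + 1 + right) times the original feature dimension, so the classifier sees a short temporal window around every frame.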

References

  • htkmfc.py from here
  • Base Gammatone scripts from this page

Contact

I can be contacted at sidm@iitk.ac.in
