Speaker Change Detection

Repository for the project done at LEAP Lab, Indian Institute of Science, under the guidance of Neeraj Sharma and Prof. Sriram Ganapathy. The aim was to develop a real-time, speaker-independent model that detects speaker changes in a given wave file.

Dataset

The TIMIT dataset was used. Three disjoint sets of speakers were created, and only the SX and SI sentences of each speaker were used when creating the files.
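The speaker-disjoint split described above could be sketched roughly as follows. This is a minimal illustration, not the code in this repository: the function names, the 80/10/10 fractions, and the seed are assumptions, and the only TIMIT facts relied on are that each utterance sits in a directory named after its speaker and that file names begin with the sentence type (SA, SI, or SX).

```python
import random
from pathlib import PurePosixPath


def is_sx_or_si(path):
    """Return True for TIMIT SX/SI utterances; SA sentences are excluded."""
    stem = PurePosixPath(path).stem.upper()
    return stem.startswith(("SX", "SI"))


def split_speakers(paths, fractions=(0.8, 0.1, 0.1), seed=0):
    """Assign every speaker to exactly one of train/val/test.

    The fractions and seed are illustrative assumptions, not the
    values used for this project.
    """
    by_speaker = {}
    for p in paths:
        if is_sx_or_si(p):
            # The parent directory is the speaker ID, e.g. FCJF0.
            speaker = PurePosixPath(p).parent.name
            by_speaker.setdefault(speaker, []).append(p)

    speakers = sorted(by_speaker)
    random.Random(seed).shuffle(speakers)

    n_train = int(fractions[0] * len(speakers))
    n_val = int(fractions[1] * len(speakers))
    groups = {
        "train": speakers[:n_train],
        "val": speakers[n_train:n_train + n_val],
        "test": speakers[n_train + n_val:],
    }
    # Map each split name to the sorted list of its utterance paths.
    return {
        name: sorted(f for s in group for f in by_speaker[s])
        for name, group in groups.items()
    }
```

Splitting by speaker rather than by file means no voice heard in training reappears in validation or testing, which is what a speaker-independent model needs.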

Directory contents

lists: lists of the files used for training, testing, and validation, along with the scripts that generate them.
models: classifiers, such as DNNs and CNNs, for the different features.
scripts: data-processing code, such as scripts for generating gammatone, fbank, and other features, and scripts for combining the generated features according to context.
resources: plots used to visualize the features, and some generic diagrams used in the presentation.
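As an illustration of what combining features according to context usually means, the sketch below stacks each frame with its neighbouring frames. It is an assumption about the general technique, not a copy of the scripts in this repository: the function name `splice_context`, the default context widths, and the edge-padding choice are all hypothetical.

```python
import numpy as np


def splice_context(features, left=5, right=5):
    """Stack each frame with its neighbours.

    features: array of shape (T, D), one row per frame.
    Returns an array of shape (T, (left + 1 + right) * D).
    The context widths are illustrative defaults.
    """
    features = np.asarray(features)
    n_frames = features.shape[0]
    # Repeat the edge frames so the first and last frames get full context.
    padded = np.pad(features, ((left, right), (0, 0)), mode="edge")
    # Window i holds, for every frame t, the frame at offset (i - left).
    windows = [padded[i:i + n_frames] for i in range(left + 1 + right)]
    return np.concatenate(windows, axis=1)
```

For example, a (T, 40) fbank matrix with `left=5, right=5` becomes a (T, 440) matrix, so a classifier sees 11 consecutive frames per input row.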

References

htkmfc.py from here
Base Gammatone scripts from this page

Contact

I can be contacted at sidm@iitk.ac.in
