Skip to content

r-DSP-Tasks/Task4-Voice-Recognition-App

Repository files navigation

voice-recognition

A basic implementation of a Audio Recognition algorithm, it utilizes the advantages of a spectrogram and perceptual hashing, implementation is as follows :

  • A database is formed of 75 songs (Audio File) separated to their Vocal and Musical features.
  • Extraction of Spectrogram and spectral Features (Mel Spectrogram, Mel frequency Coefficient and Chroma STFT) is executed.
  • Hashing the extracted data with a Perceptual Hashing Algorithm.
  • A test Song (Audio File) is given to the application with extraction of its Hash the matches are found.
  • Matching percentages are calculated according to a mapping algorithm and then sorted to the user
  • A testing mechanism is implemented by mixing two Audio files then this mix is given to the application to find it's matches in the database

Basic Usage

This is a simple usage application only for testing the Recognition algorithm :

  • Load desired Audio File with max 2 files

    1

  • Select the desired slider value from 0 to 100%

  • Click on the Search button

2

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages