shinytang6 / Voice-Activity-Detection Public

Notifications You must be signed in to change notification settings
Fork 0
Star 1

Assignments of intelligent speech interaction course

1 star 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
pro1		pro1
pro2		pro2
.gitignore		.gitignore
README.md		README.md

Repository files navigation

Voice-Activity-Detection

Introduction

This repo contains two labs in the course of Intelligent speech interaction.

Requirements

Windows

pip install librosa numpy scikit-learn

Contents

Lab1

Materials: wav file en_4092_a.wav, en_4092_b.wav

Aim: This project uses a simple speech endpoint detection algorithm to process wav files,aiming at deciding if a segment is silent or not.

usage:

python onset_detect.py
python transfer.py

Lab2

This lab uses a machine learning approach(GMM) to achieve the same purpose as the lab1.

If you want to use your own wav file,please install HCopy first!

You need to extract features with HCopy using the config file config.feat,which extracts MFCC features

HCopy -C config.feat -S feats.scp

Then you can verify the features exist(*.mfcc),here l have already generated the mfcc file of the given wav files.

usage:

python filter.py
python GMM.py

About

Assignments of intelligent speech interaction course

Report repository

Releases

No releases published

Packages

No packages published

Languages