General Purpose Audio Tagging

Neural network project addresses the problem of general-purpose automatic audio tagging. (http://dcase.community/challenge2018/task-general-purpose-audio-tagging). This project uses a convolutional neural network (VGG) to classify 41 classes of audio. Dowload the dataset here. To initialize the workspace use the init_work_space.sh bash script. If you want to train a model:

python3 train_spec.py --model_name "name of the new model"

If you want to try a pretrained model, first lunch the init_work_space.sh script with the -m option and then:

python3 test.py --model_name test9

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.gitignore		.gitignore
CNNSpecNetwork.py		CNNSpecNetwork.py
DataLoader.py		DataLoader.py
DataManager.py		DataManager.py
LICENSE		LICENSE
Loader.py		Loader.py
Preprocessor.py		Preprocessor.py
README.md		README.md
compute_spectrograms.py		compute_spectrograms.py
init_work_space.sh		init_work_space.sh
requirements.txt		requirements.txt
test.py		test.py
testing_gen_stat.py		testing_gen_stat.py
train_spec.py		train_spec.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

CNNSpecNetwork.py

CNNSpecNetwork.py

DataLoader.py

DataLoader.py

DataManager.py

DataManager.py

LICENSE

LICENSE

Loader.py

Loader.py

Preprocessor.py

Preprocessor.py

README.md

README.md

compute_spectrograms.py

compute_spectrograms.py

init_work_space.sh

init_work_space.sh

requirements.txt

requirements.txt

test.py

test.py

testing_gen_stat.py

testing_gen_stat.py

train_spec.py

train_spec.py

Repository files navigation

General Purpose Audio Tagging

About

Releases

Packages

Contributors 2

Languages

License

LeoBrizi/General-Purpose-Audio-Tagging

Folders and files

Latest commit

History

Repository files navigation

General Purpose Audio Tagging

About

Resources

License

Stars

Watchers

Forks

Languages