GitHub - apardyl/ml-audio-recognition

How to begin:

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

How to train models:

Download the FMA small dataset (8,000 tracks, 30 s each) from https://os.unil.cloud.switch.ch/fma/fma_small.zip and unzip it into ./data
python build_dataset.py (this takes a long time, but can be stopped and resumed at any point)
python train.py (see python train.py --help for options; training can also be stopped and resumed)
(optional - evaluation) python test.py --small_encoder <path to small encoder state> --large_encoder <path to large encoder state>
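
The stop-and-resume behaviour mentioned above can be illustrated with a common pattern: skip any input whose output file already exists, and write each output under a temporary name first so an interrupted run never leaves a half-written file behind. This is only a sketch of that general pattern; whether build_dataset.py works this way is an assumption, and `process_resumable`, `convert` and the `.bin` extension are hypothetical names chosen for illustration.

```python
from pathlib import Path
from typing import Callable


def process_resumable(src_dir: Path, out_dir: Path,
                      convert: Callable[[Path], bytes]) -> int:
    """Convert every .mp3 under src_dir, skipping files already converted.

    `convert` is a hypothetical stand-in for the real preprocessing step.
    Returns the number of files converted during this call.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    converted = 0
    for src in sorted(src_dir.rglob("*.mp3")):
        # Assumes file stems are unique across subdirectories.
        dst = out_dir / (src.stem + ".bin")
        if dst.exists():
            continue  # finished in an earlier run
        tmp = dst.with_name(dst.name + ".tmp")
        tmp.write_bytes(convert(src))
        tmp.replace(dst)  # rename only after the write has completed
        converted += 1
    return converted
```

Calling the function a second time on the same directories converts only the files that were not finished before.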

...or download pre-trained models from here.

How to index your music database:

python indexer.py --small_encoder <path to small encoder state> --large_encoder <path to large encoder state> --database <where to save track database> --index <where to save lookup index> --data <directory containing your mp3 files> (this will take a long time)
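
To make the roles of the --database and --index outputs more concrete, here is a rough sketch of what an indexing pass could look like: walk the mp3 directory, record each track in a database, and store one or more fingerprint vectors per track in a lookup index. The JSON layout, the function name `build_index` and the `embed` callback (a stand-in for the trained encoders) are all assumptions made for illustration, not the actual format used by indexer.py.

```python
import json
from pathlib import Path
from typing import Callable, List, Sequence


def build_index(data_dir: Path, database_path: Path, index_path: Path,
                embed: Callable[[Path], Sequence[Sequence[float]]]) -> int:
    """Write a track database and a lookup index; return the track count.

    `embed` is a hypothetical stand-in for the trained encoders: it maps
    one audio file to a list of fixed-length fingerprint vectors.
    """
    tracks: List[str] = []
    entries: List[dict] = []
    for track_id, path in enumerate(sorted(data_dir.rglob("*.mp3"))):
        # The database maps a numeric track id to a readable name.
        tracks.append(str(path.relative_to(data_dir)))
        # The index stores every fingerprint with the id of its track.
        for vector in embed(path):
            entries.append({"track": track_id, "vector": list(vector)})
    database_path.write_text(json.dumps(tracks))
    index_path.write_text(json.dumps(entries))
    return len(tracks)
```

Keeping the track names separate from the vectors means the index holds only numeric ids, which is one plausible reason for having two output files.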

How to recognize audio files:

Use recognition.py (see python recognition.py --help for options).

This tool works in one of two modes: interactively, with audio sample paths, offsets and lengths provided via stdin, or pointed at a directory of mp3 files, from which random samples are taken. For each sample it outputs the name of the closest matching track in the database along with a similarity score (lower is better).
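
The "closest match, lower score is better" idea can be sketched as a nearest-neighbour lookup over fingerprint vectors. The example below uses plain Euclidean distance and a linear scan; the actual metric and search structure used by recognition.py are not stated here, so both are assumptions, and `closest_track` is a hypothetical name.

```python
import math
from typing import List, Sequence, Tuple


def closest_track(query: Sequence[float],
                  index: List[Tuple[str, Sequence[float]]]) -> Tuple[str, float]:
    """Return (track name, score) for the index entry nearest to `query`.

    The score is the Euclidean distance (an assumed metric), so 0.0 is a
    perfect match and larger values mean a worse match.
    """
    if not index:
        raise ValueError("index is empty")
    name, vector = min(index, key=lambda entry: math.dist(query, entry[1]))
    return name, math.dist(query, vector)


if __name__ == "__main__":
    demo_index = [("track_a", [0.0, 0.0]), ("track_b", [3.0, 4.0])]
    print(closest_track([0.0, 1.0], demo_index))
```

A linear scan is fine for a small collection; a large database would normally use an approximate nearest-neighbour structure instead.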

TODO:

Recognize audio samples directly from PulseAudio.
Split recognition.py into a client and a server (data loads slowly).

