CS221: Cover Song Classification

Ian Tenney (iftenney)
Ken Kao (kennykao)
Course project for CS221: Artificial Intelligence: Principles and Techniques
December 12, 2013

Getting Started

Note: This guide assumes that you are logged in to corn, or on a similarly-configured computer running a recent version of Linux with an up-to-date Python distribution.

First, make a link to the dataset, which is hosted on AFS*:
ln -s /afs/ir.stanford.edu/users/i/f/iftenney/public/MSD-SHS MSD-SHS
*Thanks CS221 for giving us extra AFS space!

Alternatively, a very small number of sample tracks are available in the MSD-SHS_r directory; rename this to MSD-SHS to run with the restricted set (this will likely crash, as some algorithms require a certain amount of data to avoid empty-set errors).

You can run the project by typing: python main.py

This will run with default settings, which are:

50 cliques, comboPlus features
logistic regression with L1 regularization (strength=1.0)
KNN classifier with k=7, relaxed n=5

The project requires a number of libraries to run. They are:

Python (tested with 2.7.4 and 2.7.6)
NumPy, SciPy, and matplotlib (particularly, the Pylab namespace)
PyTables (for reading HDF5 data files)
scikit-learn (for core algorithms)

Additionally, to run metric learning (--LMNN):

Kilian Q Weinberger's LMNN library, included in ./lib/
MATLAB (tested with R2013b)
an installed C/C++ compiler, to run install.m in ./lib/mLMNN2.4/

Running the Project

Because our project focused on testing our classifier on a development set, and because we were not able to achieve production-level acccuracy, we did not build a direct query system. Instead, the classifier loads data from the /MSD-SHS/ directory (symlinked to a separate filesystem) and partitions into a training and a test set using a seeded random sample (below).

To run our classifier, type: python main.py [params]

Parameters are passed as UNIX-style flags, and specify all the parameters that we used during testing to fine-tune the data selection, feature extraction, pre-processing, and the logistic regression, metric learning, and KNN classification stages of our system. The parameters described below are those used in testing; some others exist in the code but were not used.

Data:

-c : number of cliques to use; larger cliques are chosen first
--test_fraction : approximate portion of the full dataset to use as the testing/development set
-f : features to use: combo or comboPlus (other features may be available, but not used in our final results)
--seed_xval : random seed for train | test partitioning
--seed_pairs : random seed for sampling negative pairs (used by pairwise binary classifier)

Preprocessing:

-p : preprocessing mode: none, scale, whiten, or pca
--pca : if using -p pca, specifies the number of components to use

Metric Learning (LMNN):

--LMNN : use the LMNN algorithm to learn a Mahalanobis matrix
--lmnnMu : inverse regularization strength for the LMNN algorithm

Note: These will crash unless the LMNN library is properly installed and compiled on the target system, which requires running ./lib/mLMNN2.4/install.m and following the rest of the setup instructions in ./lib/mLMNN2.4/Readme.txt.

Logistic Regression:

--logistic : use pairwise binary logistic regression
-r : regularization type (l1 or l2)
--rstrength : regularization strength (higher is stronger)
--noTestLogistic : disable running logistic classifier on test set, for faster runs for KNN testing

K Nearest Neighbors:

--knn : use K Nearest Neighbors (KNN) classifier, with optional scaling from learned weights, or Mahalanobis metric from LMNN
-k : k-value for KNN
--knn_relax_n : number of candidate cliques to consider for "relaxed" testing

Misc:

--plot : plot feature vectors and clique size distributions, saved as .png files in ./output/

As an example, to run with comboPlus features and whitening on 50 cliques with k=7, use: python main.py -c 50 -f comboPlus -p whiten --logistic -r l1 --rstrength 100.0 --knn -k 7 --knn_relax_n 5

Testing Methodology

To obtain our analysis, we ran our system many times with different parameters. Additionally, we changed the data partitioning seed (--seed_xval) to perform cross-validation for more robust results. This is a tedious process to run manually, and so we include a series of bash scripts to automate the process without making major changes to our python codebase:

xval.sh <n> "params" : will run our program repeatedly with a different seed, and output concatenated results to the ./results/ directory. n refers to the number of cross-validation runs to perform.

run_list.sh <file> <n> : will parse a text file containing lists of parameters as they would be passed to main.py, one list per line, and run the cross-validation script (xval.sh) on each.

parse_set.sh <field_glob> <fname_glob_VAR> <VAR1> [<VAR2> <VAR3> ...] : this will parse output files in ./results/ by matching the output lines to field_glob in files matching fname_glob. The string VAR in the file match string will be substituted by each following argument, providing array-like functionality. Output is in comma-separated format.

For example, to extract data from a variety of clique counts, and those runs that used combo features and the standard scaler:

>> ./parse_set.sh "KNN relax test" c_VAR*scale*combo_*100.0 25 35 50 75 100 200 300
# Field: <KNN relax test> :: match: *c_25*scale*combo_*100.0*,35.80,35.19,32.10,29.01,30.86
# Field: <KNN relax test> :: match: *c_35*scale*combo_*100.0*,23.76,19.80,23.27,27.72,28.22
# Field: <KNN relax test> :: match: *c_50*scale*combo_*100.0*,18.70,17.18,18.70,17.56,17.56
# Field: <KNN relax test> :: match: *c_75*scale*combo_*100.0*,15.50,13.45,10.53,14.62,14.62
# Field: <KNN relax test> :: match: *c_100*scale*combo_*100.0*,11.51,12.47,9.35,9.83,12.23
# Field: <KNN relax test> :: match: *c_200*scale*combo_*100.0*,8.03,9.55,7.88,7.88,6.21
# Field: <KNN relax test> :: match: *c_300*scale*combo_*100.0*,6.51,4.88,4.42,5.35,4.65

Codebase Directory

Our code is decomposed into several auxiliary .py files:

main.py : main entry point
stages.py : code to drive each algorithm (logistic, LMNN, KNN), including data loading and preprocessing
logistic.py : wrappers for scikit-learn logistic regression algorithm
knn.py : wrappers for scikit-learn k nearest neighbors algorithm
transform.py : data preprocessing interface
load_song_data.py : dataset loading and track data access wrappers
feature_util.py : custom feature extractors (combo, comboPlus, and others)
utils.py : utility helper functions
base_classifier.py : baseline histogram generator, used for p-proposal

Additionally, we use several control scripts:

xval.sh : cross-validation script
xval_qsub.sh : cross-validation script designed to run on the barley cluster
run_list.sh : run cross-validation on a list of parameters
filter_output.sh : helper for parsing cross-validation output
parse_set.sh : parameter array for parsing cross-validation output into a spreadsheet-friendly format

Finally, the project directories are:

output : plot output, if --plot is given as a parameter
results : text output from cross-validation scripts
batch : batch files used by run_list.sh
lib : MSD-SHS and LMNN library code
filter : code used to filter the full 300-gigabyte MSD dataset down to the MSD-SHS subset

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
batch		batch
filter		filter
lib		lib
output		output
results		results
.gitignore		.gitignore
README.html		README.html
README.md		README.md
base_classifier.py		base_classifier.py
feature_util.py		feature_util.py
filter_output.sh		filter_output.sh
knn.py		knn.py
load_song_data.py		load_song_data.py
logistic.py		logistic.py
main.py		main.py
parse_set.sh		parse_set.sh
poster-metric.list		poster-metric.list
run_list.sh		run_list.sh
stages.py		stages.py
test-nov05.sh		test-nov05.sh
transform.py		transform.py
utils.py		utils.py
xval.sh		xval.sh
xval_qsub.sh		xval_qsub.sh

kennyk616/CS221-Song-Classification

Folders and files

Latest commit

History

Repository files navigation

CS221: Cover Song Classification

Getting Started

Running the Project

Data:

Preprocessing:

Metric Learning (LMNN):

Logistic Regression:

K Nearest Neighbors:

Misc:

Testing Methodology

Codebase Directory

About

Resources

Stars

Watchers

Forks

Languages