Skip to content

amorpfr/ChumBucket

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

44 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ChumBucket

Project Applied Machine Learning 2017 (Classifying Plankton species).

Plankton images were classified by extracting global(haralick, zernike, binary pattern, image_size and ratio) and local features(SURF). The code structured followed a pipelined proces of preprocessing, feature extraction, feature selection and model evaluation.

Note that the data of the train/test images are not included in this repo due to storage limitations.

Files

The code for classyfing plankton species consists the following files:

pre.py

Script that will preprocess the images suitbale for feature extracting.

surf.py

Script that will extract SURF features for each image.

features.py

Script that will extract global features for each image.

training.py

Script that will train a model based on the train images and features.

test.py

Script that evaluates the model on the test data and is able to create a submission for Kaggle.

visualization.py

Script that visualizes some evaluation metrics of the models.

How to Run

The files are structured as a pipeline.

For the training set

  1. Run pre.py (set test=False) -> input: image paths, output: preprocess.pkl
  2. Run surf.py -> input: preprocess.pkl, output: surf.pkl
  3. Run features.py -> input: surf.pkl, output: features.pkl
  4. Run training.py -> input: features.pkl, output: model.pkl

For the test set

  1. Run pre.py (set test=True) -> input: image paths, output: preprocess_test.pkl
  2. Run surf.py -> input: preprocess_test.pkl, output: surf_test.pkl
  3. Run features.py -> input: surf_test.pkl, output: features_test.pkl

Create submission for Kaggle

  1. Run test.py -> input: features.pkl, features_test.pkl, output: submission.csv
For questions or more information please contact: aaamorus@hotmail.com or danielkooij@live.nl

About

Applied machine learning chumbucket

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages