theano-conv-semantic

!under development!

Convolutional deep network framework for semantic segmentation, implemented using Theano.

Installation

Just do

    git clone https://github.com/iborko/theano-conv-semantic

Later, you can update your repo with

    git pull

Description

This framework is heavily based on the work of Farabet et al. (PAMI). As in that paper, training has two stages. In the first stage, only the feature selector is trained (convolutional layers plus a final softmax layer). In the second stage, a fully connected layer is added at the end of the network and only this newly added layer is trained.

The network for the Stanford Background dataset can be trained on two types of input: single-scale YUV-transformed images, or a 3-scale Laplacian pyramid of YUV-transformed images. Details are described in the aforementioned work.
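To make the 3-scale input more concrete, here is a minimal NumPy sketch of a YUV conversion followed by a 3-level Laplacian pyramid. It is not the code used by the generate_iccv_3l.py script. The BT.601 conversion matrix, the 2x2 box-filter downsampling and the nearest-neighbour upsampling are simplifying assumptions (a Gaussian filter is the usual choice), and all function names are hypothetical.

```python
import numpy as np

# BT.601 RGB -> YUV coefficients (an assumption; the repository may use others).
RGB_TO_YUV = np.array([
    [0.299, 0.587, 0.114],
    [-0.14713, -0.28886, 0.436],
    [0.615, -0.51499, -0.10001],
])


def rgb_to_yuv(img):
    """Convert an (H, W, 3) float RGB image to YUV."""
    return np.dot(img, RGB_TO_YUV.T)


def downsample(img):
    """Halve the resolution by averaging 2x2 blocks (H and W must be even)."""
    h, w, c = img.shape
    return img.reshape(h // 2, 2, w // 2, 2, c).mean(axis=(1, 3))


def upsample(img):
    """Double the resolution by nearest-neighbour repetition."""
    return img.repeat(2, axis=0).repeat(2, axis=1)


def laplacian_pyramid(img, levels=3):
    """Return `levels` images: detail bands first, coarsest residual last."""
    pyramid = []
    current = img
    for _ in range(levels - 1):
        smaller = downsample(current)
        # Each band stores what is lost when going one scale down.
        pyramid.append(current - upsample(smaller))
        current = smaller
    pyramid.append(current)
    return pyramid
```

For an image whose height and width are divisible by 4, `laplacian_pyramid(rgb_to_yuv(img))` returns three arrays at full, half and quarter resolution. Adding the upsampled coarser levels back to the detail bands reconstructs the YUV image.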

Usage

Download the MSRC-21, Stanford Background (iccv09) or KITTI (German Ros version) dataset and unpack it into the data/ folder.
Update the dataset locations in generate-iccv.conf and generate-kitti.conf. You can also change other parameters there, such as the validation set percentage and the output folder locations. The generate-*.conf files are used when generating the Theano-ready dataset.
Depending on the dataset, run python generate_msrc.py, python generate_iccv_1l.py generate-iccv.conf, python generate_iccv_3l.py generate-iccv.conf or python generate_kitti.py generate-kitti.conf.

generate_msrc.py generates single-scale data for the MSRC dataset.
generate_iccv_1l.py generates single-scale data for the Stanford Background dataset.
generate_iccv_3l.py generates 3-scale data for the Stanford Background dataset.
generate_kitti.py generates 3-scale data for the KITTI dataset.
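If you prefer to drive the generation step from Python, the script and config pairs listed above can be collected in a small lookup table. This is only a convenience sketch: the dataset keys and the helper names are hypothetical, and it assumes the commands are run from the repository root with `python` on the PATH.

```python
import subprocess

# Script and config names come from this README; the dictionary keys are
# illustrative labels and are not used anywhere in the repository.
GENERATORS = {
    "msrc": ["generate_msrc.py"],
    "iccv-1scale": ["generate_iccv_1l.py", "generate-iccv.conf"],
    "iccv-3scale": ["generate_iccv_3l.py", "generate-iccv.conf"],
    "kitti": ["generate_kitti.py", "generate-kitti.conf"],
}


def generation_command(dataset):
    """Return the argument list that generates Theano-ready data for `dataset`."""
    if dataset not in GENERATORS:
        raise ValueError("unknown dataset %r, expected one of %s"
                         % (dataset, sorted(GENERATORS)))
    return ["python"] + GENERATORS[dataset]


def generate(dataset):
    """Run the generator; raises CalledProcessError if the script fails."""
    subprocess.check_call(generation_command(dataset))
```

For example, `generate("iccv-3scale")` runs `python generate_iccv_3l.py generate-iccv.conf`.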

Run python train_2step_3l.py file.conf or python train_kitti.py file.conf, where file.conf is a configuration file. In it you can set the location of the Theano dataset (produced by a generate_* script), network parameters, stopping parameters, etc.
To compute results on the test set, run the validate_iccv.py or validate_kitti.py script. The parameters are described at the bottom of each script. Example: python validate.py network.conf network-12-34.bin test, where network-12-34.bin contains the best network parameters (saved automatically during training).

Results

Results on Stanford background dataset

Method	Accuracy (%)	Class accuracy (%)
Convnet (3 scales)	75.7	59.3
Convnet (3 scales) + superpixels	76.1	59.7
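This README does not describe how superpixels are combined with the network output. One common approach, used in the Farabet et al. paper, is to average the per-pixel class distributions inside each superpixel and assign the winning class to the whole superpixel. The sketch below illustrates that idea under this assumption. It is not taken from the postprocessing/ folder, and the function name and argument layout are hypothetical.

```python
import numpy as np


def smooth_with_superpixels(probs, segments):
    """Assign one label per superpixel from averaged class probabilities.

    probs:    (H, W, C) array of per-pixel class probabilities.
    segments: (H, W) integer array of superpixel ids in 0..K-1.
    Returns an (H, W) label map that is constant inside each superpixel.
    """
    h, w, c = probs.shape
    flat_probs = probs.reshape(-1, c)
    flat_seg = segments.ravel()
    n_segments = flat_seg.max() + 1

    # Sum the probabilities of all pixels in each superpixel. The argmax of
    # the sum equals the argmax of the mean, so no division is needed.
    sums = np.zeros((n_segments, c))
    np.add.at(sums, flat_seg, flat_probs)

    segment_labels = sums.argmax(axis=1)
    return segment_labels[flat_seg].reshape(h, w)
```

A pixel whose own prediction disagrees with most of its superpixel is overruled by its neighbours, which tends to clean up noisy object boundaries.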

Results on KITTI

Input: RGB images plus a normalized depth component computed by stereo vision (spsstereo algorithm).

Method	Accuracy (%)	Class accuracy (%)
Convnet (3 scales)	73.6	42.4
Convnet (3 scales)	75.1	43.1
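The two columns in the result tables are not defined in this README. Assuming the usual definitions, "Accuracy" is the percentage of correctly labelled pixels and "Class accuracy" is the mean of the per-class recalls. The sketch below computes both from flat label lists. The function names are hypothetical and this is not the code in the validate scripts.

```python
def confusion_matrix(true_labels, predicted_labels, n_classes):
    """matrix[t][p] counts pixels of true class t predicted as class p."""
    matrix = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(true_labels, predicted_labels):
        matrix[t][p] += 1
    return matrix


def pixel_accuracy(matrix):
    """Percentage of all pixels that were labelled correctly."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return 100.0 * correct / total


def class_accuracy(matrix):
    """Mean per-class recall in percent, skipping classes with no pixels."""
    per_class = [100.0 * row[i] / sum(row)
                 for i, row in enumerate(matrix) if sum(row) > 0]
    return sum(per_class) / len(per_class)


# Toy example: class 0 has 8 pixels (all correct), class 1 has 2 (one correct).
true_labels = [0] * 8 + [1] * 2
predicted = [0] * 8 + [0, 1]
cm = confusion_matrix(true_labels, predicted, n_classes=2)
print(pixel_accuracy(cm), class_accuracy(cm))
```

Under these definitions, rare classes weigh as much as frequent ones in the class accuracy, which would explain why it is much lower than the pixel accuracy on datasets with unbalanced classes.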

Visualization

During runtime, the framework generates an output.log file. Its data can be visualized with the plot_cost.py script. Just run

    python plot_cost.py output.log

The script plots the training cost, validation cost and validation error using pyplot.

Plans

Usage of SIFT Flow dataset
Framework will be able to load network architecture from configuration file (partially done)
Input data generation will be configured through special configuration file (partially done)
Support for oversegmentation methods such as superpixels (done)
Implementation of the Inception layer described in GoogLeNet paper (partially done, helpers/layers/...)

Master thesis work

Ivan Borko, Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia

2014/2015
