Cross Modal Distillation for Supervision Transfer

Saurabh Gupta, Judy Hoffman, Jitendra Malik

This codebase allows use of RGB-D object detection models from this arXiv tech report.

License

This code base is built on Fast R-CNN. License for Fast R-CNN can be found in LICENSE_fast_rcnn.

Citing

If you find this code base and models useful in your research, please consider citing an appropriate sub-set of the following papers:

@article{gupta2015cross,
  title={Cross Modal Distillation for Supervision Transfer},
  author={Gupta, Saurabh and Hoffman, Judy and Malik, Jitendra},
  journal={arXiv preprint arXiv:1507.00448},
  year={2015}
}

@incollection{gupta2014learning,
  title={Learning rich features from RGB-D images for object detection and segmentation},
  author={Gupta, Saurabh and Girshick, Ross and Arbel{\'a}ez, Pablo and Malik, Jitendra},
  booktitle={Computer Vision--ECCV 2014},
  pages={345--360},
  year={2014},
  publisher={Springer}
}

@article{girshick15fastrcnn,
    Author = {Ross Girshick},
    Title = {Fast R-CNN},
    Journal = {arXiv preprint arXiv:1504.08083},
    Year = {2015}
}

Requirements: software

Requirements for Caffe and pycaffe (see: Caffe installation instructions)

Note: Caffe must be built with support for Python layers!

# In your Makefile.config, make sure to have this line uncommented
WITH_PYTHON_LAYER := 1

Python packages you might not have: cython, python-opencv, easydict

Requirements: hardware

For training smaller networks (CaffeNet, VGG_CNN_M_1024) a good GPU (e.g., Titan, K20, K40, ...) with at least 3G of memory suffices
For training with VGG16, you'll need a K40 (~11G of memory)

Installation (sufficient for the demo)

Clone the repository

# Clone the python code
git clone git@github.com:s-gupta/fast-rcnn.git

We'll call the directory that you cloned Fast R-CNN into FRCN_ROOT. Clone Caffe with roi_pooling_layers:

cd $FRCNN_ROOT
git clone https://github.com/rbgirshick/caffe-fast-rcnn.git caffe-fast-rcnn
cd caffe-fast-rcnn
# caffe-fast-rcnn needs to be on the fast-rcnn branch (or equivalent detached state).
git checkout fast-rcnn

Build the Cython modules
```
cd $FRCN_ROOT/lib
make
```

Build Caffe and pycaffe

cd $FRCN_ROOT/caffe-fast-rcnn
# Now follow the Caffe installation instructions here:
#   http://caffe.berkeleyvision.org/installation.html

# If you're experienced with Caffe and have all of the requirements installed
# and your Makefile.config in place, then simply do.
# Make sure caffe is built with PYTHON layers.
make -j8 && make pycaffe

Download models and data

Download the NYUD2 data

cd $FRCN_ROOT
./data/scripts/fetch_nyud2_data.sh

Download the NYUD2 MCG boxes

cd $FRCN_ROOT
./data/scripts/fetch_nyud2_mcg_boxes.sh

Download the ImageNet and Supervision Transfer Models

cd $FRCN_ROOT
./data/scripts/fetch_init_models.sh

Fetch NYUD2 Object Detector Models.

cd $FRCN_ROOT
./outputs/scripts/fetch_nyud2_detectors.sh

Usage

Look at experiments/test_pretrained_models.sh and experiments/train_models.sh to use pretrained models and train your models yourself.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
data		data
experiments		experiments
lib		lib
matlab		matlab
output		output
python_utils		python_utils
scripts		scripts
tools		tools
LICENSE		LICENSE
LICENSE_fast_rcnn		LICENSE_fast_rcnn
README.md		README.md
_init_paths.py		_init_paths.py

License

rzel/fast-rcnn-normal

Folders and files

Latest commit

History

Repository files navigation

Cross Modal Distillation for Supervision Transfer

License

Citing

Contents

Requirements: software

Requirements: hardware

Installation (sufficient for the demo)

Download models and data

Usage

fast-rcnn-distillation

fast-rcnn-backup

About

Resources

License

Stars

Watchers

Forks

Languages