yuan73/CVPR_2014_code

Code for the paper "Leveraging Hierarchical Parametric Networks for Skeletal Joints Based Action Segmentation and Recognition".
Purpose

This is the code for the challenge "ChaLearn Looking at People 2014".


Gist: Deep Belief Networks (Gaussian-Bernoulli RBM as the first layer) + Hidden Markov Models
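
To make the gist concrete, here is a minimal NumPy sketch of one Gibbs step in a Gaussian-Bernoulli RBM with unit visible variance (names, sizes, and initialization are illustrative only, not the repository's actual code): hidden units are Bernoulli given the real-valued visibles, and visibles are Gaussian given the hiddens.

import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

n_vis, n_hid = 6, 4                      # toy sizes, e.g. a few skeletal-joint features
W = 0.01 * rng.standard_normal((n_vis, n_hid))
b_vis = np.zeros(n_vis)                  # Gaussian visible biases
b_hid = np.zeros(n_hid)                  # Bernoulli hidden biases

def gibbs_step(v):
    """One v -> h -> v' step, as used inside contrastive divergence."""
    p_h = sigmoid(v @ W + b_hid)                     # Bernoulli hiddens given visibles
    h = (rng.random(p_h.shape) < p_h).astype(float)  # sample hidden states
    v_mean = h @ W.T + b_vis                         # Gaussian visible mean given hiddens
    return h, v_mean + rng.standard_normal(v_mean.shape)

v0 = rng.standard_normal(n_vis)          # a real-valued input frame
h0, v1 = gibbs_step(v0)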


by Di WU: stevenwudi@gmail.com, 2015/05/27

Citation

If you use this toolbox as part of a research project, please cite the corresponding paper:


@inproceedings{wu2014leveraging,
  title={Leveraging Hierarchical Parametric Networks for Skeletal Joints Based Action Segmentation and Recognition},
  author={Wu, Di and Shao, Ling},
  booktitle={Proc. Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2014}
}

Dependency: Theano

The code depends on Theano for the deep learning components: http://deeplearning.net/software/theano/. Note that Wudi changed some of its functionality (Deep Belief Networks, Gaussian-Bernoulli Restricted Boltzmann Machines); the modified code lives in the TheanoDL subfolder.
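
As a quick sanity check of the setup (assumed usage, not part of the repository), you can confirm that Theano imports and which device it will use:

import theano

print(theano.__version__)
print(theano.config.device)  # 'gpu' (used for the timing quoted below) or 'cpu'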

Test

To reproduce the experimental results for the test submission, run the Python file Step3_SK_Test_prediction.py. Three paths need to be changed accordingly:

line 60, data folder (test data): data_path=os.path.join("I:\Kaggle_multimodal\Test\Test\")

line 62, predictions folder (output): outPred=r'.\training\test'

line 64, submission folder (output): outSubmision=r'.\training\test_submission'
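
Put together, the top of the script looks roughly like this (a sketch using the example values above; the I: drive path is the author's machine, so substitute your own, and note the backslashes are doubled so the string is valid Python):

import os

# Example values; adjust all three to your machine.
data_path = os.path.join("I:\\Kaggle_multimodal\\Test\\Test\\")  # line 60: test data
outPred = r'.\training\test'                                     # line 62: predictions folder
outSubmision = r'.\training\test_submission'                     # line 64: submission folder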

It takes about 20 seconds per example file using only skeleton information. (I use Theano's GPU mode, but I reckon CPU mode should run at almost the same speed.)

Train

To train the network, you first need to extract the skeleton information and then train the DBN (a minimal driver chaining the three steps is sketched after the list):

1) Step1_SK_Neutral_Realtime.py --> extract neutral frames (i.e., the 5 frames before and after each gesture)

2) Step1_SK_Realtime.py --> extract gesture frames

3) Step1_DBN_Strucutre2.py --> start training the networks (Step1_DBN.py specifies a smaller network that trains faster, but a larger net generally performs better)

Voilà, here you go.
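
The three steps can be chained with a small driver script like the following (a hypothetical sketch; this file is not part of the repository):

# run_training.py -- hypothetical driver for the three training steps
import subprocess

steps = [
    "Step1_SK_Neutral_Realtime.py",  # 1) extract neutral frames
    "Step1_SK_Realtime.py",          # 2) extract gesture frames
    "Step1_DBN_Strucutre2.py",       # 3) train the DBN (or Step1_DBN.py for a smaller net)
]
for script in steps:
    subprocess.run(["python", script], check=True)  # abort if a step fails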

Dataset

At the recommendation of some readers, I list the links to the datasets used in the paper below:

  1. ChaLearn Italian Gesture Recognition --> http://gesture.chalearn.org/2013-multi-modal-challenge

You should download this dataset from the Kaggle platform: https://www.kaggle.com/c/multi-modal-gesture-recognition/data

  2. MSR Action3D --> http://research.microsoft.com/en-us/um/people/zliu/actionrecorsrc

  3. MSRC12 --> http://research.microsoft.com/en-us/um/cambridge/projects/msrc12

(If you use these datasets, please cite the corresponding original papers. Thanks!)

Contact

If you read the code and find it hard to understand, please send feedback to stevenwudi@gmail.com. Thank you!
