CMU_11785_Team_Project

Carnegie Mellon University 11785 Deep Learning team project

Steps to start training:

1. Modify train and test data path in Utils/const.py

2. Modify LABEL_NUM in Utils/const.py corresponding to real labels numbers in your data.

3. Run 'python train.py'

Utils

Helper functions for training

Data

Train and test data should be placed in this dirctory.

Model

Contains model implementation.

Reference

参考资料

Data:

数据文件夹，包含：

原始训练图片 train_imgs.tar 34184张150*150的3 channel彩图
原始测试图片 test_imgs.tar 8197张150*150的3 channel彩图（目前一共4.2万多张）
文件列表 file_list 其中包含顺序/乱序的训练/测试数据列表，及对应标签。标签一共10类。当初曾经将数据转换成lmdb格式，不过转换后文件较大（如训练数据为2.4GB），所以暂时不上传，后面如果需要可以直传。

Model：

1) train_val_lvshq_128.prototxt 训练/测试模型文件，包含每层的结构。这里倒数第二层是128个neuron，相当于用128位（这里用的是二进制数）的数字来表示这张图像的特征。最后一层10个neuron对应10类标签。
2) deploy_lvshq_128.prototxt 预测使用的模型文件，需要与训练相对应，只是去掉了最后一层。
3) solver.prototxt 训练参数文件
4) train.sh 训练脚本文件

Src：数据爬取和预处理的一些python和shell脚本。

1) spider.py 数据爬取脚本，从jd.com和taobao.com爬取数据
2) create_filelist*.sh 根据图片文件生成对应的图片列表文件，用于训练与测试。
3) create_lmdb.sh 根据数据及标签生成对应的lmdb文件，用于训练与测试。
4) create_meanfile.sh 根据Caffe的工具对数据做均值化（应该类似于feature zero mean）预处理
5) convert_taobao_size.sh 把爬去的数据resize成想要的大小

注：模型部分主要参考论文为：

Deep Learning of Binary Hash Codes for Fast Image Retrieval Kevin Lin, Huei-Fang Yang, and Chu-Song Chen at Academia Sinica, Taipei, Taiwan.

对应Github地址：https://github.com/kevinlin311tw/caffe-cvprw15

Supervised Learning of Semantics-Preserving Deep Hashing (SSDH) Created by Kevin Lin, Huei-Fang Yang, and Chu-Song Chen at Academia Sinica, Taipei, Taiwan.

对应Github地址：https://github.com/kevinlin311tw/Caffe-DeepBinaryCode

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
Data/file_list		Data/file_list
Model		Model
Reference		Reference
Utils		Utils
.gitignore		.gitignore
README.md		README.md
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data/file_list

Data/file_list

Model

Model

Reference

Reference

Utils

Utils

.gitignore

.gitignore

README.md

README.md

train.py

train.py

Repository files navigation

CMU_11785_Team_Project

Steps to start training:

Utils

Data

Model

Reference

Data:

Model：

Src：数据爬取和预处理的一些python和shell脚本。

About

Releases

Packages

Contributors 3

Languages

lvshq/CMU_11785_Team_Project

Folders and files

Latest commit

History

Repository files navigation

CMU_11785_Team_Project

Steps to start training:

Utils

Data

Model

Reference

Data:

Model：

Src：数据爬取和预处理的一些python和shell脚本。

About

Resources

Stars

Watchers

Forks

Languages