
Deep Metric Learning

Learn a deep metric which can be used for image retrieval and clustering.

PyTorch code for deep metric learning methods (a sketch of the NCA loss follows this list):

  • Contrastive Loss

  • Lifted Structure Loss

    still to be implemented

  • Batch-All-Loss and Batch-Hard-Loss

    The two loss functions from In Defense of the Triplet Loss for Person Re-Identification

  • HistogramLoss

    Learning Deep Embeddings with Histogram Loss

  • BinDevianceLoss

    Baseline method in BIER (Deep Metric Learning with BIER: Boosting Independent Embeddings Robustly)

  • DistWeightDevianceLoss

    My own implementation of the sampling strategy from Sampling Matters in Deep Embedding Learning, combined with BinDevianceLoss.

    I think my implementation is more reasonable and more flexible than the original sampling strategy in the paper.

  • NCA Loss

    Learning a Nonlinear Embedding by Preserving Class Neighbourhood Structure, by Ruslan Salakhutdinov and Geoffrey Hinton

    Though the method was proposed in 2004, it has the best performance here.

    R@1 is above 61% on CUB without test-time augmentation, with a 512-dim embedding fine-tuned from pretrained Inception-v2.

  • PS: I had a lot of "wrong" ideas while researching DML problems, and I keep them here without description. You can read the code yourself; it is clear and easy to understand. If you have any question about losses not mentioned above, feel free to ask me.
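As a concrete reference for the list above, here is a minimal sketch of the NCA loss over a batch of embeddings. It assumes L2-normalized features, integer class labels, and a modern PyTorch; it is only an illustration, not the exact implementation in this repository.

```python
import torch
import torch.nn.functional as F

def nca_loss(embeddings, labels):
    """NCA loss: each sample should place its neighbour probability
    mass on samples of the same class.
    embeddings: (B, D) float tensor, labels: (B,) long tensor."""
    # Pairwise squared Euclidean distances between all embeddings.
    dist = torch.cdist(embeddings, embeddings).pow(2)
    # Mask the diagonal so a sample cannot pick itself as a neighbour.
    eye = torch.eye(len(labels), dtype=torch.bool, device=dist.device)
    log_p = F.log_softmax((-dist).masked_fill(eye, float('-inf')), dim=1)
    # Probability mass assigned to same-class neighbours (self excluded).
    pos = labels.unsqueeze(0).eq(labels.unsqueeze(1)) & ~eye
    p_same = (log_p.exp() * pos.float()).sum(dim=1).clamp_min(1e-12)
    return -p_same.log().mean()
```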

Dataset

  • Car-196

    the first 98 classes as the training set and the last 98 classes as the test set

  • CUB-200-2011

    the first 100 classes as the training set and the last 100 classes as the test set

  • Stanford-Online

    For the experiments, we use 59,551 images of 11,318 classes for training and 60,502 images of 11,316 classes for testing.

    After downloading all three datasets, process them as described above and put the resulting directory, named DataSet, in the project. We provide a script ( Deep_Metric/DataSet/split_dataset.py ) to process CUB, Car, and Stanford Online Products.
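For illustration, here is a minimal sketch of the class-based split described above (first half of the classes for training, second half for testing). The helper below is hypothetical; the repository's actual script is DataSet/split_dataset.py.

```python
import os
import shutil

def split_by_class(image_root, out_root, n_train_classes):
    """Copy class folders such as '001.Black_footed_Albatross' into
    train/ or test/ depending on their leading numeric class id."""
    for cls in sorted(os.listdir(image_root)):
        class_id = int(cls.split('.')[0])  # leading numeric id
        split = 'train' if class_id <= n_train_classes else 'test'
        shutil.copytree(os.path.join(image_root, cls),
                        os.path.join(out_root, split, cls))

# e.g. CUB-200-2011: first 100 classes for training
# split_by_class('CUB_200_2011/images', 'DataSet/CUB', 100)
```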

Pretrained models in Pytorch

Pre-trained Inception-BN (Inception-v2), as used in most deep metric learning papers.

Download site: http://data.lip6.fr/cadene/pretrainedmodels/bn_inception-239d2248.pth

wget http://data.lip6.fr/cadene/pretrainedmodels/bn_inception-239d2248.pth
mkdir pretrained_models
cp bn_inception-239d2248.pth pretrained_models/

(To save time, we have already downloaded the weights and put them on my Baidu YunPan. We also put Inception-v3 there; its performance is slightly worse (about 1.5% on Recall@1) than Inception-BN on the CUB/Car datasets.)
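Once the checkpoint is in place, loading it might look like the sketch below. It assumes the bn_inception model definition from Cadene's pretrainedmodels package, whose parameter names match this checkpoint; the model code in this repository may differ.

```python
import torch
import pretrainedmodels  # pip install pretrainedmodels

# Build the architecture without downloading weights, then load the
# local checkpoint saved under pretrained_models/.
model = pretrainedmodels.bn_inception(num_classes=1000, pretrained=None)
state_dict = torch.load('pretrained_models/bn_inception-239d2248.pth')
model.load_state_dict(state_dict)
model.eval()
```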

Prerequisites

  • Computer with Linux or OSX
  • For training, an NVIDIA GPU is strongly recommended for speed. CPU is supported but training may be slow.

Attention!!

The pre-trained Inception-v2 model is transferred from Caffe, and it only works normally on specific versions of PyTorch and Python. I have not figured out why, and I do not know which version is best (on other versions the code runs without errors, but the performance is bad). If you want performance similar to mine, please create an env as follows:

  • Python: 3.5.2 (2.7 may be OK, too)
  • PyTorch: 0.2.03 (I have tried 0.3.0 and 0.1.0; performance was lower than with 0.2.03 by 10% on Rank@1)
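A quick way to confirm the environment matches the pinning above (a trivial check, nothing repository-specific):

```python
import sys
import torch

print(sys.version)        # expect 3.5.x
print(torch.__version__)  # expect a 0.2.0 build
```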

Another Attention!!

If you are not required to use Inception-BN as your pretrained model, you had better use my new repository at https://github.com/bnulihaixia/VGG_dml, which works normally on PyTorch 0.4.0 (the most recent stable version).

Its performance is similar to Inception-BN with 2/3 of the batch size, and training is much faster.

I will run future experiments in the new repository; this repository will not be updated any more.

Performance of the Losses

To keep things clear and simple, I only report Rank@1 on the CUB-200 dataset without test-time augmentation: in most cases, the higher the Rank@1, the higher the Rank@K, and better performance on CUB also means better performance on Car-196, Stanford Online Products, and other datasets. If you have fine-tuned the model to better performance than below, please tell me and I will update the results here.

| Loss Function    | Rank@1 (%) |
|------------------|------------|
| Pool5-L2         | 52.4       |
| Pool5-512dim L2  | 49.2       |
| Pool5-256dim L2  | 47.0       |
| Pool5-128dim L2  | 42.0       |
| Pool5-64dim L2   | 32.0       |
| BinDeviance Loss | 63.3       |
| NCA Loss         | 65.1       |

Pool5-512 (or 64, 128, 256) dim L2 means the feature is transformed from Pool5 via an orthogonal transform, as in the sketch below.
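For illustration, a minimal sketch of such a dimensionality reduction: project the Pool5 feature onto a random orthonormal basis (assuming a modern PyTorch; the exact transform used for the table may differ).

```python
import torch
import torch.nn.functional as F

pool5 = torch.randn(32, 1024)                    # a batch of Pool5 features
q, _ = torch.linalg.qr(torch.randn(1024, 512))   # orthonormal columns
reduced = F.normalize(pool5 @ q, dim=1)          # 512-dim, L2-normalized
```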

With some data preprocessing, the results are much better now.
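The Rank@1 numbers above can be computed as follows: for every query image, find its nearest other embedding and check whether it shares the query's label. A minimal sketch (assuming a modern PyTorch):

```python
import torch

def recall_at_1(embeddings, labels):
    """embeddings: (N, D) tensor, labels: (N,) long tensor."""
    dist = torch.cdist(embeddings, embeddings)
    dist.fill_diagonal_(float('inf'))        # a query cannot match itself
    nn_labels = labels[dist.argmin(dim=1)]   # label of nearest neighbour
    return (nn_labels == labels).float().mean().item()
```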

Reproducing Car-196 (or CUB-200-2011) experiments

With NCA Loss:

sh run_train_00.sh

To reproduce other experiments, you can edit the run_train.sh file yourself.

Notice!!!:

Because the pretrained Inception-BN model transferred from Caffe only works normally on PyTorch 0.2.0, I switched to VGG-16-BN, as in GML: "Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer?"

The network structure is exactly the same as in GML, and I have reproduced the performance reported in the paper with different loss functions.

tSNE visualization on CUB-200

[tSNE embedding figure]
