SimCLR

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations by T. Chen et al. Including support for:

Data parallel training
LARS (Layer-wise Adaptive Rate Scaling) optimizer

Link to SimCLR paper by Chen et al.

Results

These are the top-1 accuracy of linear classifiers trained on the (frozen) representations learned by SimCLR:

Method	Batch Size	ResNet	Projection output dimensionality	Epochs	Optimizer	CIFAR-10	ImageNet (128x128)
SimCLR + Linear eval.	768	ResNet18	128	100	Adam	0.83	0.35
AVGSimCLR + Linear eval.	768	ResNet50	128	100	Adam	0.861	0.356
SimCLR + Finetuning (100% labels)	768	ResNet18	128	100	Adam	0.904	0.438
AVGSimCLR + Finetuning (100% labels)	768	ResNet18	128	40	Adam	0.915	0.443
Logistic Regression	-	-	-	40	Adam	0.358	0.389

LARS optimizer

The LARS optimizer is implemented in modules/utils/lars.py. It can be activated by setting the --optimizer parameter to "lars". It is still experimental and has not been thoroughly tested.

What is SimCLR?

SimCLR is a "simple framework for contrastive learning of visual representations". The contrastive prediction task is defined on pairs of augmented examples, resulting in 2N examples per minibatch. Two augmented versions of an image are considered as a correlated, "positive" pair (x_i and x_j). The remaining 2(N - 1) augmented examples are considered negative examples. The contrastive prediction task aims to identify x_j in the set of negative examples for a given x_i.

What is AVGSimCLR?

Instead of creating two augmentations to calculate one loss, the AVGSimCLR approach averages the contrastive loss of multiple augmented views and backpropagate from there:

$loss_{A} = \frac{1}{N} \sum_{n \in A}^{} loss_{n_{1}, n_{2}}$

with

$A = {[(aug{1}, aug{2}), (aug{3}, aug_{4}) ....(aug_{i-1}, aug_{i})] , i \in \mathbb{N}}$

and

$N = len(A)$

where A is a list, consisting of two augmented views each. The length of the lists represents the number of losses averaged.

The idea behind AVGSimCLR is closely related to bootstrap aggregation (Bagging). As the augmenation pipeline is stochastic, multiple losses are averaged to reduce noise, so that the network is able to learn more fundamental features across multiple augmented views.

Logging and TensorBoard

To view results in TensorBoard, run:

tensorboard --logdir runs

Optimizers and learning rate schedule

This implementation features the Adam optimizer and the LARS optimizer, with the option to decay the learning rate using a cosine decay schedule. The optimizer and weight decay can be configured via the parameters --optimizer and --wd.

Dependencies

torch
torchvision
tensorboard

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
data		data
jupyters		jupyters
media		media
modules		modules
runs		runs
saved_models		saved_models
README.md		README.md
main.py		main.py
supervised.py		supervised.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

jupyters

jupyters

media

media

modules

modules

runs

runs

saved_models

saved_models

README.md

README.md

main.py

main.py

supervised.py

supervised.py

test.py

test.py

Repository files navigation

SimCLR

Results

LARS optimizer

What is SimCLR?

What is AVGSimCLR?

Logging and TensorBoard

Optimizers and learning rate schedule

Dependencies

About

Releases

Packages

Languages

dtheo91/simclr

Folders and files

Latest commit

History

Repository files navigation

SimCLR

Results

LARS optimizer

What is SimCLR?

What is AVGSimCLR?

Logging and TensorBoard

Optimizers and learning rate schedule

Dependencies

About

Resources

Stars

Watchers

Forks

Languages