
vision-transformers-pytorch

Implementation of various Vision Transformers (and other vision models) I found interesting

Models

Currently I have implemented:

- Implemented.
- Implemented. Currently testing.
- NFNet: Tested and got 83.17 top-1 accuracy with NFNet-F0.
- Pyramid Vision Transformer (https://arxiv.org/abs/2102.12122): Tested and got 78.94 top-1 accuracy with PVT-Small.
- Tested and got 82.192 on top-1. Re-experimenting with random erasing.
- Implemented.
- Tested and got 82.862 on top-1 @ 300px, 83.2 on top-1 @ 380px.
- Implemented.

Usage

I'm currently using an LMDB version of the ILSVRC 2012 (ImageNet) dataset, which is built with:

python preprocess.py [IMAGENET_PATH] [train/val]

I think just using torchvision.datasets would be better; I will switch to it later.
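
For reference, a minimal sketch of what that torchvision.datasets alternative could look like; the transforms, batch size, and worker count here are assumptions for illustration, not this repo's actual training pipeline:

```python
# Hypothetical sketch of loading ImageNet with torchvision instead of LMDB.
# Transform values and DataLoader settings are assumptions, not this repo's defaults.
import torch
from torchvision import datasets, transforms

train_transform = transforms.Compose(
    [
        transforms.RandomResizedCrop(224),
        transforms.RandomHorizontalFlip(),
        transforms.ToTensor(),
        transforms.Normalize(
            mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]
        ),
    ]
)

# ImageNet laid out as [IMAGENET_PATH]/train/<class>/<image>.JPEG works with ImageFolder.
train_set = datasets.ImageFolder("[IMAGENET_PATH]/train", transform=train_transform)
train_loader = torch.utils.data.DataLoader(
    train_set, batch_size=256, shuffle=True, num_workers=8, pin_memory=True
)
```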

Then you can run training:

python train.py --conf [CONFIG FILE] --n_gpu [NUMBER OF GPUS] [Config overrides in the form of key=value]
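
The overrides are plain key=value pairs that patch entries of the config file. As a rough illustration of the idea (this is not the actual config loader used by train.py, and the dotted keys below are made up), applying such overrides to a nested config could look like:

```python
# Hypothetical sketch of applying key=value overrides to a nested config dict;
# not the repo's actual loader, and the example keys are invented.
import ast

def apply_overrides(conf: dict, overrides: list) -> dict:
    for item in overrides:
        key, _, raw = item.partition("=")
        node = conf
        *parents, leaf = key.split(".")
        for part in parents:
            node = node.setdefault(part, {})
        try:
            node[leaf] = ast.literal_eval(raw)  # numbers, bools, lists, ...
        except (ValueError, SyntaxError):
            node[leaf] = raw  # leave anything else as a plain string
    return conf

conf = {"training": {"lr": 0.1, "batch_size": 128}}
print(apply_overrides(conf, ["training.lr=0.05", "training.batch_size=256"]))
# {'training': {'lr': 0.05, 'batch_size': 256}}
```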
