Dogs vs Cats

VGG style convolution neural network with very leaky ReLU for the kaggle Dogs vs Cats competition. Currently gets 96.6% on kaggle leaderboards without using outside data and instead relying heavily on data augmentation for generalization. Small amount of fine tuning (finishing training with a small number of iterations with very low learning rate and no data augmentation).

Architecture

Layer Type	Parameters
Input	size: 168x168, channel: 3
convolution	kernel: 3x3, channel: 32
leaky ReLU	alpha = 0.2
convolution	kernel: 3x3, channel: 32
leaky ReLU	alpha = 0.2
max pool	kernel: 2x2
dropout	0.1
convolution	kernel: 3x3, channel: 64
leaky ReLU	alpha = 0.2
convolution	kernel: 3x3, channel: 64
leaky ReLU	alpha = 0.2
max pool	kernel: 2x2
dropout	0.2
convolution	kernel: 3x3, channel: 128
leaky ReLU	alpha = 0.2
convolution	kernel: 3x3, channel: 128
leaky ReLU	alpha = 0.2
convolution	kernel: 3x3, channel: 128
leaky ReLU	alpha = 0.2
max pool	kernel: 2x2
dropout	0.3
fully connected	units: 1024
leaky ReLU	alpha = 0.2
dropout	0.5
fully connected	units: 1024
leaky ReLU	alpha = 0.2
dropout	0.5
softmax

Data augmentation

Images are randomly transformed 'on the fly' while they are being prepared in each batch. The CPU will prepare each batch while the GPU will run the previous batch through the network.

Random rotations between -30 and 30 degrees.
Random cropping between -24 and 24 pixels in any direction.
Random zoom between factors of 1 and 1.3.
Random shearing between -10 and 10 degrees.
Random intensity scaling on RGB channels, independent scaling on each channel.

To-do

Stream data from SSD instead of holding all images in memory (need to install SSD first). Try different network archetectures and data pre-processing. Try intensity scaling method from Krizhevsky, et al 2012.

References

Karen Simonyan, Andrew Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition", link
Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton, "ImageNet Classification with Deep Convolutional Neural Networks", link
Sander Dieleman, "Classifying plankton with deep neural networks", link

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
Dogs vs Cats.ipynb		Dogs vs Cats.ipynb
README.md		README.md
dogsvscats_cnn_rgb.py		dogsvscats_cnn_rgb.py
make_submission.py		make_submission.py
make_test.py		make_test.py
make_train_rgb.py		make_train_rgb.py
trim_images.sh		trim_images.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dogs vs Cats.ipynb

Dogs vs Cats.ipynb

README.md

README.md

dogsvscats_cnn_rgb.py

dogsvscats_cnn_rgb.py

make_submission.py

make_submission.py

make_test.py

make_test.py

make_train_rgb.py

make_train_rgb.py

trim_images.sh

trim_images.sh

Repository files navigation

Dogs vs Cats

Architecture

Data augmentation

To-do

References

About

Releases

Packages

Languages

FlorianMuellerklein/dogs_vs_cats

Folders and files

Latest commit

History

Repository files navigation

Dogs vs Cats

Architecture

Data augmentation

To-do

References

About

Resources

Stars

Watchers

Forks

Languages