GitHub

Fully convolutional autoencoder

Autoencoder that contains no dense layers. The encoding layer is calculated by global mean pooling over the last convolutional layer, which has as many filters as there are dimensions in the code.

The resulting autoencoder is somewhat scale-invariant and can effectively encode images larger or smaller than the training set. The current architecture depends on Lasagne's InverseLayer, which allows the decoding layer access to the gradients in the encoding layer. The next step is to remove this dependency, which will make training more difficult.

Example output - leftmost column is original image, second is original-size reconstruction, and each column to the right is reconstruction after upscaling the original image by 20% along each dimension.

DCGAN autoencoder

Inspired by [1] Radford et al 2015. An attempt to train an autoencoder using generative adverserial training with the autoencoder acting as the generator network, and a separate discriminator network. The architecture used is significantly different from DCGAN, in particular in the use of max-pooling layers rather than strided convolutions.

Based on code found here: https://github.com/Newmu/dcgan_code

This approach trains poorly; the generator seems to find the degenerate solution of outputting a single solution designed to exploit the discriminator. The hope was that it would reproduce the input image, but as structured there is actually nothing in the objective that encourages it to do this. Adding in a smaller term to this effect (e.g. epsilon * binary cross-entropy) helped a little but not much.

Rotated convolutions

Compares performance obtained on the CIFAR-10 classification task using

a typical CNN
a CNN with half as many filters per layer, but each outputs its own activations and those of its weights if rotated 180 degrees
a CNN with a quarter as many filters per layer, but each filter outputs both its own activations, and those of its weights if rotated 90, 180, and 270 degrees

The architectures used in this experiment were chosen to have relatively few dense layer weights, using a final convolutional layer containing fewer feature maps. The result so far is that reducing the model size by using rotated convolutional featues achieves classification accuracy nearly as good as using a model 2-4x the size, and with further work and/or on certain architectures perhaps performance can be matched or exceeded.

Dependencies

theano
lasagne
tqdm
fuel
h5py

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
dcganae		dcganae
fcae		fcae
rotconv		rotconv
LICENSE		LICENSE
README.md		README.md
batch_norm_layer.py		batch_norm_layer.py
config.py		config.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dcganae

dcganae

fcae

fcae

rotconv

rotconv

LICENSE

LICENSE

README.md

README.md

batch_norm_layer.py

batch_norm_layer.py

config.py

config.py

utils.py

utils.py

Repository files navigation

Fully convolutional autoencoder

DCGAN autoencoder

Rotated convolutions

Dependencies

About

Releases

Packages

Languages

License

tencia/experiments

Folders and files

Latest commit

History

Repository files navigation

Fully convolutional autoencoder

DCGAN autoencoder

Rotated convolutions

Dependencies

About

Resources

License

Stars

Watchers

Forks

Languages