Alpha_GAN

A TensorFlow implementation that attempts to reproduce the results presented in the Alpha GAN paper. I tried the alpha-GAN model on two real-world datasets: Cifar10 and CelebA. The paper reports the Inception Score as its quantitative result. Here I restrict myself to qualitative analysis, because Barratt et al. have shown empirically that the Inception Score is suboptimal in several respects. For a detailed read, follow this.

Setup

Python 3.5+
TensorFlow 1.9

Relevant Code Files

config.py contains the hyper-parameters used for the reported Alpha_GAN results.

alpha_gan.py contains the code to train the Alpha_GAN model.

alpha_gan_inference.py contains the code to test a trained Alpha_GAN model.

Usage

Training a model

NOTE: For CelebA, make sure you have downloaded the dataset from here and placed it in the project's current directory.

python alpha_gan.py

Test a trained model

First place the model weights in the directory named by the variable model_directory (see alpha_gan_inference.py), then run:

python alpha_gan_inference.py
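Before running inference, it can help to confirm that the directory actually holds a checkpoint. The sketch below is not part of this repository. It assumes the weights were saved with TensorFlow 1.x's tf.train.Saver, which writes a `<prefix>.index` file next to the data files. The helper name `list_checkpoint_prefixes` is hypothetical; `model_directory` stands for the value set in alpha_gan_inference.py.

```python
import os

def list_checkpoint_prefixes(model_directory):
    """Return the sorted checkpoint prefixes found in model_directory.

    Assumption: checkpoints were written by tf.train.Saver, so each one
    has a "<prefix>.index" file. Returns [] if the directory is missing.
    """
    if not os.path.isdir(model_directory):
        return []
    suffix = ".index"
    prefixes = []
    for name in os.listdir(model_directory):
        if name.endswith(suffix):
            # Strip the suffix to recover the prefix a Saver would restore from.
            prefixes.append(os.path.join(model_directory, name[:-len(suffix)]))
    return sorted(prefixes)
```

An empty result suggests that the weights are in a different directory or that model_directory points to the wrong place.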

Empirical Observations

The model is notoriously hard to train. I found the hyper-parameters described in the paper to be vague: only the hyper-parameter ranges are given, so it is hard to know which values were finally chosen to produce the reported results.

Note also that alpha-GAN is a four-network architecture comprising an encoder, a decoder (which the authors call the generator), a discriminator, and a code-discriminator. Despite this, I qualitatively observe no significant gain compared to VAE-GAN results.
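To make the role of each of the four networks concrete, here is a minimal sketch of how their losses can be assembled from the outputs of the two discriminators. It is an illustration only, not the code in alpha_gan.py. The non-saturating cross-entropy form, the function names, and the reconstruction weight `lam` are all assumptions; the paper and this repository may use a different formulation.

```python
import math

def bce(prob, target):
    """Binary cross-entropy for one probability and a 0/1 target."""
    eps = 1e-8  # clamp to keep log() finite
    prob = min(max(prob, eps), 1.0 - eps)
    return -(target * math.log(prob) + (1 - target) * math.log(1.0 - prob))

def l1(x, x_rec):
    """Mean absolute error between two equal-length sequences."""
    return sum(abs(a - b) for a, b in zip(x, x_rec)) / len(x)

def alpha_gan_losses(x, x_rec, d_real, d_rec, d_gen, c_prior, c_enc, lam=1.0):
    """Assemble one loss per network (hypothetical helper).

    x, x_rec : an image and its reconstruction G(E(x)), as flat lists
    d_real, d_rec, d_gen : discriminator probabilities of "real" for a
        real image, a reconstruction, and a sample G(z) with z from the prior
    c_prior, c_enc : code-discriminator probabilities of "prior" for a
        prior code z and an encoded code E(x)
    lam : reconstruction weight (assumed value, not from the paper)
    """
    rec = lam * l1(x, x_rec)
    return {
        # The encoder reconstructs and tries to fool the code-discriminator.
        "encoder": rec + bce(c_enc, 1),
        # The generator reconstructs and tries to fool the discriminator.
        "generator": rec + bce(d_rec, 1) + bce(d_gen, 1),
        "discriminator": bce(d_real, 1) + bce(d_rec, 0) + bce(d_gen, 0),
        "code_discriminator": bce(c_prior, 1) + bce(c_enc, 0),
    }
```

In this form the encoder and generator share the reconstruction term, while each is paired with its own adversary.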

For the code-discriminator, please avoid batchnorm layers. I lost a couple of days to this. If you do make it work with batchnorm, do message me with a link to your GitHub repository.

For the encoder network, use ReLU activations in the intermediate layers. In general we are free to choose any activation function for an encoder, but in the alpha-GAN approach the encoder also acts as a generator that tries to fool the code-discriminator. The DCGAN architecture guidelines recommend ReLU activations in the generator, so the encoder here uses ReLU as well.

I tried multiple update schedules, for example updating the discriminator and code-discriminator first, followed by the encoder and generator, but I could not find any performance gain.
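A simple way to experiment with such orderings is to drive the training loop from a list of network names. The sketch below is illustrative only. The function `update_schedule` and the two ordering constants are hypothetical names, and the actual loop in alpha_gan.py may be organised differently.

```python
def update_schedule(order, steps):
    """Yield (step, network) pairs, visiting every network in `order` once per step."""
    for step in range(steps):
        for network in order:
            yield step, network

# Two hypothetical orderings of the four networks.
GENERATIVE_FIRST = ["encoder", "generator", "discriminator", "code_discriminator"]
CRITICS_FIRST = ["discriminator", "code_discriminator", "encoder", "generator"]
```

In a real training loop each name would be mapped to the matching optimizer op, so switching schedules only means passing a different list.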

For both datasets, alpha-GAN seems to focus more on generation quality than on reconstruction ability. Also, the official paper reports reconstruction results only for training data points. I wonder why.

Model Weights

CelebA model weights

Generations

Cifar10	Celeb-A

Reconstructions

Qualitative analysis for CelebA dataset

CelebA Original	CelebA Reconstruction

Qualitative analysis for Cifar10 dataset

Cifar10 Original	Cifar10 Reconstruction

PrateekMunjal/Variational-Approaches-for-Auto-Encoding-Generative-Adversarial-Networks
