GAN Testing Playground (WIP)

Project about testing techniques about training GANs and their stability

General info

This project contains code for some of the most know types of GAN (Generative Adverserial Network). I am using this repo to play with these types of networks to get better understanding how they work and how to properly train them.

Disclaimer: This repository is more like proof of concept than download and run!

Content

DCGAN - GAN for generating new images from latent vector
WGAN(GC) - GAN for generating new images from latent vector
SRGAN - GAN for upscaling images

Project Folder Structure

- datasets (place for all data that you will want feed to network)
- media (folder with media files of repo)
- modules
    - gans (trainers for all GANs are pleced)
    - keras_extensions (extesions based on keras functionality)
    - models (models and building blocks for models)
    - utils (other helper stuff)
- settings (settings for scripts)
- utility_scripts (scripts for data processing and other useful stuff)

Setup

pip install -r requirements.txt

Dependencies

- Python3.7
- Tensorflow 2.2.0
- Keras 2.3.1

For GPU acceleration:
    - Cuda Toolkit 10.1
    - cuDNN for toolkit version

Usage

Adjust settings in settings/****_settings.py
Get datasets and place them in dataset directory (Or in directory you set in settings.py)
python preprocess_dataset.py
python train_****.py

After training use
1) python generate_images.py for DCGAN and WGAN
2) python upscale_images.py for SRGAN
(These scripts still needs tweaking because settings for them are hardcoded in them)

Note: These scripts are still in progress of making, some of them may not work!

Utility

preprocess_dataset.py - Script for mass rescaling images to target size and optionaly splitting them to training and testing parts
visualize_conv_activations.py - Script for displaying activation of each conv layer as image
show_vgg_structure.py - Script that will print all layers of vgg19 usable for perceptual loss
parse_hr_image.py - Script to parse large images to small ones (WIP)
Note: Some utility scripts have its settings in settings folder

Results

SRGAN Results - (Upscaled by opencv, Original, Upscaled by SRGAN)

Pretrain

For my dataset ideal pretrain of generator is something around 50k episodes \

No pretrain, 400k episodes
50k pretrain, 400k episodes
200k pretrain, 400k episodes

Used models

    Generator / Discriminator (Critic)
1. mod_srgan_exp / mod_base_9layers
2. mod_srgan_exp_sn / mod_base_9layers_sn

TODO

Current tasks

- Testing best working model pairs for WGAN
- Refactoring
- Retraining all SRGAN models
- Testing efects of pretrain on SRGAN model

Notes

Testing of Charbonnier loss for SRGAN failed because the values were too different from MSE loss values, maybe more tweaking required and test again.
MAE loss is causing lot of artifacts and image distortions (like color shifting, "image bleedoff", etc) in results from SRGAN. \

Testing setup

Hardware:
    Processor: I7-9700KF 4.8GHz
    RAM: HyperX Fury RGB 32GB (2x16GB) DDR4 3200MHz
    GPU: GIGABYTE GeForce RTX 2080 SUPER 8G
    SSD: Intel 660p M.2 2TB SSD NVMe

Editor: PyCharm (always latest version)

Resources

Name		Name	Last commit message	Last commit date
Latest commit History 501 Commits
datasets		datasets
media/srgan_results		media/srgan_results
modules		modules
settings		settings
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
generate_images.py		generate_images.py
parse_hr_image.py		parse_hr_image.py
preprocess_dataset.py		preprocess_dataset.py
requirements.txt		requirements.txt
show_vgg_structure.py		show_vgg_structure.py
train_dcgan.py		train_dcgan.py
train_srgan.py		train_srgan.py
train_wgan.py		train_wgan.py
upscale_images.py		upscale_images.py
visualize_conv_activations.py		visualize_conv_activations.py

License

Tubbz-alt/GAN-Playground

Folders and files

Latest commit

History

Repository files navigation

GAN Testing Playground (WIP)

Project about testing techniques about training GANs and their stability

Table of contents

General info

Content

Project Folder Structure

Setup

Dependencies

Usage

Utility

Results

SRGAN Results - (Upscaled by opencv, Original, Upscaled by SRGAN)

Pretrain

Used models

TODO

Current tasks

Notes

Testing setup

Resources

Basic DCGAN

WGAN (Wasserstein GAN)

WGAN-GP

SRGAN (Super Resolution GAN)

SR Resnet

ESDR (Enhanced Deep Residual Networks for Single Image Super-Resolution)

Perceptual Loss

GAN Stability and diagnostics

Gradient accumulation

Some resources might be missing, I started researching this topic long before this repository was created!

About

Resources

License

Stars

Watchers

Forks

Languages