DCGAN with an ImageLoader

Tensorflow implementation of a DCGAN I wrote for a school project. The implementation uses a ImageLoader which is responsible for feeding the model with learning data during learning and can be repurposed for differend kinds od data. Here I used it for the CelebA data set and for the speech commands dataset where the audio is transformed into an SFTF image representation.

ImageLoader usage

The ImageLoader is originally implemented to give cropped images of the CelebA dataset, but you can write a subclass that can essentially work with any kind of data, as long as the new subclass is going to return data that is in a 3D shape.

To repurpose it, create a new class with the ImageLoader parrent. The main method that probably would need to get overridden is _get_image(), which transforms the original data into a desired 3D shape.

For an example see the SCC class, which is an example of an ImageLoader subclass that reads audio data and constructs a polar form STFT which can be modeled by the DCGAN.

Needed libraries

Learning

CelebA

Learning was done on a laptop with a Nvidia GTX-860M card and went on only for 14 epochs.

SCC

Since the dataset was smaller I could achieve a lager number of epochs, 87 in total.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
src		src
test		test
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src

src

test

test

README.md

README.md

Repository files navigation

DCGAN with an ImageLoader

ImageLoader usage

Needed libraries

Learning

CelebA

SCC

About

Releases

Packages

Languages

sircelj/dcgan-imgldr

Folders and files

Latest commit

History

Repository files navigation

DCGAN with an ImageLoader

ImageLoader usage

Needed libraries

Learning

CelebA

SCC

About

Resources

Stars

Watchers

Forks

Languages