Shadow Removal (Work in Progress)

Training a Convolutional Neural Network (CNN) to detect and remove unwanted shadows from smartphone document captures.

Built with

Python
Docker
OpenCV (image manipulation)
Pathos (multiprocessing)
Noise (Perlin noise)

Data Synthesis

Silhouettes

A set of manually drawn silhouettes is used to create realistic shadows on a set of document images.

Operations on silhouettes:

Noise is added to the silhouette with a Perlin noise mask (see the noise module).
The silhouette is blurred with a Gaussian convolution (see OpenCV Gaussian blurring).
The silhouette is randomly scaled (200-500%).
The silhouette transparency is randomly chosen (0.4-0.7).
The silhouette is padded with empty pixels (or trimmed) so that it has the same dimensions as the document image.
The final image is a linear blend between the original document image and the silhouette image.

Original silhouette image	Silhouette image after operations	Silhouette image applied on document image
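
The last three silhouette operations (transparency, padding or trimming, and the linear blend) can be sketched with NumPy alone. This is a minimal illustration, not the repository's code: the names fit_to_document and apply_shadow are hypothetical, the silhouette is assumed to be a float mask in [0, 1] where 1 means "inside the shadow", the 0.4-0.7 transparency is assumed to act as the opacity of a black shadow layer, and the silhouette is assumed to be anchored at the top-left corner. The noise, blur, and scaling steps are left out.

```python
import numpy as np


def fit_to_document(silhouette, doc_shape):
    """Pad with empty (zero) pixels or trim so the mask matches the document size."""
    height, width = doc_shape[:2]
    fitted = np.zeros((height, width), dtype=np.float32)
    h = min(height, silhouette.shape[0])
    w = min(width, silhouette.shape[1])
    # Assumption: the silhouette is anchored at the top-left corner.
    fitted[:h, :w] = silhouette[:h, :w]
    return fitted


def apply_shadow(document, silhouette, rng):
    """Darken the document where the silhouette mask is set.

    document: uint8 array of shape (H, W) or (H, W, 3).
    silhouette: float array in [0, 1]; 1 means fully inside the shadow.
    """
    # Assumption: the 0.4-0.7 "transparency" is used as shadow opacity.
    alpha = rng.uniform(0.4, 0.7)
    mask = fit_to_document(silhouette, document.shape) * alpha
    if document.ndim == 3:
        mask = mask[..., np.newaxis]  # broadcast over the colour channels
    # Linear blend with a black shadow layer:
    # out = (1 - mask) * document + mask * 0
    blended = (1.0 - mask) * document.astype(np.float32)
    return np.clip(blended, 0, 255).astype(np.uint8)
```

Passing a seeded generator such as np.random.default_rng(0) as rng makes the sampled opacity reproducible between runs.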

Document Images

Document images are aggregated from two datasets: SmartDocQA [1] and The IUPR Dataset of Camera-Captured Document Images [2]. The images from the IUPR dataset are already scanned and trimmed. The images from SmartDocQA are not, so they are trimmed in this project using OpenCV contour detection.

Operations on SmartDocQA document images:

A threshold is applied to blacken the part of the image that lies outside the document (see OpenCV threshold).
Document edges are detected using OpenCV contour detection.
Using the detected document edge box, the document image is trimmed and warped.

Original SmartDoc image	SmartDoc image after threshold application	Trimmed SmartDoc image

Training Data

To create the training data, silhouettes are generated using the aforementioned methods and applied to the document images. The training data creation procedure is as follows:

Masks and documents are identified using the uuid module.
The original document is saved as "doc_<<document uuid>>.jpg".
Masked documents are saved as "doc_<<document uuid>>_mask_<<mask uuid>>.jpg".
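
As a small sketch of this naming scheme: the helper names below are hypothetical, and uuid4 (random UUIDs) is an assumption, since the text only says that the uuid module is used.

```python
import uuid


def document_filename(document_id):
    """File name for an original, unshadowed document."""
    return f"doc_{document_id}.jpg"


def masked_filename(document_id, mask_id):
    """File name for a document with one silhouette applied."""
    return f"doc_{document_id}_mask_{mask_id}.jpg"


def document_id_of(filename):
    """Recover the document UUID from either kind of file name."""
    stem = filename[len("doc_"):-len(".jpg")]
    return uuid.UUID(stem.split("_mask_")[0])


document_id = uuid.uuid4()  # assumption: random (version 4) UUIDs
mask_id = uuid.uuid4()
print(document_filename(document_id))
print(masked_filename(document_id, mask_id))
```

Because both names start with the same document UUID, each masked image can be paired with its clean original when building (input, target) training pairs.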

To improve run time, Python's multiprocessing module and the pathos module are used to run multiple operations in parallel.
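
One way such parallelism might be arranged with the standard multiprocessing module is shown below. The function names and the one-task-per-(document, silhouette)-pair layout are assumptions for illustration, and the worker body is a placeholder rather than the real image pipeline.

```python
from itertools import product
from multiprocessing import Pool


def synthesize(task):
    """Process one (document, silhouette) pair.

    Placeholder body: a real worker would load both images, apply the
    shadow, save the result, and return the output path.
    """
    document_path, silhouette_path = task
    return f"{document_path}+{silhouette_path}"


def run_parallel(documents, silhouettes, workers=4):
    """Apply every silhouette to every document across worker processes."""
    tasks = list(product(documents, silhouettes))
    with Pool(processes=workers) as pool:
        # map splits the task list between workers and keeps input order.
        return pool.map(synthesize, tasks)
```

run_parallel should be called from under an if __name__ == "__main__": guard so that worker processes can import the module safely. pathos offers a similar map interface through pathos.multiprocessing.ProcessingPool, which serializes with dill and so accepts functions that the standard pickle module cannot handle.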

Docker image

To run the application on a Google Compute Engine server instance, a Docker image is built, pushed to Docker Hub (Docker Repository Link), and then pulled on the server.

References

[1] Nibal Nayef, Muhammad Muzzamil Luqman, Sophea Prum, Sebastien Eskenazi, Joseph Chazalon, Jean-Marc Ogier: “SmartDoc-QA: A Dataset for Quality Assessment of Smartphone Captured Document Images - Single and Multiple Distortions”, Proceedings of the sixth international workshop on Camera Based Document Analysis and Recognition (CBDAR), 2015.

[2] Bukhari, T. (2012). The IUPR Dataset of Camera-Captured Document Images. In Camera-Based Document Analysis and Recognition (pp. 164–171). Springer Berlin Heidelberg.
