Text Recognition Project

This is a simple document-reading program for machine-printed text, with a function to detect IBANs. It was a student project and is far from perfect, but on text that is not blurry the results are acceptable. The program consists of two main parts: a character extractor and a classifier. Characters are extracted either by connected-component analysis on a binarized version of the input image or by extracting maximally stable extremal regions (MSER) from the input. Classification is done by a small convolutional neural network.
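
The connected-component route can be illustrated with a minimal sketch. It is not the project's own code: the function name `extract_components`, the fixed threshold of 128, and the assumption of dark text on a light background are all hypothetical choices made for this example, and it uses `scipy.ndimage` rather than whatever the repository implements internally.

```python
import numpy as np
from scipy import ndimage


def extract_components(gray, threshold=128):
    """Return bounding boxes (top, left, bottom, right) of dark blobs.

    Hypothetical helper for illustration; assumes dark text on a light
    background and a fixed global threshold.
    """
    # Binarize: pixels darker than the threshold count as foreground.
    binary = gray < threshold
    # Label 8-connected foreground regions (diagonal neighbours included).
    structure = np.ones((3, 3), dtype=bool)
    labels, _count = ndimage.label(binary, structure=structure)
    # find_objects yields one (row_slice, col_slice) pair per label.
    boxes = [
        (rows.start, cols.start, rows.stop, cols.stop)
        for rows, cols in ndimage.find_objects(labels)
    ]
    # Order candidates left to right, as characters on one line would be read.
    return sorted(boxes, key=lambda box: box[1])


# Synthetic "page": white background with two black rectangles as stand-in glyphs.
page = np.full((20, 30), 255, dtype=np.uint8)
page[5:15, 3:8] = 0
page[5:15, 12:18] = 0
boxes = extract_components(page)
```

Each bounding box could then be cropped from the image and passed to a classifier. Real scans would likely need an adaptive threshold and some filtering of very small or very large components.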

Dependencies

The code is written in Python 3.6 and requires the following packages:

NumPy
SciPy
scikit-learn
Pillow
PyTorch
OpenCV
Matplotlib
os (part of the Python standard library, no installation needed)
pyperclip
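
Before running the program, it may help to confirm that the packages above are importable. The following is a small sketch, not part of the repository: the helper `missing_packages` and the `REQUIRED` mapping are hypothetical names, and the mapping lists the usual import name for each package.

```python
import importlib.util

# Display name -> import name (for example, Pillow is imported as PIL
# and OpenCV as cv2).
REQUIRED = {
    "NumPy": "numpy",
    "SciPy": "scipy",
    "scikit-learn": "sklearn",
    "Pillow": "PIL",
    "PyTorch": "torch",
    "OpenCV": "cv2",
    "Matplotlib": "matplotlib",
    "pyperclip": "pyperclip",
}


def missing_packages(required=REQUIRED):
    """Return the display names of packages that cannot be found."""
    return [
        name
        for name, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages were found.")
```

The check only looks up each module without importing it, so it is fast but does not verify versions.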

Installation

To use the program, clone this repository, navigate to it in a terminal, and run the interface:

$ python interface.py

Documentation

Documentation is available on this repository's GitHub page.

Report

For the theoretical background of the algorithms used and for results, see the report. For a visualization of the individual steps, take a look at this notebook.

Authors

Roman Remme, Lucas-Raphael Mueller, Lucas Moeller
