Skip to content

lucasmllr/text_recogition_project

Repository files navigation

Text Recognition Project

This is a simple document reading program that is made to read machine printed text and has a function to detect IBANs. It was a student project and is far from working perfectly. However, with text that is not blurry the results are ok. The program mainly consists of two parts, a character extractor and a classifier. Character extraction can either be done by connected component analysis in a binary image made from the input or by extracting maximally stable extremal regions from the input. Classification is done by a small convolutional neural network.

Dependencies

The code is written in python 3.6 and requires the following packages:

Installation

To use the program clone this repository, navigate to it in the terminal and run the interface:

$ python interface.py

Documentation

A documentation is available on this repository's GitHub page.

Report

For a theoretical background on the used algorithms and for results see the report. For a visualization of the individual steps take a look at this notebook.

Authors

Roman Remme, Lucas-Raphael Mueller, Lucas Moeller

About

Machine printed text recognition with IBAN detection.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published