EdgarMCR/ocr_try_out

Aim

Parse documents with a regular layout using OCR. This is hacky code with bad style, where I mix, e.g., lists and NumPy arrays freely, so use with care.

Installation

For this code to run, the OCR engine Tesseract must be installed. This is possible on both Linux and Windows. On Ubuntu, sudo apt install tesseract-ocr usually does the trick.

On Windows, it might also be necessary to install poppler.
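
To check whether the external tools are reachable before running anything, a small script along these lines may help. It is only a sketch and not part of this repository: the function name find_missing_tools is made up for illustration, and it assumes that the relevant executables are tesseract and poppler's pdftoppm and that they are expected on the PATH.

```python
import shutil


def find_missing_tools(tools=("tesseract", "pdftoppm")):
    """Return the names in `tools` that cannot be found on the PATH.

    Hypothetical helper: the default tool names are an assumption
    (Tesseract's CLI and poppler's pdftoppm), not taken from this repo.
    """
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = find_missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All external tools found.")
```

If a tool is reported as missing on Windows even though it is installed, the likely cause is that its install folder has not been added to the PATH environment variable.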
