Auger

Auger is a GUI OCR tool for extracting text from images.

Have a screenshot but need it as a text file? Then Auger is the tool for you!

You can select multiple regions of text within an image and format the results yourself.

Features

Formatting

Auger offers you two ways of formatting your output within the program:

  • HTML, with both a WYSIWYG and raw code view
  • Text, with customizable font and font size

Images

Any image format compatible with the Qt library is compatible with Auger.
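
Which formats that covers depends on the Qt build and its installed image plugins. As a rough sketch, assuming PyQt5 is installed, you can ask Qt directly through `QImageReader.supportedImageFormats()`. The helper names `format_names` and `qt_supported_formats` are illustrative only and are not part of Auger.

```python
def format_names(raw_formats):
    """Normalize byte-string format IDs into sorted, lowercase text."""
    return sorted({bytes(fmt).decode("ascii").lower() for fmt in raw_formats})


def qt_supported_formats():
    """Ask the installed Qt which image formats it can read."""
    # Imported lazily so format_names() stays usable without PyQt5.
    from PyQt5.QtGui import QImageReader

    return format_names(QImageReader.supportedImageFormats())
```

Calling `qt_supported_formats()` on your own machine shows the list that applies to your installation.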

Languages

Languages supported by your OCR backend (e.g., Tesseract) are supported by Auger. Pick the language, select part of the image, and boom! It's that simple.
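
The same workflow can be approximated outside the GUI. The sketch below is not Auger's own code: it assumes pyocr and Pillow are installed along with a backend such as Tesseract, and the function names, the file name `screenshot.png`, and the box coordinates are hypothetical.

```python
def ocr_regions(tool, image, boxes, lang="eng"):
    """Run OCR on each (left, upper, right, lower) box and return the texts."""
    results = []
    for box in boxes:
        region = image.crop(box)  # Pillow's Image.crop takes a 4-tuple box
        results.append(tool.image_to_string(region, lang=lang).strip())
    return results


def first_available_tool():
    """Return the first OCR backend pyocr can find, or raise if there is none."""
    # Imported lazily; needs pyocr plus a backend such as Tesseract.
    import pyocr

    tools = pyocr.get_available_tools()
    if not tools:
        raise RuntimeError("No OCR backend found; install Tesseract or similar.")
    return tools[0]


# Example usage (requires Pillow, pyocr, and an OCR backend):
#   from PIL import Image
#   tool = first_available_tool()
#   print(tool.get_available_languages())   # language codes the backend offers
#   image = Image.open("screenshot.png")    # hypothetical file name
#   print(ocr_regions(tool, image, [(0, 0, 400, 60), (0, 80, 400, 140)]))
```

Passing the tool and image in as arguments keeps `ocr_regions` independent of any particular backend, so swapping Tesseract for another pyocr tool needs no code changes.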

Output

Auger supports output into the following formats:

  • Plain Text
  • HTML
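
To illustrate the difference between the two formats, here is a minimal sketch using only the Python standard library. It is an assumption about how per-region OCR results could be serialized, not a description of how Auger writes its files; `to_plain_text` and `to_html` are hypothetical names.

```python
import html


def to_plain_text(regions):
    """Join OCR results with one blank line between regions."""
    return "\n\n".join(regions)


def to_html(regions):
    """Wrap each OCR result in a <p> element, escaping markup characters."""
    paragraphs = "\n".join(f"<p>{html.escape(text)}</p>" for text in regions)
    return f"<!DOCTYPE html>\n<html><body>\n{paragraphs}\n</body></html>\n"
```

Escaping matters for the HTML case because recognized text may contain characters such as `<` or `&` that would otherwise be read as markup.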

Installation

Installing Auger is easy...

From a Release:

Auger is available as binary distributions for both Windows and Linux. You can get them from the repository's Releases page.

From the Git Repository:

You can also clone the repo and set up a virtual environment for running Auger:

From Source Directory:

    git clone https://github.com/m-flak/auger auger
    cd auger
    pip install -r requirements.txt
    python auger.py

Using setuptools:

    git clone https://github.com/m-flak/auger auger
    cd auger
    python setup.py build  # you can use any command supported by setuptools
    cd build/lib; python -m auger

Requirements

  • PyQt5
  • Pillow
  • pyocr
  • lxml
  • iso-language-codes
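
If Auger fails to start from source, a quick check of which requirements are importable can help. The sketch below uses the standard library's `importlib.util.find_spec`. The mapping from package names to import names is an assumption (in particular `PIL` for Pillow and `iso_language_codes` for iso-language-codes), and `missing_packages` is a hypothetical helper.

```python
import importlib.util

# Assumed mapping: package name as listed above -> module name to import.
REQUIRED_MODULES = {
    "PyQt5": "PyQt5",
    "Pillow": "PIL",
    "pyocr": "pyocr",
    "lxml": "lxml",
    "iso-language-codes": "iso_language_codes",
}


def missing_packages(required=REQUIRED_MODULES):
    """Return the package names whose module cannot be found."""
    return [
        package
        for package, module in required.items()
        if importlib.util.find_spec(module) is None
    ]
```

An empty list from `missing_packages()` suggests that `pip install -r requirements.txt` completed in the active environment.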

Screenshots

December 31st, 2019: Screenshot_12-31-2019

December 24th, 2019: Screenshot_12-24-2019

December 16th, 2019: Screenshot_12-16-2019

December 14th, 2019: Screenshot_12-14-2019