GitHub - Jumpst3r/wordimage2text: Dockerized text recognition

Dockerized word image to text recognition

Description

Given images of words, recognize the words and output the corresponding text.

Usage

To use the Docker image, first pull it:

docker pull jumpst3r/wordimage2text

Then run:

docker run -it --rm -v /PATH_TO_FOLDER_WITH_INPUTS:/input/figures -v /FULL_PATH_TO_OUTPUT_FOLDER/:/output/ jumpst3r/wordimage2text sh /input/script.sh /input/figures/example.png /input/figures/boxes.csv /output/

where PATH_TO_FOLDER_WITH_INPUTS is the full path to a folder containing the inputs described below, and FULL_PATH_TO_OUTPUT_FOLDER is the full path to the folder that receives the outputs.

The input consists of:

example.png: A document image containing one or more words.
boxes.csv: A CSV file containing the bounding box that delimits each word in that image.
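
For scripted runs, the command above can be assembled programmatically. The following is a minimal sketch, not part of the repository: the helper name `build_command` and the folder names `scans` and `results` are hypothetical, while the image name, mount points, and script arguments are copied from the usage line. It resolves both folders to absolute paths, since the usage text asks for full paths.

```python
from pathlib import Path
from typing import List

IMAGE = "jumpst3r/wordimage2text"


def build_command(input_dir: str, output_dir: str,
                  image_name: str = "example.png",
                  boxes_name: str = "boxes.csv") -> List[str]:
    """Assemble the `docker run` argument list shown in the usage section."""
    # The usage text asks for full host paths, so resolve both folders.
    inputs = Path(input_dir).resolve()
    outputs = Path(output_dir).resolve()
    return [
        "docker", "run", "-it", "--rm",
        "-v", f"{inputs}:/input/figures",
        "-v", f"{outputs}:/output/",
        IMAGE,
        "sh", "/input/script.sh",
        f"/input/figures/{image_name}",
        f"/input/figures/{boxes_name}",
        "/output/",
    ]


# "scans" and "results" are placeholder folder names for illustration.
cmd = build_command("scans", "results")
print(" ".join(cmd))

# To execute it (requires Docker and the pulled image):
#   import subprocess
#   subprocess.run(cmd, check=True)
# When no terminal is attached (e.g. in a cron job), drop "-it" first.
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when a folder path contains spaces.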

The output consists of:

A CSV file containing the coordinates of each word, with the detected word appended:

x1,x2,y1,y2,detected_word

A zip file containing the cropped word images and a CSV file that can be used to trace each crop back to its original position and textual content.
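
The result CSV can be read back with a few lines of Python. This is a hedged sketch: the `WordBox` type and `read_results` function are hypothetical names, the sample rows are invented for illustration, and it is an assumption that the file may or may not start with a header row, so non-numeric first rows are simply skipped.

```python
import csv
import io
from typing import List, NamedTuple


class WordBox(NamedTuple):
    x1: int
    x2: int
    y1: int
    y2: int
    word: str


def read_results(handle) -> List[WordBox]:
    """Parse rows of the form x1,x2,y1,y2,detected_word."""
    boxes = []
    for row in csv.reader(handle):
        if len(row) < 5:
            continue  # skip blank or malformed lines
        try:
            x1, x2, y1, y2 = (int(float(v)) for v in row[:4])
        except ValueError:
            continue  # assumed header row, e.g. "x1,x2,y1,y2,detected_word"
        # Re-join the tail in case the detected word itself contains a comma.
        boxes.append(WordBox(x1, x2, y1, y2, ",".join(row[4:])))
    return boxes


# Invented sample data standing in for a real output file.
sample = io.StringIO(
    "x1,x2,y1,y2,detected_word\n"
    "10,120,5,40,hello\n"
    "130,260,5,40,world\n"
)
results = read_results(sample)
print(" ".join(box.word for box in results))  # prints "hello world"
```

For a real run, replace `sample` with `open(path, newline="")` pointing at the CSV written to the output folder.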

The Docker image is also compatible with DIVAServices, a web-based framework providing streamlined access to document image analysis (DIA) methods.

Sources / Comments

Original repo
