H2L

OCR for handwritten math expressions.

Introduction

H2L recognizes handwritten mathematical equations using neural networks. For now, we combine a convolutional net with traditional heuristic techniques.

Internal structure

Everything begins at the function heuristicGenerate in evaluate.py.

Steps for generating PDF file (internal):

1. Crop the area of the detected text with crop_image.
2. Convert to grayscale and binarize with binarizeNd.
3. Segment all the lines using line_segmenter.segment.
4. Construct an equation from each line image using build_equation:
   1. Segment the characters in a line with heuristicSegmenter.segmenter.
   2. Check whether each character is a superscript or a subscript.
   3. Predict the characters from the segmented character images with characterRecognizer.recognizer.
   4. Return the resulting equation for the line image.
5. Translate the equation strings into a .tex file and call pdflatex to compile it.
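
The superscript/subscript check and the final translation step can be sketched roughly as follows. This is only an illustration under stated assumptions: the `Symbol` tuple, its `level` labels, and the function names `symbols_to_latex`, `write_tex`, and `compile_pdf` are hypothetical and are not H2L's actual API.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List, NamedTuple


class Symbol(NamedTuple):
    """A recognized character (hypothetical structure, not H2L's own)."""
    text: str
    level: str  # "normal", "super" or "sub"


def symbols_to_latex(symbols: List[Symbol]) -> str:
    """Join symbols into a LaTeX math string, grouping adjacent
    superscript/subscript characters into one ^{...} or _{...}."""
    out = []
    i = 0
    while i < len(symbols):
        level = symbols[i].level
        j = i
        # Extend the run while the level stays the same.
        while j < len(symbols) and symbols[j].level == level:
            j += 1
        run = "".join(s.text for s in symbols[i:j])
        if level == "super":
            out.append("^{" + run + "}")
        elif level == "sub":
            out.append("_{" + run + "}")
        else:
            out.append(run)
        i = j
    return "".join(out)


def write_tex(equations: List[str], tex_path: Path) -> None:
    """Write one display equation per entry into a minimal document."""
    body = "\n".join("\\[ " + eq + " \\]" for eq in equations)
    tex_path.write_text(
        "\\documentclass{article}\n\\begin{document}\n"
        + body
        + "\n\\end{document}\n"
    )


def compile_pdf(tex_path: Path) -> bool:
    """Run pdflatex if it is installed; return True on success."""
    if shutil.which("pdflatex") is None:
        return False
    result = subprocess.run(
        ["pdflatex", "-interaction=nonstopmode", tex_path.name],
        cwd=tex_path.parent,
        capture_output=True,
    )
    return result.returncode == 0
```

For example, `symbols_to_latex([Symbol("x", "normal"), Symbol("2", "super")])` yields `x^{2}`. How H2L itself decides the level of a character is not shown here; the sketch assumes that decision has already been made.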

Training the neural networks:

Originally, the code contained several neural nets for different tasks, but most of them performed poorly, so only one of them is still used, for recognition. At first I tried similar techniques for segmentation, but the results were very poor, so all segmentation is implemented with heuristic techniques.
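
To give an idea of what a heuristic segmenter can look like, here is a minimal sketch based on projection profiles: a row or column without ink is treated as a gap. It is an assumption about one common approach, not the code in line_segmenter or heuristicSegmenter, and all names in it are hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]  # rows of grayscale pixels, 0 (black) to 255 (white)


def binarize(image: Image, threshold: int = 128) -> Image:
    """Map dark pixels (ink) to 1 and light pixels (paper) to 0."""
    return [[1 if px < threshold else 0 for px in row] for row in image]


def find_runs(profile: List[int]) -> List[Tuple[int, int]]:
    """Return half-open (start, end) index ranges where profile > 0."""
    runs = []
    start = None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i
        elif value == 0 and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def segment_lines(binary: Image) -> List[Tuple[int, int]]:
    """Split the image into text lines wherever a row contains no ink."""
    return find_runs([sum(row) for row in binary])


def segment_columns(binary: Image, top: int, bottom: int) -> List[Tuple[int, int]]:
    """Split one line (rows top..bottom) wherever a column contains no ink."""
    rows = binary[top:bottom]
    if not rows:
        return []
    profile = [sum(col) for col in zip(*rows)]
    return find_runs(profile)
```

A plain projection profile cannot separate characters that touch or overlap horizontally, such as the contents of a fraction or a square root, so a real segmenter needs extra rules on top of it.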

The code for training the neural networks is in the trainer directory, the trained models are in the models directory, and the configuration of the learning parameters is in the configuration directory. Use train.py to start the training process.
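
As an illustration of keeping learning parameters in a configuration file, the sketch below parses them with Python's standard configparser module. The section name and the keys (`training`, `learning_rate`, `batch_size`, `epochs`) are hypothetical; the actual files in the configuration directory may be organized differently.

```python
import configparser

# Hypothetical configuration text; the real files in the configuration
# directory may use different sections and keys.
EXAMPLE_CFG = """
[training]
learning_rate = 0.001
batch_size = 64
epochs = 20
"""


def load_training_params(text: str) -> dict:
    """Parse the (hypothetical) [training] section into typed values."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    section = parser["training"]
    return {
        "learning_rate": section.getfloat("learning_rate"),
        "batch_size": section.getint("batch_size"),
        "epochs": section.getint("epochs"),
    }


params = load_training_params(EXAMPLE_CFG)
```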

Used model

For now, we only use a convolutional net, and only for character recognition: either a modified version of res-50 (ResNet-50) or a simple CNN.

Used dataset

Misc

The code is glued together, and since the project is still in an experimental phase, there is ~~lots of~~ deprecated but not yet removed code in this repository. This code will be cleaned up in the future.

What to do next

Recognizing the characters remains a major problem. Although many papers address image recognition, the performance they report differs from what we see in practice. The main task now is therefore to improve recognition performance.

Collaborators

JZP @JZPHome (https://github.com/jzphome)

To-do list for fis: [0/4]

[ ] A better character segmenter
[ ] A better character recognizer
[ ] Character segmentation check for o shape
[ ] Add more tests
