Handwritten-Text-Recognition

Working Procedure

Paragraph Segmentation : Detect that area of the image which contains handwritten text

Word Detection : Detect the words in the area which we got after paragaraph segmentation.
Line Identification : Identify the lines and the words present in that line by checking the vertical overlap between the words

Handwriting Recognition Now for each of the lines we apply a pretrained model to detect the text present.

Error Correction(Denoising) After detecting the text we do denoising.For this we use a pre-trained model which tends to check that vocabulary,grammar of the text. If the text identified is not there in english vocabulary then it predicts the closest matching words for that text and we then replace the previous one with later one.

Pretrained Models

To get the pretrained models run this command on anaconda prompt:

python get_models.py

Output :

To see output along with the code please open : https://github.com/saurabh9450150287/Handwritten-Text-Recognition/blob/master/0_handwriting_ocr.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.pylint.d		.pylint.d
dataset/fonts		dataset/fonts
images		images
models		models
ocr		ocr
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
.travis.yml		.travis.yml
0_handwriting_ocr-Copy1.ipynb		0_handwriting_ocr-Copy1.ipynb
0_handwriting_ocr.ipynb		0_handwriting_ocr.ipynb
0_handwriting_ocr.py		0_handwriting_ocr.py
1_a_paragraph_segmentation_msers.ipynb		1_a_paragraph_segmentation_msers.ipynb
1_a_paragraph_segmentation_msers.py		1_a_paragraph_segmentation_msers.py
1_b_paragraph_segmentation_dcnn.ipynb		1_b_paragraph_segmentation_dcnn.ipynb
1_b_paragraph_segmentation_dcnn.py		1_b_paragraph_segmentation_dcnn.py
2_line_word_segmentation.ipynb		2_line_word_segmentation.ipynb
2_line_word_segmentation.py		2_line_word_segmentation.py
3_handwriting_recognition.ipynb		3_handwriting_recognition.ipynb
3_handwriting_recognition.py		3_handwriting_recognition.py
4_text_denoising.ipynb		4_text_denoising.ipynb
4_text_denoising.py		4_text_denoising.py
5_a_character_error_distance.ipynb		5_a_character_error_distance.ipynb
5_a_character_error_distance.py		5_a_character_error_distance.py
5_b_visual_distance.ipynb		5_b_visual_distance.ipynb
5_b_visual_distance.py		5_b_visual_distance.py
Azure.md		Azure.md
OUTPUT.ipynb		OUTPUT.ipynb
README.md		README.md
bench.sh		bench.sh
benchmark.py		benchmark.py
credentials.json		credentials.json
credentials.json.example		credentials.json.example
get_models.py		get_models.py
pylev.py		pylev.py
setup.py		setup.py
test2.png		test2.png
test3.png		test3.png
test4.png		test4.png
test5.png		test5.png
test6.png		test6.png
tests.py		tests.py

saurabh9450150287/Handwritten-Text-Recognition

Folders and files

Latest commit

History

Repository files navigation

Handwritten-Text-Recognition

Working Procedure

Pretrained Models

Output :

About

Resources

Stars

Watchers

Forks

Languages