lekha_OCR_1.0

Note: This project is under development. All modules are written in python. For image processing opencv is used

lekha_OCR_1.0

Printed text recognizer for Malayalam, Lekha OCR is an optical character recognizer trained for the recognition of printed malayalam Documents.

Prerequirements

OpenCV 2.4.11 Python 2.7.9

Usage

Currently only block recognition is available. Layout analysis is at devoloping stage. To recognize a scanned malayalam document and get the malayalam characters as output.

$./lekhaocr <filename>

for example

$./lekhaocr Example/dc_books_page.png

Supporters

This project is funded by ICFOSS, technically guided under space-kerala.

Contributors

Arun Joseph contributed most of the engine devolopments. Jithin Thankachan contributed some additional features, training tool and helped in documentatiuon. Rijoy V contributed in initial research. Ambily Sreekumar contibuted in building data set for training. Arun M helped in project mangement and technical assistace.

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
Example		Example
corrected_docs		corrected_docs
train_images		train_images
LICENSE		LICENSE
README.md		README.md
after_pp_thresholding.png		after_pp_thresholding.png
alphabet.txt		alphabet.txt
application.py		application.py
before_pp_thresholding.png		before_pp_thresholding.png
faces1.py		faces1.py
images.gitignore		images.gitignore
initial_temp.py		initial_temp.py
label		label
lekhaocr		lekhaocr
main.py		main.py
path.py		path.py
path.pyc		path.pyc
preprocess.py		preprocess.py
preprocess.pyc		preprocess.pyc
svm_class.xml		svm_class.xml
t_img_in.png		t_img_in.png
training.py		training.py
training.pyc		training.pyc

License

theidentity/lekha_OCR_1.0

Folders and files

Latest commit

History

Repository files navigation

lekha_OCR_1.0

Prerequirements

Usage

Supporters

Contributors

About

Resources

License

Stars

Watchers

Forks

Languages