RandalMoss/pdf-search

pdf-search

A pipelined project for extracting text from PDFs.

This is currently a rough prototype.

Technologies used:

- Tika 1.8.8
- Scikit-image 0.11.3
- Ghostscript
- Tesseract 3.02.02
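As a rough illustration of how a pipeline built on these tools might be wired together, the sketch below renders PDF pages to images with Ghostscript and runs Tesseract on each page. It is not taken from this repository: the function names (`ghostscript_cmd`, `tesseract_cmd`, `ocr_pdf`), the 300 dpi default, and the `page-%03d.png` naming are assumptions chosen for the example, and it assumes the `gs` and `tesseract` executables are on the `PATH`. Tika and scikit-image steps (for example, direct text extraction or image cleanup) are left out.

```python
import subprocess
from pathlib import Path


def ghostscript_cmd(pdf_path, out_dir, dpi=300):
    """Build a Ghostscript command that renders each PDF page to a PNG.

    The output pattern and dpi are illustrative choices, not project settings.
    """
    pattern = str(Path(out_dir) / "page-%03d.png")
    return [
        "gs", "-dNOPAUSE", "-dBATCH", "-dSAFER",
        "-sDEVICE=png16m", f"-r{dpi}",
        f"-sOutputFile={pattern}", str(pdf_path),
    ]


def tesseract_cmd(image_path):
    """Build a Tesseract command for one page image.

    Tesseract appends '.txt' to the output base name on its own.
    """
    image_path = Path(image_path)
    return ["tesseract", str(image_path), str(image_path.with_suffix(""))]


def ocr_pdf(pdf_path, out_dir, run=subprocess.check_call):
    """Hypothetical two-stage pipeline: rasterize pages, then OCR each one.

    `run` is injectable so the stages can be exercised without the tools.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    run(ghostscript_cmd(pdf_path, out_dir))
    texts = []
    for image in sorted(out_dir.glob("page-*.png")):
        run(tesseract_cmd(image))
        texts.append(image.with_suffix(".txt").read_text(encoding="utf-8"))
    return "\n".join(texts)
```

Calling `ocr_pdf("scan.pdf", "work")` would, under these assumptions, leave one PNG and one text file per page in `work` and return the concatenated text. Keeping each stage as a separate command builder makes it straightforward to swap in another stage later.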
