virajds/PDFTools

Contributors Forks Stargazers Issues MIT License LinkedIn


PDF Scraper and Language Detector

A technical overview of scraping tabular data from PDFs and detecting the language of a PDF document.


About The Project

This project handles two tasks:

  1. Export tabular data from PDFs to JSON
  2. Detect the language of both image-based and text-based PDFs
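The JSON export step in the first task can be pictured with a minimal sketch. It assumes a table has already been extracted from a PDF page as a list of rows, with the first row holding the column headers. The function name `table_to_json` and the sample rows are hypothetical and are not taken from this repository; only the Python standard library is used.

```python
import json


def table_to_json(rows, indent=2):
    """Convert a table (header row plus data rows) into a JSON array of objects.

    `rows` is assumed to be a list of lists of strings, as a PDF table
    extractor might produce. This helper is illustrative, not the project's API.
    """
    if not rows:
        return "[]"
    header, *body = rows
    # zip() pairs each cell with its column header; cells beyond the
    # header length are silently dropped.
    records = [dict(zip(header, row)) for row in body]
    return json.dumps(records, indent=indent, ensure_ascii=False)


# Hypothetical rows standing in for a table extracted from a PDF page.
sample_rows = [
    ["name", "quantity"],
    ["widget", "4"],
    ["gadget", "7"],
]

print(table_to_json(sample_rows))
```

Using one object per row keeps each record self-describing, which is usually easier to consume downstream than a bare array of arrays.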

Built With

This is an open-source project, built with:

Getting Started

To get started, clone the GitHub repository and install the required packages.

Prerequisites

  • Python 3.8
sudo apt update -y
sudo apt install python3.8
  • Create a virtual environment
cd <Project DIR>
python3 -m venv venv
  • Activate the virtual environment
source venv/bin/activate
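After activation, a short script can confirm that the interpreter is running inside the virtual environment and meets the Python version listed above. This is a sketch only: the helper names `in_virtualenv` and `python_is_supported` are hypothetical, and treating 3.8 as a minimum rather than an exact requirement is an assumption.

```python
import sys


def in_virtualenv():
    """Return True when the interpreter runs inside a venv.

    Inside a venv, sys.prefix points at the environment directory while
    sys.base_prefix still points at the base Python installation.
    """
    return sys.prefix != sys.base_prefix


def python_is_supported(version_info=None, minimum=(3, 8)):
    """Return True when the (major, minor) version is at least `minimum`.

    Assumption: 3.8 is treated as a lower bound, not an exact pin.
    """
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= minimum


print("Virtual environment active:", in_virtualenv())
print("Python version supported:", python_is_supported())
```

Run it with the same `python3` that created the environment; if the first line prints `False`, the environment is probably not activated in the current shell.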

Installation

  • Clone the repo
git clone https://github.com/virajds/PDFTools
  • Install packages
pip install -r PDFTools/requirements.txt
