1. Indian Patents To craw all patent journal files from Indian Patent Website. Extracted text from pdf, filtered & processed it. Usage git clone https://github.com/ChillarAnand/Indian-Patents.git python extract.py path/to/journal.pdf