omshivaprakash/egazette

 
 

egazette

Modules to be installed:

  1. BeautifulSoup 4: https://www.crummy.com/software/BeautifulSoup/

  2. python-magic: https://github.com/ahupp/python-magic

  3. PyPDF2: https://github.com/mstamy2/PyPDF2

  4. Pillow: https://pypi.org/project/Pillow/2.2.1/

  5. tesseract: https://github.com/tesseract-ocr/
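
As a quick pre-flight check, the following sketch reports which of the modules above are not yet installed. The import names (bs4, magic, PyPDF2, PIL) are the usual ones for these packages but are not stated in this README, and the helper names are hypothetical. Tesseract is an external program rather than a Python module, so it is looked up on PATH instead.

```python
import importlib.util
import shutil

# Assumed import names for the packages listed above:
# BeautifulSoup 4 -> bs4, python-magic -> magic, PyPDF2 -> PyPDF2, Pillow -> PIL.
REQUIRED_MODULES = ["bs4", "magic", "PyPDF2", "PIL"]


def missing_modules(names):
    """Return the module names from `names` that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def has_tesseract():
    """Return True if a `tesseract` executable is found on PATH."""
    return shutil.which("tesseract") is not None


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing Python modules:", ", ".join(missing))
    if not has_tesseract():
        print("tesseract executable not found on PATH")
    if not missing and has_tesseract():
        print("All listed dependencies were found")
```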

Usage: python sync.py [-l level(critical, error, warn, info, debug)]
                      [-a (all_downloads)]
                      [-m (updateMeta)]
                      [-n (no aggregation of srcs by hostname)]
                      [-r (updateRaw)]
                      [-f logfile]
                      [-t fromdate (DD-MM-YYYY)] [-T todate (DD-MM-YYYY)]
                      [-s central_weekly -s central_extraordinary -s central
                       -s states
                       -s andhra -s andhraarchive
                       -s bihar
                       -s chattisgarh -s cgweekly -s cgextraordinary
                       -s delhi -s delhi_weekly -s delhi_extraordinary
                       -s karnataka
                       -s maharashtra -s telangana   -s tamilnadu
                       -s jharkhand   -s odisha      -s madhyapradesh
                       -s punjab      -s uttarakhand -s himachal
                       -s haryana     -s kerala      -s haryanaarchive
                      ]
                      [-D datadir]
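
To illustrate how the options in the usage text combine, here is a minimal sketch that assembles a sync.py command line as an argument list. The function name build_sync_command and the example values are hypothetical; only the flags -l, -t, -T, -s and -D come from the usage text above.

```python
def build_sync_command(sources, fromdate=None, todate=None,
                       datadir=None, loglevel=None):
    """Build an argument list for sync.py from the documented flags.

    `sources` is a list of source names such as "karnataka" or "central";
    each one is passed with its own -s flag, as in the usage text.
    """
    cmd = ["python", "sync.py"]
    if loglevel is not None:
        cmd += ["-l", loglevel]
    if fromdate is not None:
        cmd += ["-t", fromdate]      # DD-MM-YYYY
    if todate is not None:
        cmd += ["-T", todate]        # DD-MM-YYYY
    for source in sources:
        cmd += ["-s", source]
    if datadir is not None:
        cmd += ["-D", datadir]
    return cmd


if __name__ == "__main__":
    # Example values only; "gazettes" is an arbitrary directory name.
    command = build_sync_command(
        ["karnataka", "kerala"],
        fromdate="01-01-2020",
        todate="31-01-2020",
        datadir="gazettes",
    )
    print(" ".join(command))
```

The resulting list can be passed to subprocess.run from the repository directory, which avoids shell quoting issues.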

The program downloads gazettes from various egazette sites and places them in the specified data directory. Gazettes are stored in directories named by type and date. If fromdate or todate is not specified, it defaults to the current date.
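
The date handling described above can be sketched as follows. This is an illustration of the DD-MM-YYYY format and the current-date default, not the code sync.py actually uses; the names parse_gazette_date and resolve_range, and the `today` parameter (added so the default can be fixed in tests), are hypothetical.

```python
from datetime import date, datetime

DATE_FORMAT = "%d-%m-%Y"  # DD-MM-YYYY, as in the -t and -T options


def parse_gazette_date(text=None, today=None):
    """Parse a DD-MM-YYYY string; fall back to the current date if omitted."""
    if text is None:
        return today if today is not None else date.today()
    return datetime.strptime(text, DATE_FORMAT).date()


def resolve_range(fromdate=None, todate=None, today=None):
    """Return (start, end) dates, defaulting each missing end to today."""
    return (parse_gazette_date(fromdate, today),
            parse_gazette_date(todate, today))


if __name__ == "__main__":
    start, end = resolve_range(fromdate="01-01-2020")
    print("from", start.isoformat(), "to", end.isoformat())
```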

About

A project to download and process gazettes (Govt Notifications) from India

License

COPYING: Apache-2.0
LICENSE: Unknown
