corePy Work Flow Download all pdfs using getPDF.bash script Convert all the pdfs to pgm using pdftoppm utilities ...........