wcd-2019 PDF links processing using PDFBox

Steps:
1. PDF files are downloaded to a local path.
2. PDFs are converted to text files using PDFBox.
3. Each PDF is processed using NLTK (for person names).
4. Text is processed.
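
The first three steps above could be sketched roughly as follows. This is a minimal illustration, not the project's actual code. The function names, the `pdfs` directory, and the `pdfbox-app.jar` file name are all hypothetical. It assumes the PDFBox 2.x command-line app (`ExtractText`) is available through `java`, and that NLTK is installed along with its tokenizer, POS tagger, and named-entity chunker data.

```python
import subprocess
import urllib.request
from pathlib import Path
from urllib.parse import urlparse


def local_pdf_path(url, download_dir):
    """Map a PDF link to a file path inside the local download directory."""
    # Fall back to a generic name when the URL path has no file component.
    name = Path(urlparse(url).path).name or "document.pdf"
    return Path(download_dir) / name


def download_pdf(url, download_dir):
    """Step 1: download one PDF link into the local path."""
    target = local_pdf_path(url, download_dir)
    target.parent.mkdir(parents=True, exist_ok=True)
    urllib.request.urlretrieve(url, str(target))
    return target


def pdfbox_command(pdf_path, txt_path, jar="pdfbox-app.jar"):
    """Build the PDFBox 2.x command line for text extraction.

    The jar name is an assumption. PDFBox 3.x uses a different syntax
    ("export:text -i=<input> -o=<output>").
    """
    return ["java", "-jar", jar, "ExtractText", str(pdf_path), str(txt_path)]


def pdf_to_text(pdf_path, jar="pdfbox-app.jar"):
    """Step 2: convert a PDF to a text file next to it using PDFBox."""
    txt_path = Path(pdf_path).with_suffix(".txt")
    subprocess.run(pdfbox_command(pdf_path, txt_path, jar), check=True)
    return txt_path


def person_names(text):
    """Step 3: collect PERSON entities from text with NLTK's default chunker."""
    import nltk  # imported lazily so the other helpers work without NLTK

    names = []
    for sentence in nltk.sent_tokenize(text):
        tagged = nltk.pos_tag(nltk.word_tokenize(sentence))
        tree = nltk.ne_chunk(tagged)
        for subtree in tree.subtrees(lambda t: t.label() == "PERSON"):
            names.append(" ".join(token for token, _tag in subtree.leaves()))
    return names
```

A run over a list of links would call `download_pdf`, then `pdf_to_text`, then read the resulting text file and pass its contents to `person_names`. Building the PDFBox command in a separate function keeps the version-specific syntax in one place.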