Skip to content

ethan-hunt-007/newsSitesCrawl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

newsSitesCrawl

Crawls some specified news sites for a particular name and checks for others people's name in the scraped data.

***This PROJECT was made during my internship at Indian Institute of Management-Calcutta.

Import the functions in the news_bot.py and call them with required name to query in those news sites. Some sites have been crawled using requests and others with selenium.

The news_matrix_fill.py is used to create matrix combination of appeareance of names of two or more people together in some news on those sites and to show the count and links for those news.

NB: The sets given with this project was needed for my project so this doesnt need to be download by everyone.

About

Crawls some specified news sites for a particular name and checks for others people's name in the scraped data.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages