
This Python script prints the article titles, first paragraphs, and image links under the Top Stories tab on cnn.com. Because it uses Selenium's Chrome client to interact with a JavaScript-heavy website, it might take a while to load all the data.


itay1542/cnn.com-scrape


cnn.com-scrape

This Python script prints the article titles, first paragraphs, and image links under the Top Stories tab on cnn.com. Because it drives the page through Selenium's Chrome client, loading all the data may take a while.

how to run: if you have Python on your computer, just run python cnn.com-scrape
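The flow described above (open the page in Chrome, collect each story's title, first paragraph, and image link, then print them) could look roughly like the sketch below. It is not the repository's actual code: the `Story` class, the function names, and above all the CSS selectors (`article`, `h3`, `p`, `img`) are hypothetical placeholders, since the real cnn.com markup is not given here and changes over time.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Story:
    """One scraped entry: the three fields the script prints."""
    title: str
    first_paragraph: str
    image_link: str


def format_story(story: Story) -> str:
    """Render one story as three labeled lines."""
    return (
        f"Title: {story.title}\n"
        f"First paragraph: {story.first_paragraph}\n"
        f"Image: {story.image_link}"
    )


def scrape_top_stories(url: str = "https://www.cnn.com",
                       card_selector: str = "article") -> List[Story]:
    """Open Chrome, load the page, and collect stories.

    ASSUMPTION: the selectors used here are illustrative placeholders,
    not CNN's real markup.
    """
    # Imported lazily so the helpers above work without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    driver = webdriver.Chrome()  # opens a visible browser window
    try:
        driver.get(url)
        stories = []
        for card in driver.find_elements(By.CSS_SELECTOR, card_selector):
            headings = card.find_elements(By.CSS_SELECTOR, "h3")
            paragraphs = card.find_elements(By.CSS_SELECTOR, "p")
            images = card.find_elements(By.CSS_SELECTOR, "img")
            if not headings:
                continue  # skip cards that have no headline
            stories.append(Story(
                title=headings[0].text,
                first_paragraph=paragraphs[0].text if paragraphs else "",
                image_link=(images[0].get_attribute("src") or "") if images else "",
            ))
        return stories
    finally:
        driver.quit()  # always close the browser, even on errors


# Example use (launches a real browser, so it is left commented out):
# for story in scrape_top_stories():
#     print(format_story(story))
```

Keeping the formatting in its own function means the printed layout can be checked without starting a browser.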

changes:

  1. changed variable names and spacing to fit PEP8
  2. now you can see the browser as the program runs
  3. browser changed from Firefox to Chrome for performance

notes:

  1. you need to have a Chrome driver in /usr/bin for the program to work
  2. pip install selenium if you don't have it
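The two prerequisites noted above can be checked before running the scraper. The following is a minimal sketch, not part of the repository: the function name `missing_prerequisites` is hypothetical, and it assumes the driver executable is called `chromedriver` and lives in a directory on `PATH` (which normally includes /usr/bin).

```python
import importlib.util
import shutil
from typing import List


def missing_prerequisites(driver_name: str = "chromedriver",
                          package: str = "selenium") -> List[str]:
    """Return a human-readable list of whatever is missing (empty if all is fine)."""
    missing = []
    # shutil.which searches the directories on PATH for an executable.
    if shutil.which(driver_name) is None:
        missing.append(f"{driver_name} not found on PATH")
    # find_spec returns None when a top-level package is not installed.
    if importlib.util.find_spec(package) is None:
        missing.append(f"{package} not installed (pip install {package})")
    return missing


for problem in missing_prerequisites():
    print(problem)
```

An empty result means both the driver and the package were found.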
