Skip to content

xdarkpandax/amzreviewsscrape

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Scrape Amazon Review Pages

Amazon has a system in place to keep you from scraping their pages. What this Python app does is open individual browser windows and scrapes the content on the page displayed using the Selenium WebDriver for Chrome. After each scrape, the opened Chrome window is closed.

This allows you to feed a list of Amazon ASINs in as a .csv (no header) and scrape the number of reviews received and the number of stars as well.

I have recently added the ability to retrieve review text and the number of stars for that rating.

Usage

Just pass the path to your csv of ASINs (no header) as a command line argument as such

py amzreviewscrape.py -asins="C:\PATH\TO\ASINS\FILE.CSV"

To run:

py amzreviewscrape.py

Dependencies:

This uses Beautiful Soup and the Selenium Web Driver for Google Chrome which can be found here, which you will need to install separately and point

OS Note

This works on Windows and MacOSx, however take note of the path to the selenium web driver in the driver_path variable.

The CSV Output currently looks like:

output

About

Scrape Amazon Product Reviews using Python and the Selenium WebDriver for Chrome

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%