Sample Scrapy code that scrapes LibriVox, an audiobook website.
-
Clone this repository.
-
Install the requirements
pip install -r requirements.txt
-
Install mp3val
-
For Linux (Debian/Ubuntu):
sudo apt-get install mp3val
-
For Windows, download it from http://mp3val.sourceforge.net/
-
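
Before running the crawler, it may help to confirm that mp3val is actually reachable from the command line. The following is a minimal sketch using only the Python standard library; the helper name `mp3val_available` is hypothetical and not part of this repository, and it assumes the executable is named `mp3val` on your system.

```python
import shutil


def mp3val_available(executable="mp3val"):
    """Return True if an executable with the given name is found on PATH.

    Hypothetical helper for illustration; the executable name is an assumption.
    """
    return shutil.which(executable) is not None


if __name__ == "__main__":
    if mp3val_available():
        print("mp3val found on PATH")
    else:
        print("mp3val not found on PATH; install it first")
```

On Windows, this only succeeds if the folder containing the mp3val executable has been added to PATH.
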
Go to the project root and run the crawler
scrapy crawl librivox --logfile librivox.log
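
If you would rather start the crawl from a Python script, the same command can be wrapped with `subprocess`. This is only a sketch under assumptions: the function names `build_crawl_command` and `run_crawl` are hypothetical, and it relies on the `scrapy` command being on PATH after the requirements are installed.

```python
import subprocess


def build_crawl_command(spider="librivox", logfile="librivox.log"):
    """Build the argument list for the crawl command shown in this README."""
    return ["scrapy", "crawl", spider, "--logfile", logfile]


def run_crawl(project_root="."):
    """Run the crawler from project_root and return the process exit code.

    Hypothetical wrapper; assumes project_root is the directory that
    contains the Scrapy project (where scrapy.cfg normally lives).
    """
    completed = subprocess.run(build_crawl_command(), cwd=project_root)
    return completed.returncode
```

Calling `run_crawl()` from the project root should behave like the command above and write the log to `librivox.log`.
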