Scrape an Edline site and upload our content to Google Drive

ksdtech/edline-scraper

edline-scraper

Crawl our Edline school site, download HTML, PDF, and image files, and upload them to Google Drive. Built with Scrapy.

This is the first part of a pipeline that will use the gdrive-static-site project.

Invoke the crawler with:

 scrapy crawl gdrive -o items.json -a max_requests=1000
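
In the command above, `gdrive` is the spider name, `-o items.json` writes the scraped items to a JSON feed file, and `-a max_requests=1000` passes a spider argument. Scrapy hands `-a` values to the spider as strings, so the spider has to convert them itself. How this project's spider applies `max_requests` is not documented here; the sketch below shows one plausible way to enforce such a cap. The `RequestBudget` class and its `allow` method are hypothetical names and are not taken from this repository.

```python
class RequestBudget:
    """Hypothetical helper: tracks how many requests a crawl may still issue."""

    def __init__(self, max_requests=None):
        # Spider arguments given with -a arrive as strings, so convert explicitly.
        # None means "no limit".
        self.limit = int(max_requests) if max_requests is not None else None
        self.used = 0

    def allow(self):
        """Return True and count the request if budget remains, else False."""
        if self.limit is not None and self.used >= self.limit:
            return False
        self.used += 1
        return True


# Simulate a crawl that wants to issue 1200 requests under a cap of 1000.
budget = RequestBudget(max_requests="1000")
allowed = sum(1 for _ in range(1200) if budget.allow())
print(allowed)
```

A spider could call `allow()` before yielding each new request and stop following links once it returns `False`, which keeps test runs against the Edline site short.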
