scraper + pyramid web app that pulls docs off websites and produces stats and notifications about them

citruspi/BarkingOwl

 
 

BarkingOwl

Message Bus Based Web Scraping Infrastructure

#### Background and Description ####

Barking Owl came out of a need raised at a Hacks and Hackers Rochester (#hhroc) meet-up in Syracuse, NY. A journalist wanted a tool to help him search for keywords within PDFs posted to town websites, such as meeting minutes.

#### Proposed Solution ####

I wanted to make the code for this project as reusable as possible, since it had several parallels to other work I had been doing and wanted to do in the future. The result is an architecture designed for significant scalability and extensibility. The proposed solution is outlined in a five-part design series:

Part I

Part II

Part III

Part IV

Part V

#### Documentation ####

Coming soon? yes ... coming soon.

Languages

  • Python 84.2%
  • JavaScript 15.8%