dnephin/Threaded-Crawler
Threaded Crawler

This web crawler is designed to be generic and highly configurable. It can quickly traverse sites and pull content based on regex and other selection criteria.

__Requirements__

Uses BeautifulSoup to parse HTML pages (http://www.crummy.com/software/BeautifulSoup/)
Uses epydoc for documentation
Uses JobSite common package
python-psycopg2 2.0.8

__Development__

The 'cmd' script can be used to clean and build docs. Documentation is in doc/API.

__INSTALL__

python setup.py install

__Running__

The $COMMON environment variable should be set to the path for the common/patterns.py lib, or the lib should be installed on the default Python path.
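
__Example: regex-based link selection (illustrative sketch)__

The description above mentions pulling content based on regex criteria. The following is a minimal sketch of that idea and is not taken from this project's code: it uses Python 3's standard library `html.parser` instead of BeautifulSoup so that it runs without extra dependencies, and the names `LinkCollector` and `select_links` are hypothetical.

```python
import re
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Hypothetical helper: collects the href of every <a> tag it parses."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the current tag.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def select_links(html, pattern):
    """Return the hrefs found in `html` that match the regex `pattern`."""
    collector = LinkCollector()
    collector.feed(html)
    regex = re.compile(pattern)
    # Keep only the links matching the selection criterion, in page order.
    return [link for link in collector.links if regex.search(link)]
```

A crawler built along these lines would call something like `select_links(page_html, r"^/jobs/\d+$")` on each fetched page to decide which links to follow next. How the actual project configures its selection rules is documented in doc/API.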