Skip to content

iakinsey/illume

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

59 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Illume

Build Status codecov

A comprehensive distributed web crawler framework

Illume is a web crawler application framework written in Python and asyncio. It is designed to allow a developer to create a web crawler that can run from anything between a single thread to a distributed cluster with minimal hassle.

Setting up a developer environment

Run the following set of commands:

virtualenv -p `which python3.6` env
source env/bin/activate
python setup.py install

Running tests

  1. Complete Setting up a developer environment
  2. Run the following command:
python setup.py test

Building documentation

  1. Complete Setting up a developer environment
  2. Run the following set of commands:
sphinx-apidoc -o docs/ illume/
cd docs
make html

Running a produciton crawler

See tests/test_fetcher_filter_analyzer_integration.py a basic example.

About

Comprehensive distributed web crawler framework

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published