Skip to content

JallyHe/arachnado

 
 

Repository files navigation

Arachnado

Arachnado is a tool to crawl a specific website. It provides a Tornado-based HTTP API and a web UI for a Scrapy-based crawler.

License is MIT.

Install

Python 2.7 is required to run server. To build static assets node.js + npm are required.

Install all Python requirements from requirements.txt using pip:

pip install -U -r requirements.txt

Install all JavaScript requirements using npm. Run the following command from the repo root:

npm install

then rebuild static files (we use Webpack):

npm run build

or auto-build static files on each change during development:

npm run watch

Run

To run Arachnado execute the following command from the repo root:

python -m arachnado

Then visit http://127.0.0.1:8888/

About

Web Crawling UI and HTTP API, based on Scrapy and Tornado

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 57.3%
  • JavaScript 39.3%
  • HTML 3.3%
  • CSS 0.1%