Skip to content
This repository has been archived by the owner on Mar 18, 2018. It is now read-only.

danielfrg/django_crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

django-crawler

A django blog crawler that uses readability to extract the article form different site.

Uses celery and Django celery to queue tasks.

How to run

The utils queue can be distributed python manage.py celery worker -l info -E -B -Q utils -c 1

The crawl queue needs to be only one for readability API restrictions python manage.py celery worker -l info -E -B -Q crawl -c 1 --hostname=crawler

See and records the celery events python manage.py celerycam

Run django python manage.py runserver_plus

About

A django blog crawler

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages