liushuigs/cavat

# cavat, a general spider based on Scrapy

set up crawler

make sure you are in the PROJECT/crawler folder before running any of the commands below

install dependencies

set up virtualenv

virtualenv env

source env/bin/activate

on Ubuntu, create the virtualenv with access to the system site packages instead

virtualenv --system-site-packages env

source env/bin/activate

install dependencies

pip install -r requirements.txt

init database

copy alembic.ini.example to alembic.ini, then copy configs/dev.config.example.ini to configs/dev.config.ini

change sqlalchemy.url to point to your own database

initialize the database schema

alembic upgrade head
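
Before running the migration, it can help to confirm that alembic.ini no longer carries a template URL. The following is a minimal Python sketch, assuming the usual Alembic layout where sqlalchemy.url sits in the [alembic] section. The helper names, the `driver://` placeholder check, and the sample sqlite URL are illustrative assumptions and are not part of cavat.

```python
import configparser


def read_sqlalchemy_url(ini_text: str) -> str:
    """Return the sqlalchemy.url value from the [alembic] section of an ini file."""
    # interpolation=None because alembic.ini usually contains '%' format
    # strings (for logging), which the default interpolation would reject.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(ini_text)
    return parser.get("alembic", "sqlalchemy.url")


def is_placeholder_url(url: str) -> bool:
    """Guess whether the URL is still an unedited template value.

    Assumption: an untouched template uses the generic 'driver://' scheme
    or is empty. Adjust this check to whatever alembic.ini.example contains.
    """
    return url.strip() == "" or url.startswith("driver://")


# Hypothetical file contents, used here instead of reading alembic.ini from disk.
sample_ini = """
[alembic]
script_location = alembic
sqlalchemy.url = sqlite:///cavat.db
"""

url = read_sqlalchemy_url(sample_ini)
if is_placeholder_url(url):
    print("sqlalchemy.url still looks like a template value, edit alembic.ini first")
else:
    print("sqlalchemy.url is set")
```

To check the real file, pass the contents of alembic.ini (for example via `pathlib.Path("alembic.ini").read_text()`) to `read_sqlalchemy_url` from inside the PROJECT/crawler folder.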

run crawler

scrapy crawl 36kr
