Skip to content

wirror800/crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

crawler(Python+Scrapy+Redis)

基于scrapy的网页爬虫

说明

本项目基于scrapy实现多线程爬取网页内容。

依赖环境

Python2.7

安装其他依赖及工具

    $ pip install cffi
    $ pip install libffi-dev
    $ pip install cryptography
    $ pip install mysql-python
    $ pip install service_identity
    $ pip install pypinyin
    $ pip install redis

使用方法

$ scrapy crawl flat

About

基于scrapy的网页爬虫

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages