crawler(Python+Scrapy+Redis)

基于scrapy的网页爬虫

说明

本项目基于scrapy实现多线程爬取网页内容。

依赖环境

Python2.7

安装其他依赖及工具

    $ pip install cffi
    $ pip install libffi-dev
    $ pip install cryptography
    $ pip install mysql-python
    $ pip install service_identity
    $ pip install pypinyin
    $ pip install redis

使用方法

$ scrapy crawl flat

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
driving		driving
tools		tools
README-scrapy.md		README-scrapy.md
README.md		README.md
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

driving

driving

tools

tools

README-scrapy.md

README-scrapy.md

README.md

README.md

scrapy.cfg

scrapy.cfg

Repository files navigation

crawler(Python+Scrapy+Redis)

说明

依赖环境

安装其他依赖及工具

使用方法

About

Releases

Packages

Languages

wirror800/crawler

Folders and files

Latest commit

History

Repository files navigation

crawler(Python+Scrapy+Redis)

说明

依赖环境

安装其他依赖及工具

使用方法

About

Resources

Stars

Watchers

Forks

Languages