Skip to content

niyoufa/spider

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

基于scrapy框架和pandas框架开发爬虫与数据分析系统

scrapy框架参考文档 : 
http://scrapy-chs.readthedocs.org/zh_CN/latest/index.html

xpath参考文档 : 
https://www.w3.org/TR/xpath/(英文文档)
http://www.runoob.com/xpath/xpath-tutorial.html

numpy , pandas , matplotlib , IPython , Scipy 
pandas参考文档 : 
http://blog.csdn.net/shandianke/article/details/41525203
http://pandas.pydata.org/pandas-docs/stable/dsintro.html#dsintro



1. 安装 
pip install Scrapy 

2. 新建项目
scrapy startproject projectname

3. 导出数据
scrapy crawl spidername -o scraped_data.json

About

基于python框架scrapy的数据收集

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages