GitHub - GavinHan/sina_weibo_crawler: A multithreaded Micro-blog crawler

Sina Weibo Crawler

The program is based on yaml/pymongo, which is python library . Environment: Python 2.7.5 [GCC 4.8.1] on linux (Ubuntu 12.04)

Notes:

The crawler could crawling weibo/follows/fans/info. I will be add comment/at/retweet in the future.
Enter the conf folder and open conf.yaml, sign in your username password,you can add multiple accounts. Then you should write in the startuid.
You can choose the form of storage file or mongodb, if you choose the file storage you should write the position in conf.yaml then just run command "python graph.py" in shell.

Enjoy~ :-)

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
conf		conf
crawler		crawler
proxy		proxy
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
main.py		main.py

Provide feedback