GitHub - luoyu1111/Yandere-crawler: yande.re图片爬虫

Yande.re图片爬虫

前言

两个月没刷Y站积了2w个post，这时候当然应该找个机器替我刷啦

感谢@mokeyjay的爬虫项目节省了很多时间。

(合并分支覆盖掉了原项目的Readme, 想办法补救中)

本项目基于Win7, Python3.5.2开发，在Win10, Python3.6.7与Ubuntu16.04, Python3.5.2运行成功，其他环境不作考虑。

功能

支持从指定的开始页码爬取到结束页码
也支持从第一页爬取到上一次开始爬取的位置
支持设置爬取的图片类型（全部、横图、竖图、正方形）
支持最大或最小图片尺寸、宽高比限制
支持限制爬取的图片体积
按照当天的日期创建目录并存放爬取的图片
爬取结束后会在图片目录下生成日志文件
支持tag搜索与排除
(可选)GUI

如何使用

可选

编辑config.json中folder_path参数，设为自己想要的目录，如文件夹不存在将会自动创建。路径必须以斜杠结尾。剩下的参数可以运行后根据提示修改。

Windows下命令行执行python index.py即可，Linux下可直接执行。

注意事项

每次运行后config.json中last_stop_id参数会被自动修改为爬取到的第一张图片的ID，便于下一次爬取时只爬取新post，无论停止条件为ID或是页码。

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
Function.py		Function.py
GUI.py		GUI.py
Http.py		Http.py
Log.py		Log.py
README.md		README.md
Yandere.py		Yandere.py
config.json		config.json
index.py		index.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Function.py

Function.py

GUI.py

GUI.py

Http.py

Http.py

Log.py

Log.py

README.md

README.md

Yandere.py

Yandere.py

config.json

config.json

index.py

index.py

Repository files navigation

Yande.re图片爬虫

前言

功能

如何使用

注意事项

更新日志

1.0

未来计划

About

Releases

Packages

Languages

luoyu1111/Yandere-crawler

Folders and files

Latest commit

History

Repository files navigation

Yande.re图片爬虫

前言

功能

如何使用

注意事项

更新日志

1.0

未来计划

About

Resources

Stars

Watchers

Forks

Languages