

Scrapy-Pipelines

Overview

Badges: CII Best Practices · pylint score · Travis branch · coverage report · codebeat · Codacy grade · updates · known vulnerabilities · Code style: black · License: AGPL v3

Since Scrapy doesn't provide enough pipeline examples for different backends and databases, this repository provides several to demonstrate decent usage, including:

  • MongoDB
  • Redis (todo)
  • InfluxDB (todo)
  • LevelDB (todo)

These pipelines also provide multiple ways to save or update items, and they return the IDs created by the backends, as sketched in the configuration example below.
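
For illustration, enabling one of these pipelines in a Scrapy project might look like the following settings.py sketch. The class path and setting names used here (scrapy_pipelines.mongo.MongoPipeline, MONGO_URI, MONGO_DATABASE) are placeholders rather than the package's confirmed API; see the documentation linked below for the actual names.

    # settings.py: a minimal sketch. The class path and setting names are
    # illustrative placeholders; check the documentation for the real ones.
    ITEM_PIPELINES = {
        "scrapy_pipelines.mongo.MongoPipeline": 300,  # hypothetical class path
    }

    # Hypothetical connection settings for the MongoDB backend.
    MONGO_URI = "mongodb://localhost:27017"
    MONGO_DATABASE = "scrapy_items"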

Requirements

  • Python 3.6+
  • Works on Linux, Windows, macOS

Installation


The quick way:

pip install scrapy-pipelines

For more details see the installation section in the documentation: https://scrapy-pipelines.readthedocs.io/en/latest/intro/installation.html

Documentation

Documentation is available online at https://scrapy-pipelines.readthedocs.io/en/latest/ and in the docs directory.

Community (blog, Twitter, mailing list, IRC)

This section is kept the same as Scrapy's, with the intention of benefiting Scrapy in return.

See https://scrapy.org/community/

Contributing

This section is kept the same as Scrapy's to make it easier if this repository is merged back into Scrapy.

See https://doc.scrapy.org/en/master/contributing.html

Code of Conduct

Please note that this project is released with a Contributor Code of Conduct (see https://github.com/scrapy/scrapy/blob/master/CODE_OF_CONDUCT.md).

By participating in this project you agree to abide by its terms. Please report unacceptable behavior to opensource@scrapinghub.com.

Companies using Scrapy

This section is kept the same as Scrapy's, with the intention of benefiting Scrapy in return.

See https://scrapy.org/companies/

Commercial Support

This section is kept the same as Scrapy's, with the intention of benefiting Scrapy in return.

See https://scrapy.org/support/

TODO

  • [X] Add index creation in open_spider() (see the sketch below)
  • [X] Add an item_completed() method
  • [X] Add signals for returning MongoDB document IDs
  • [ ] Add MongoDB document updates
  • [ ] Add Percona Server for MongoDB Docker support
  • [ ] Add Redis support
  • [ ] Add InfluxDB support
  • [ ] Add LevelDB support
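
The sketch below is not this repository's implementation; it is a generic Scrapy item pipeline written directly against pymongo that illustrates the ideas behind the checked items above: creating indexes in open_spider(), saving or updating documents, and passing the backend-generated document ID back out with the item. The setting names and the assumption that items carry a url field and can hold an _id field are purely illustrative.

    # Generic sketch, not the package's actual code: creates an index when the
    # spider opens and attaches the backend-generated _id to each stored item.
    import pymongo


    class SketchMongoPipeline:
        def __init__(self, mongo_uri, mongo_db):
            self.mongo_uri = mongo_uri
            self.mongo_db = mongo_db

        @classmethod
        def from_crawler(cls, crawler):
            # Setting names here are illustrative, not the package's real ones.
            return cls(
                mongo_uri=crawler.settings.get("MONGO_URI", "mongodb://localhost:27017"),
                mongo_db=crawler.settings.get("MONGO_DATABASE", "scrapy_items"),
            )

        def open_spider(self, spider):
            self.client = pymongo.MongoClient(self.mongo_uri)
            self.db = self.client[self.mongo_db]
            # Index creation in open_spider(), as in the first TODO item.
            self.db[spider.name].create_index("url", unique=True)

        def close_spider(self, spider):
            self.client.close()

        def process_item(self, item, spider):
            # Upsert so repeated crawls update existing documents instead of
            # inserting duplicates (assumes each item has a unique "url").
            result = self.db[spider.name].update_one(
                {"url": item.get("url")}, {"$set": dict(item)}, upsert=True
            )
            # Return the ID created by the backend alongside the item; this
            # assumes the item type allows an "_id" field (e.g. a plain dict).
            if result.upserted_id is not None:
                item["_id"] = result.upserted_id
            return item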