Skip to content

ArchiveTeam/patch-grab

Repository files navigation

patch-grab

This is a Seesaw script for grabbing patch.com sites.

The dashboard for this grab is http://quilt.at.ninjawedding.org/patchy.

If you are not familiar with Archive Team grabs: this program repeatedly grabs a user from the tracker, downloads them via HTTP, then uploads the compressed archive to a collection server.

Running in the warrior

Not yet tested. We're working on it. In the meantime, try the directions below.

Running without a warrior

To run this outside the warrior:

(Ubuntu / Debian 7)

sudo apt-get update
sudo apt-get install -y build-essential lua5.1 liblua5.1-0-dev python python-setuptools python-dev git-core openssl libssl-dev python-pip rsync gcc make git screen libxml2-dev libxslt-dev curl
pip install --user seesaw lxml
git clone https://github.com/ArchiveTeam/patch-grab
cd patch-grab
./get-wget-lua.sh

# Start downloading with:
screen ~/.local/bin/run-pipeline --disable-web-server --concurrent 3 pipeline.py YOURNICKNAME

(Debian 6)

sudo apt-get update
sudo apt-get install -y build-essential lua5.1 liblua5.1-0-dev python python-setuptools python-dev git-core openssl libssl-dev python-pip rsync gcc make git screen libxml2-dev libxslt-dev curl
wget --no-check-certificate https://pypi.python.org/packages/source/p/pip/pip-1.3.1.tar.gz tar -xzvf pip-1.3.1.tar.gz
cd pip-1.3.1
python setup.py install --user
cd ..
~/.local/bin/pip install --user seesaw lxml
git clone https://github.com/ArchiveTeam/patch-grab
cd patch-grab
./get-wget-lua.sh

# Start downloading with:
screen ~/.local/bin/run-pipeline --disable-web-server --concurrent 3 pipeline.py YOURNICKNAME

(CentOS / RHEL / Amazon Linux)

sudo yum install lua lua-devel python-devel python-distribute git openssl-devel rsync gcc make screen libxml2-devel libxslt-devel curl
wget --no-check-certificate https://pypi.python.org/packages/source/p/pip/pip-1.3.1.tar.gz
tar -xzvf pip-1.3.1.tar.gz
cd pip-1.3.1
python setup.py install --user
cd ..
~/.local/bin/pip install --user seesaw lxml
git clone https://github.com/ArchiveTeam/patch-grab
cd patch-grab
./get-wget-lua.sh

# Start downloading with:
screen ~/.local/bin/run-pipeline --disable-web-server --concurrent 3 pipeline.py YOURNICKNAME

For more options, run:

~/.local/bin/run-pipeline --help

Releases

No releases published

Packages

No packages published