greader-directory-grab

A Seesaw script that grabs Google Reader's Feed Directory as part of Archive Team's Google Reader grab: http://www.archiveteam.org/index.php?title=Google_Reader

See https://github.com/ArchiveTeam/greader-grab for the software that actually grabs feed content.

The dashboard for this grab is http://tracker-alt.dyn.ludios.net:9292/greader-directory/

The IRC channel for this grab is #donereading on EFnet.

Running without a warrior

To run this outside the warrior:

Ubuntu / Debian 7

(tested on Ubuntu 12.04, Ubuntu 13.04, Debian 7.0.0)

sudo apt-get update
sudo apt-get install -y build-essential lua5.1 liblua5.1-0-dev python python-setuptools python-dev git-core openssl libssl-dev python-pip rsync gcc make git screen
pip install --user seesaw
git clone https://github.com/ArchiveTeam/greader-directory-grab
cd greader-directory-grab
./get-wget-lua.sh

# Start downloading with:
screen ~/.local/bin/run-pipeline --disable-web-server --concurrent 3 pipeline.py YOURNICKNAME
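
If a step above fails partway, it can help to confirm that the command-line tools are actually present before starting the pipeline. The following is a minimal sketch, not part of this repository: `missing_tools` is a hypothetical helper name, and the list of tools to check is only an example drawn from the packages installed above.

```shell
# Hypothetical helper (not part of this repository): print the name of
# every command given as an argument that cannot be found on PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Example call; prints nothing when every tool is available:
# missing_tools git python rsync screen gcc make
```

An empty result means every listed tool was found; any name printed points at a package that still needs installing.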

Debian 6

(tested on Debian 6.0.3)

sudo apt-get update
sudo apt-get install -y build-essential lua5.1 liblua5.1-0-dev python python-setuptools python-dev git-core openssl libssl-dev python-pip rsync gcc make git screen
wget --no-check-certificate https://pypi.python.org/packages/source/p/pip/pip-1.3.1.tar.gz
tar -xzvf pip-1.3.1.tar.gz
cd pip-1.3.1
python setup.py install --user
cd ..
~/.local/bin/pip install --user seesaw
git clone https://github.com/ArchiveTeam/greader-directory-grab
cd greader-directory-grab
./get-wget-lua.sh

# Start downloading with:
screen ~/.local/bin/run-pipeline --disable-web-server --concurrent 3 pipeline.py YOURNICKNAME
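
The commands above call `pip` and `run-pipeline` by their full path because `--user` installs place them in `~/.local/bin`, which may not be on your PATH. As a rough sketch, the check below tests whether a directory is listed in PATH; `dir_on_path` is a hypothetical helper name, not something seesaw or this repository provides.

```shell
# Hypothetical helper (not part of this repository): succeed if directory $1
# appears in the colon-separated list $2 (defaults to the current $PATH).
dir_on_path() {
    case ":${2-$PATH}:" in
        *":$1:"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Example call: add ~/.local/bin to PATH for this shell session if missing.
# dir_on_path "$HOME/.local/bin" || export PATH="$HOME/.local/bin:$PATH"
```

With `~/.local/bin` on PATH, `run-pipeline` can be started without typing the full path. The full-path form shown above works either way.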

CentOS 6 / RHEL 6 / Amazon Linux

sudo yum install lua lua-devel python-devel python-distribute git openssl-devel rsync gcc make screen
wget --no-check-certificate https://pypi.python.org/packages/source/p/pip/pip-1.3.1.tar.gz
tar -xzvf pip-1.3.1.tar.gz
cd pip-1.3.1
python setup.py install --user
cd ..
~/.local/bin/pip install --user seesaw
git clone https://github.com/ArchiveTeam/greader-directory-grab
cd greader-directory-grab
./get-wget-lua.sh

# Start downloading with:
screen ~/.local/bin/run-pipeline --disable-web-server --concurrent 3 pipeline.py YOURNICKNAME
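
Before starting the pipeline, you may want to confirm that `./get-wget-lua.sh` finished successfully. The sketch below assumes the script leaves an executable named `wget-lua` in the repository directory; that file name is an assumption not stated in this README, and `have_wget_lua` is a hypothetical helper name.

```shell
# Hypothetical helper (not part of this repository): succeed if a regular,
# executable file named wget-lua exists in directory $1 (default: current
# directory). The file name wget-lua is an assumption.
have_wget_lua() {
    [ -f "${1:-.}/wget-lua" ] && [ -x "${1:-.}/wget-lua" ]
}

# Example call from inside the greader-directory-grab directory:
# have_wget_lua || echo "wget-lua not found; re-run ./get-wget-lua.sh"
```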

For more options, run:

~/.local/bin/run-pipeline --help