Very experimental demo of recording WARC data to IPFS
Requires redis (for now) and IPFS (of course).
Uses new (experimental) live rec system for writing WARCs
-
pip install -r requirements.txt
-
Start ipfs daemon (if not already running)
ipfs daemon
-
uwsgi uwsgi.ini
-
http://localhost:9080/record/example.com/
to record a url -
http://localhost:9080/replay/example.com/
to replay the recording. If all goes well, the replay will be served from a WARC record stored in IPFS.
WARC records are recorded to a local dir, then uploaded to IPFS and removed.
The index of all WARC records written to IPFS is available (as a text file) under the redis key ipfs:cdxj
and also available as the current IPNS registered name, so ipfs name resolve
will return the hash of the current
index. The index is put into IPFS every 30 seconds, though a real time index is updated in Redis first (and hence the need for redis).
Redis is used for CDXJ (indexing) for recording and replay, but a copy is also written to IPFS.