Skip to content

rambo/QaikuBackup

Repository files navigation

QaikuBackup

Basic idea is to make a simple lib for recusively fetching Qaiku data via the REST API and dump it basically as-is to JSON structure.

Then tools for dumping personal/channel streams using said lib.

After we have a mostly reliable way to get all that data out and to our own safekeeping we can worry about making a tool that renders our backup into something else (HTML, RSS, whatever)

So what ?

AFAIUnderstand Twitter has very similar API, so knock yourself out and adjust the code to make backups of twitter streams.

Status

Dumping channels and user-streams works, resources like attached images are copied to local storage as well. The dumper will attempt to be somewhat nice to the Qaiku server limit the rate and total amount of queries it makes by various techniques (it however uses very naive caching, if you plan on dumping a lot of channels you might want to setup a local real http-proxy configured for very aggressive caching [like ignoring all the no-cache -style directives...])

Most importantly there are no formatters to make anything immediately usefull (like static HTML-pages) of the data, I expect someone else will be quicker to need that. Submit a pull-request if you make a formatting tool and I'll merge it.

Usage

  1. Checkout the repo and change directory there.
  2. Create qaiku_api_key.txt containing your Qaiku API Key on the first line.
  3. Look at fetcherparser.py and test with a single message id from your stream.
  4. Look at userdump.py and channeldump.py if you want to make huge (and time-consuming) dumps of channel/user streams.

Support

None, get your local Python coder to help you (or spend a bit of time learning the basics of the language [my code is not exactly "Pythonic", it's polluted by idioms from other languages I have to work with]).

Sorry but this is a very quick-and-dirty collection of utility functions and even dirtier wrappers to make them into CLI utilities. There is very little regard for end-user usability and so far the only coder I need to consider is me...

Python versions

I have been developing this on OSX with Python2.7 from Homebrew, it probably works on slightly older versions and might even work on Windows (but I give no guarantees, at least I try to use proper cross-platform path handling etc)

Screenscraping

Unfortunately the REST api only provides access to image thumbnails, to get the original images some unsavory techniques are needed and for those BeautifulSoup4.

About

Scripts for dumping Qaiku data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages