Skip to content

simon987/irarchives

 
 

Repository files navigation

irarchives

CodeFactor GitHub

Summary

Reverse image/video search for reddit

demo Realtime dhash demo. The hash (on the right) is stored as a 144-bit integer. We can compute the similarity of two images by counting the number of matching bits of their hashes.

Overview

The repo contains:

  • A script to scrape images from reddit posts and store the data in a database.
  • A web interface for searching the database

Database schema

schema

Requirements

Tested with Python 3.7.2.

Dependencies on Debian: apt install libgmp-dev libmpfr-dev libmpc-dev

This project relies on Architeuthis MITM proxy to respect rate-limits and handle http errors.

Additional C-Language functions for PostgreSQL need to be installed for almost all queries.

The search interface can be configured to use redis for caching (see common.py).

About

Reverse image/video search for reddit

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 63.7%
  • JavaScript 23.0%
  • HTML 8.3%
  • CSS 5.0%