Shazim

Shazam images

Based on Tensorflow with the MobileNet model and ImageHash

What it is?

Shazim retrieve similar images with AI Deep Learning and advanced hash image algorithms

The AI analyse images with MobileNet Tensorflow neural network and image are also analysed with complex linear algebra

A signature (hashcode) is computed for each image

Shazim compares images signature to compute the distance between 2 images

It's a POC only

Big thanks to Johannes Buchner for the ImageHash library and Google for TensorFlow

Requirements

Based on Tensorflow, Pillow, Numpy, Scipy, ImageHash,

Easy installation, tested with Python 3.7 & Tensorflow 2.3

pip install -r requirements.txt
python shazim.py ski.jpg

Basic usage

shazim.py ski.jpg

Found 4 image(s)
images\ski.jpg at 100%
images\ski_copy.jpg at 100%
images\ski2.jpg at 84%
images\000000037689.jpg at 75%

Original: Copy 100%: Flip 84%: Other 75%:

Detect modified image from originals

shazim.py lenna.jpg

Found 3 image(s)
images\lenna.png at 100%
images\lenna-crop.jpg at 93%
images\lenna1.jpg at 72%

Original: Same 100%: Cropped image 93%: Text inserted 72%:

Avanced usage

Image directory must be parsed for training before Shazim

shazim.py -p images

Parse images
Found 15 images in 0.0 s
Hashing
Hash 1/15 in 0.0 s
Load TensorFlow model:
Hash 11/15 in 3.2 s
Hashed in 3.3 s
Save db.pickle

Verbose detection

Thresold = 50% instead of 75%:

# dah: AverageHash score (generate a 8x8 matrix with average points and compute the hamming distance)
# ddh: Difference Hash (generate 8x8 matrix with Discret Cosine Transform from gradients and compute the hamming distance)
# dfv: Feature Vector Hash (generate a 1792 vector from the output of the convulational parts of the MobileNet network and compute the cosine distance)
# Wavelets hash are not used beacause it's too slow and perceptual hash are not used because difference hash is better
# VGG*, Inception*, ResNet* are not used because the training is too slow

shazim.py -v ski.jpg

Parse images
Found 10 image(s)
images\ski.jpg at 100%
{'dah': 1.0, 'ddh': 1.0, 'dfv': 1.0}
images\ski_copy.jpg at 100%
{'dah': 1.0, 'ddh': 1.0, 'dfv': 1.0}
images\ski2.jpg at 84%
{'dah': 0.953, 'ddh': 0.719, 'dfv': 0.836}
images\000000037689.jpg at 75%
{'dah': 0.875, 'ddh': 0.578, 'dfv': 0.782}
images\000000038118.jpg at 73%
{'dah': 0.844, 'ddh': 0.609, 'dfv': 0.73}
images\ski3.jpg at 71%
{'dah': 0.828, 'ddh': 0.5, 'dfv': 0.754}
images\cat.10994.jpg at 57%
{'dah': 0.531, 'ddh': 0.422, 'dfv': 0.661}
images\cat.11016.jpg at 57%
{'dah': 0.625, 'ddh': 0.562, 'dfv': 0.544}
images\00000005.jpg at 55%
{'dah': 0.656, 'ddh': 0.375, 'dfv': 0.591}
images\lenna1.jpg at 54%
{'dah': 0.5, 'ddh': 0.5, 'dfv': 0.582}

API

Training:

from shazim import ShazimEngine

shazim = ShazimEngine()
shazim.parse("images")
shazim.train()
shazim.save()

Predict:

from shazim import ShazimEngine

shazim = ShazimEngine()
shazim.load()
im = shazim.load_image("ski.jpg")
thresold = 0.75
res = shazim.shazim(im, thresold)

Compare two images:

from shazim import ShazimEngine

shazim = ShazimEngine()
im1 = shazim.load_image("ski.jpg")
im2 = shazim.load_image("ski_copy.jpg")
res = im1 - im2

Hash image

from shazim import ShazimEngine

shazim = ShazimEngine()
im = shazim.load_image("ski.jpg")
im.ah #Average hash
im.dh #Difference hash
im.fv #MobileNet hash

Understand Image Hashing vs Deep Learning

Image hashing compute a 8x8 matrix and detect similar image or photoshoped image

For Image Hashing these images are in the same category :

shazim.py images\forest-high.jpg

Shazim...
Found 1 image(s)
images\forest-copyright.jpg at 98%

And these images are differents :

shazim.py images\tumblr1.jpg

Shazim...
Found 0 image(s)

Deep Learning wants to detect that these images are quite similar

Let see the prediction with -v option :

shazim.py images\tumblr1.jpg -v

Found 10 image(s)
images\tumblr2.jpg at 70%
{'dah': 0.609, 'ddh': 0.453, 'dfv': 0.86}
# dh : Difference hash doest not detect anything
# ah : Average hash detect only 61% similarity
# fv : MobileNet detect well at 86%
# Ponderation between hash methodes are weights = [1.0,1.0,2.0]

How to changes weights betweens models :

from shazim import ShazimEngine

shazim = ShazimEngine()
shazim.load()
im = shazim.load_image("ski.jpg")
thresold = 0.75
weights = [0.5,0.5,3.0] #ah, dh, fv
res = shazim.shazim(im, thresold, weights)

How to code this with Deep Learning only

from shazim import ShazimEngine

shazim = ShazimEngine()
im1 = shazim.load_image("tumblr1.jpg")
im2 = shazim.load_image("tumblr2.jpg")
res = im1 - im2
res["dfv"]

# or

weights = [0.0,0.0,1.0] #ah, dh, fv
shazim.predict(im1, im2, weights)

How to code this with Image Hash only

from shazim import ShazimEngine

shazim = ShazimEngine()
im1 = shazim.load_image("tumblr1.jpg")
im2 = shazim.load_image("tumblr2.jpg")
res = im1 - im2
res["dah"] #or res["dah"]

# or

weights = [0.5,0.5,0.0] #ad, dh, fv
shazim.predict(im1, im2, weights)

Sources hosted at GitHub: https://github.com/CyrilVincent/shazim

http://www.CyrilVincent.com

How it works

For the deep learning part I use a pre-trained Deep Learning model call MobileNet because it's the speedest. I do a transfert learning and I take only the convolutional network with a flatten layer (without the MLP). Tensorflow Hub simplify all of the with a pre-trained and configured model: Feature Vector v4. I do a inference and take the flatten layer result, I obtain a vector of 1792 double : it's the hash. I compute the cosine distance between two hash to determine the dfv score. I do not use VGG16 or Inception or ResNet because they are to slow.

For the linear algebra part I use ImageHash to hash images. To compute ah I use average_hash. Average hash is very simple and quick: it reduce the image to 8x8 with average points. To compute dh I use difference_hash. Dh is like perceptual_hash which reduce the image to 8x8 in spectral domain with a Discret Cosine Transform (DCT) but with gradients. I use only dh because is more effective the ph. I compute the hamming distance between two hashes to determine dah and ddh score. I do not use wavelet_hash because it to slow.

Then I ponderates each scores by the weights [1.0,1.0,2.0] repectively for dah, ddh and dfv. Then I ponderates each scores by the weights [1.0,1.0,2.0] repectively for dah, ddh and dfv. The default thresold is 0.75

I tried to implements Google Delf model but it's very to slow

Links

https://pypi.org/project/ImageHash/

http://www.hackerfactor.com/blog/index.php?/archives/432-Looks-Like-It.html

http://www.hackerfactor.com/blog/index.php?/archives/529-Kind-of-Like-That.html

https://towardsdatascience.com/image-similarity-detection-in-action-with-tensorflow-2-0-b8d9a78b2509

https://www.tensorflow.org/hub/common_signatures/image

https://tfhub.dev/google/tf2-preview/mobilenet_v2/feature_vector/4

https://www.tensorflow.org/hub/tutorials/tf_hub_delf_module

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
hubmodule		hubmodule
images		images
.gitignore		.gitignore
README.md		README.md
cyrilload.py		cyrilload.py
db.pickle		db.pickle
delfproto.py		delfproto.py
fvproto.py		fvproto.py
lenna.png		lenna.png
requirements.txt		requirements.txt
shazim.py		shazim.py
ski.jpg		ski.jpg

cyrilvincent/shazim

Folders and files

Latest commit

History

Repository files navigation

Shazim

What it is?

Requirements

Basic usage

Avanced usage

Verbose detection

API

How it works

Links

About

Resources

Stars

Watchers

Forks

Languages