[Update]

DeepLogo2 with DETR is available

DeepLogo

A brand logo detection system using Tensorflow Object Detection API.

Description

Tensorflow Object Detection API is the easy to use framework for creating a custom deep learning model that solves object detection problems.

If you already have your own dataset, you can simply create a custom model with sufficient accuracy using a collection of detection models pre-trained on COCO, KITTI, and OpenImages dataset.

DeepLogo provides training and evaluation environments of Tensorflow Object Detection API for creating a brand logo detection model.

Detection Results

These are some detection results by DeepLogo.

Training DeepLogo

DeepLogo uses SSD as a backbone network and fine-tunes pre-trained SSD released in the tensorflow/models repository.

In order to use that pre-trained model, setting up the tensorflow/models repository first.

Clone the tensorflow/models repository and download the pre-trained model from model zoo.

$ git clone https://github.com/tensorflow/models.git
$ cd models/research/object_detection
$ wget http://download.tensorflow.org/models/object_detection/ssd_inception_v2_coco_2018_01_28.tar.gz
$ tar zxvf ssd_inception_v2_coco_2018_01_28.tar.gz

Tensorflow Object Detection API depends on many other libraries. For detailed steps to setup, please follow the official installation instruction.

Clone the DeepLogo repository.

$ git clone https://github.com/satojkovic/DeepLogo.git

Download the flickr logos 27 dataset from here.

The flickr logos 27 dataset contains 27 classes of brand logo images downloaded from Flickr. The brands included in the dataset are: Adidas, Apple, BMW, Citroen, Coca Cola, DHL, Fedex, Ferrari, Ford, Google, Heineken, HP, McDonalds, Mini, Nbc, Nike, Pepsi, Porsche, Puma, Red Bull, Sprite, Starbucks, Intel, Texaco, Unisef, Vodafone and Yahoo.
```
$ cd DeepLogo
$ wget http://image.ntua.gr/iva/datasets/flickr_logos/flickr_logos_27_dataset.tar.gz
$ tar zxvf flickr_logos_27_dataset.tar.gz
$ cd flickr_logos_27_dataset
$ tar zxvf flickr_logos_27_dataset_images.tar.gz
$ cd ../
```
Preprocess

The flickr logos 27 dataset contains an annotation file for training. This file includes not valid annotations such as an empty size bounding box. Therefore these annotations are removed in this preprocess step, then class names are converted into class numbers and generate two preprocessed files. These two files are used to generate tfrecord files.
```
$ cd DeepLogo
$ python preproc_annot.py
```

Generate tfrecord files.

The Tensorflow Object Detection API expects data to be in the TFRecord format. Run the following command to convert from preprocessed files into TFRecords.

$ python gen_tfrecord.py --train_or_test train --csv_input flickr_logos_27_dataset/flickr_logos_27_dataset_training_set_annotation_cropped.txt --img_dir flickr_logos_27_dataset/flickr_logos_27_dataset_images --output_path train.tfrecord
$ python gen_tfrecord.py --train_or_test test --csv_input flickr_logos_27_dataset/flickr_logos_27_dataset_test_set_annotation_cropped.txt --img_dir flickr_logos_27_dataset/flickr_logos_27_dataset_images --output_path test.tfrecord

Training

The Tensorflow Object Detection API has a python script for training called train.py. This script needs two arguments --pipeline_config_path and --train_dir. (see below)

DeepLogo assumes that the current directory is under the DeepLogo directory and also the path of pre-trained SSD and tfrecord is the relative path from DeepLogo (these paths are written in ssd_inception_v2.config). Therefore create a symbolic link to the directory of tensorflow/models/research/object_detection/ssd_inception_v2_coco_2018_01_28 first, then run the training script.
```
$ OBJECT_DETECTION_API_DIR={path to tensorflow/models/research/object_detection}
$ ln -s ${OBJECT_DETECTION_API_DIR}/ssd_inception_v2_coco_2018_01_28 ssd_inception_v2_coco_2018_01_28
$ python ${OBJECT_DETECTION_API_DIR}/legacy/train.py --logtostderr --pipeline_config_path=ssd_inception_v2_coco.config --train_dir=training
```
Note: DeepLogo doesn't work in Tensorflow 2.0. When you try to train DeepLogo, checkout 5ba3c3f5 of tensorflow/models.

Testing

Exporting a trained model for inference

For testing a model, you should export it to a Tensorflow graph proto first.

$ STEPS={the number of steps when the model is saved}
$ python ${OBJECT_DETECTION_API_DIR}/export_inference_graph.py --input_type=image_tensor --pipeline_config_path=ssd_inception_v2_coco.config --trained_checkpoint_prefix=model.ckpt-${STEPS} --output_directory=logos_inference_graph

Alternatively, you can download a trained model from GoogleDrive!

Inference

Run the following command. The results of logo detection are saved in --output_dir.

$ python logo_detection.py --model_name logos_inference_graph/ --label_map flickr_logos_27_label_map.pbtxt --test_annot_text flickr_logos_27_dataset/flickr_logos_27_dataset_test_set_annotation_cropped.txt --test_image_dir flickr_logos_27_dataset/flickr_logos_27_dataset_images --output_dir detect_results

Evaluation

Before evaluating the trained model saved in training directory, edit the num_examples field in training/pipeline.config file.

The num_examples field represents the number of test images which is equal to number of lines present in a flickr_logos_27_dataset_test_set_annotation_cropped.txt file.

$ wc -l flickr_logos_27_dataset/flickr_logos_27_dataset_test_set_annotation_cropped.txt
     438 flickr_logos_27_dataset/flickr_logos_27_dataset_test_set_annotation_cropped.txt

eval_config: {
  num_examples: 438
  # Note: The below line limits the evaluation process to 10 evaluations.
  # Remove the below line to evaluate indefinitely.
  max_evals: 10
}

Then start evaluation process by using eval.py provided within tensorflow/models repository.

$ python ${OBJECT_DETECTION_API_DIR}/legacy/eval.py --logtostderr --checkpoint_dir=training --eval_dir=eval --pipeline_config_path=training/pipeline.config

After a while you will get evaluation results. If you want to check the results visually, open tensorboard in your browser.

$ tensorboard --logdir=eval/

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 179 Commits
detect_results		detect_results
query_set_results		query_set_results
LICENSE.txt		LICENSE.txt
README.md		README.md
config.py		config.py
flickr_logos_27_label_map.pbtxt		flickr_logos_27_label_map.pbtxt
gen_tfrecord.py		gen_tfrecord.py
gen_tfrecord_logos32plus.py		gen_tfrecord_logos32plus.py
logo_detection.py		logo_detection.py
logos32plus_label_map.pbtxt		logos32plus_label_map.pbtxt
preproc_annot.py		preproc_annot.py
ssd_inception_v2_coco.config		ssd_inception_v2_coco.config
ssd_inception_v2_coco_logos32plus.config		ssd_inception_v2_coco_logos32plus.config

License

satojkovic/DeepLogo

Folders and files

Latest commit

History

Repository files navigation

[Update]

DeepLogo2 with DETR is available

DeepLogo

Description

Detection Results

Training DeepLogo

Testing

Evaluation

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages