Skip to content

Soft-PHOC is an intermediate representation of images based on character probability maps.

Notifications You must be signed in to change notification settings

denabazazian/SoftPHOC_TextDescriptor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 

Repository files navigation

SoftPHOC_TextDescriptor

Soft-PHOC is an intermediate representation of images based on character probability maps.

This work has two implementations based on Pytorch and TensorFlow.

The SoftPhoc annotation.For instance, if the transcription is “PINTU”, we show how we can define the annotation of class “P” for the given transcription based on the value at each level of soft-PHOC descriptor.

A Deep Convolutional Neural Network estimating Soft-PHOC descriptors.

Pytorch

The pytorch implementation of SoftPHOC training.

Installation

Find the environmet at: environment.yml

conda install python=3.6 ipython pytorch=0.4 torchvision opencv=3.4.4 tensorboardx mkl=2019 tensorboard tensorflow tqdm scikit-image
  • Required packages:
    • Pytorch 0.4
    • OpenCV 3.4.4
    • mkl 2019
    • tqm
    • scikit-image
    • tensorboardX

train

  • For training ICDAR:
bash train_icdar.sh
  • For training SynthText:
bash train_synthText.sh

TensorFlow

The TensorFlow implementation of Soft-PHOC.

  • Required packages:

    • TensorFlow 1.10
    • OpenCV 3.4.4
    • mkl 2019
    • tqm
    • scikit-image
    • tensorboardX
  • For training:

python fcn_32_train_generator_validation_summary.py
  • Word spotting codes are in word_spotting to extract the query word.

  • For visualizing the character heatmaps the codes are in visualize_hm.

Citation

Please cite this work in your publications if it helps your research:

@article{Bazazian18-softPHOC,
       author = {D.~Bazazian and D.~Karatzas and A.~Bagdanov},
       title = {Soft-PHOC Descriptor for End-to-End Word Spotting in Egocentric Scene Images},
       journal = {EPIC workshop at European Conference on Computer Vision Workshop},
       year = {2018},
       ee = {arxiv.org/pdf/1809.00854.pdf}
}

About

Soft-PHOC is an intermediate representation of images based on character probability maps.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published