Skip to content

an unofficial implementation of FOTS: Fast Oriented Text Spotting with a Unified Network

Notifications You must be signed in to change notification settings

mranzinger/FOTS.pytorch-1

 
 

Repository files navigation

FOTS.pytorch

This is an unofficial implementation of FOTS: Fast Oriented Text Spotting with a Unified Network, which is a unified end-to-end trainable Fast Oriented Text Spotting (FOTS) network for simultaneous detection and recognition, sharing computation and visual information among the two complementary tasks. and i mainly borrows from E2E-MLT, which is an End-to-end text training and recognition network.

Requirements

Compile extension file

  • RoIRotate for roirotate layer, I've written a pytorch automatic layer roirotate in paper compiling:
# optional
source activate conda_env
cd $project_path/rroi_align
sh make.sh      # compile

TEST

first download the pretrained model from baidu,password:ndav. which is trained on ICDAR2015. put the model in weights folder, then can test on some icdar2015 test samples

cd $project_path
python test.py

some examples:

图1 图2
图3 图4
图5 图6

RoIRotate

RoIRotate applies transformation on oriented feature regions to obtain axis-aligned feature maps.use bilinear interpolation to compute the values of the output

图1 图2
图3 图4
图5 图6

Train

download the ICDAR2015 data and the train_list from baidu, password:q1au

# train_list.txt list the train images path
/home/yangna/deepblue/OCR/data/ICDAR2015/icdar-2015-Ch4/img_546.jpg
/home/yangna/deepblue/OCR/data/ICDAR2015/icdar-2015-Ch4/img_277.jpg
/home/yangna/deepblue/OCR/data/ICDAR2015/icdar-2015-Ch4/img_462.jpg
/home/yangna/deepblue/OCR/data/ICDAR2015/icdar-2015-Ch4/img_237.jpg

training:

python train.py -train_list=$path_to/ICDAR2015.txt

Acknowledgments

Code borrows from MichalBusta/E2E-MLT

About

an unofficial implementation of FOTS: Fast Oriented Text Spotting with a Unified Network

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 72.1%
  • Python 26.3%
  • Cuda 1.1%
  • Dockerfile 0.2%
  • Shell 0.2%
  • C 0.1%