Single Shot Text Detector with Regional Attention

Introduction

SSTD is initially described in our ICCV 2017 spotlight paper.

A third-party implementation of SSTD + Focal Loss. Thanks, Ho taek Han

If you find it useful in your research, please consider citing:

@inproceedings{panhe17singleshot,
      Title   = {Single Shot Text Detector with Regional Attention},
      Author  = {He, Pan and Huang, Weilin and He, Tong and Zhu, Qile and Qiao, Yu and Li, Xiaolin},
      Note    = {Proceedings of Internatioanl Conference on Computer Vision (ICCV)},
      Year    = {2017}
      }
@inproceedings{panhe16readText,
      Title   = {Reading Scene Text in Deep Convolutional Sequences},
      Author  = {He, Pan and Huang, Weilin and Qiao, Yu and Loy, Chen Change and Tang, Xiaoou},
      Note    = {Proceedings of AAAI Conference on Artificial Intelligence, (AAAI)},
      Year    = {2016}
      }
@inproceedings{liu16ssd,
      Title   = {{SSD}: Single Shot MultiBox Detector},
      Author  = {Liu, Wei and Anguelov, Dragomir and Erhan, Dumitru and Szegedy, Christian and Reed, Scott and Fu, Cheng-Yang and Berg, Alexander C.},
      Note    = {Proceedings of European Conference on Computer Vision (ECCV)},
      Year    = {2016}
      }

Installation

Get the code. We will call the directory that you cloned Caffe into $CAFFE_ROOT

git clone https://github.com/BestSonny/SSTD.git
cd SSTD

Build the code. Please follow Caffe instruction to install all necessary packages and build it.

# Modify Makefile.config according to your Caffe installation.
cp Makefile.config.example Makefile.config
make -j8
# Make sure to include $CAFFE_ROOT/python to your PYTHONPATH.
make py
make test -j8
# (Optional)
make runtest -j8
# build nms
cd examples/text
make
cd ..

Run the demo code. Download Model google drive, baiduyun and put it in text/model folder

cd examples
sh text/download.sh
mkdir text/result
python text/demo_test.py

Single Shot Text Detector with Regional Attention

Related tags

Overview

Single Shot Text Detector with Regional Attention

Introduction

Installation

Owner

Pan He

Program created with opencv that allows you to automatically count your repetitions on several fitness exercises.

🖺 OCR using tensorflow with attention

A Python wrapper for the tesseract-ocr API

An interactive document scanner built in Python using OpenCV

零样本学习测评基准，中文版

Localization of thoracic abnormalities model based on VinBigData (top 1%)

Text to QR-CODE

This is an API written in python that uses FastAPI. It is a simple API that can detect discord tokens in Images.

EAST for ICPR MTWI 2018 Challenge II (Text detection of network images)

This repo contains several opencv projects done while learning opencv in python.

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

A Vietnamese personal card OCR website built with Django.

a deep learning model for page layout analysis / segmentation.

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

The open source extract transaction infomation by using OCR.

Multi-choice answer sheet correction system using computer vision with opencv & python.

Repositório para registro de estudo da biblioteca opencv (Python)

Page to PAGE Layout Analysis Tool

Python library to extract tabular data from images and scanned PDFs

list all open dataset about ocr.