TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Last update: Dec 12, 2022

Introduction

The code and trained models of:

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection, TIP 2019 [Paper]

Citation

Please cite the related work in your publications if it helps your research:


@article{xu2018textfield,
  title={TextField: Learning A Deep Direction Field for Irregular Scene Text Detection},
  author={Xu, Yongchao and Wang, Yukang and Zhou, Wei and Wang, Yongpan and Yang, Zhibo and Bai, Xiang},
  journal={arXiv preprint arXiv:1812.01393},
  year={2018}
}

Prerequisite

Caffe and SynthText pretrained model [Link]
Datasets: [Total-Text], [ICDAR2015]
OpenCV 3.4.3
MATLAB

Usage

1. Install Caffe

cp Makefile.config.example Makefile.config
# adjust Makefile.config (for example, enable the Python layer)
make all -j16
# make sure $CAFFE_ROOT/python is included in your PYTHONPATH
make pycaffe

Please refer to the Caffe installation guide for the other dependencies.
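
After the build, a quick check that pycaffe is importable can save a failed training run. The snippet below is a minimal sketch, not part of this repository: the helper name `add_pycaffe_to_path` is hypothetical, and it assumes `CAFFE_ROOT` is set in the environment (falling back to the current directory as a placeholder).

```python
import os
import sys


def add_pycaffe_to_path(caffe_root, path_list=None):
    """Prepend <caffe_root>/python to a module search path if it is missing.

    Hypothetical helper; returns the directory that is now on the path.
    """
    if path_list is None:
        path_list = sys.path
    pycaffe_dir = os.path.join(os.path.abspath(caffe_root), "python")
    if pycaffe_dir not in path_list:
        path_list.insert(0, pycaffe_dir)
    return pycaffe_dir


# CAFFE_ROOT is read from the environment; "." is only a placeholder fallback.
pycaffe_dir = add_pycaffe_to_path(os.environ.get("CAFFE_ROOT", "."))

try:
    import caffe  # noqa: F401  (only succeeds after `make pycaffe`)
    print("pycaffe found via", pycaffe_dir)
except ImportError as exc:
    print("pycaffe is not importable yet:", exc)
```

Setting `PYTHONPATH` in the shell has the same effect; the in-process variant is just convenient for a one-off check.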

2. Data and model preparation

# download the datasets and the pretrained model, then:
mkdir data && mv [your_dataset_folder] data/
mkdir models && mv [your_pretrained_model] models/
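
To confirm the files landed where the later steps expect them, something like the following could be run from the repository root. It is a sketch under assumptions: `missing_paths` is a hypothetical helper, and the expected entries are only those named in this README (`data/`, `models/`, and the `synth_iter_800000.caffemodel` checkpoint used in the training example). Dataset subfolder names are not listed here, so they are not checked.

```python
import os


def missing_paths(root, expected):
    """Return the entries of `expected` that do not exist under `root`."""
    return [rel for rel in expected
            if not os.path.exists(os.path.join(root, rel))]


# Assumed layout, taken only from the paths mentioned in this README.
EXPECTED = [
    "data",
    "models",
    os.path.join("models", "synth_iter_800000.caffemodel"),
]

for rel in missing_paths(".", EXPECTED):
    print("missing:", rel)
```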

3. Training scripts

# an example on Total-Text dataset
cd examples/TextField/
python train.py --gpu [your_gpu_id] --dataset total --initmodel ../../models/synth_iter_800000.caffemodel
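
When launching several runs, the placeholders in the command above can be filled in programmatically. The sketch below (Python 3) only assembles the argument list from the three flags shown in this README; `build_train_command` is a hypothetical helper, and GPU id `0` is an assumed example value.

```python
import shlex


def build_train_command(gpu_id, dataset, initmodel):
    """Assemble the train.py invocation using only the flags shown above."""
    return [
        "python", "train.py",
        "--gpu", str(gpu_id),
        "--dataset", dataset,
        "--initmodel", initmodel,
    ]


# Example values: GPU 0 is an assumption; the other two come from the README.
cmd = build_train_command(0, "total",
                          "../../models/synth_iter_800000.caffemodel")
print(" ".join(shlex.quote(part) for part in cmd))
```

The resulting list can be passed to `subprocess.run(cmd, cwd="examples/TextField", check=True)` to start training from the repository root.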

4. Evaluation scripts

# an example on Total-Text dataset
cd evaluation/total/
./eval.sh

Results and Trained Models

Total-Text

Recall	Precision	F-measure	Link
0.816	0.824	0.820	[Google drive]

*lambda=0.50 for post-processing

ICDAR2015

Recall	Precision	F-measure	Link
0.811	0.846	0.828	[Google drive]

*lambda=0.75 for post-processing
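
The F-measure columns can be sanity-checked against the reported recall and precision. The sketch below assumes the F-measure is the standard harmonic mean of precision and recall, F = 2PR / (P + R); the evaluation scripts themselves are not reproduced here.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (recall, precision, reported F-measure) copied from the tables above.
REPORTED = {
    "Total-Text": (0.816, 0.824, 0.820),
    "ICDAR2015": (0.811, 0.846, 0.828),
}

for name, (recall, precision, reported) in REPORTED.items():
    computed = f_measure(precision, recall)
    print(f"{name}: computed {computed:.3f}, reported {reported:.3f}")
```

Both computed values agree with the tables to three decimal places.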

Owner

Yukang Wang
