TextBoxes: A Fast Text Detector with a Single Deep Neural Network https://github.com/MhLiao/TextBoxes 基于SSD改进的文本检测算法，textBoxes_note记录了之前整理的笔记。

Last update: Apr 28, 2022

Related tags

Computer Vision TextBoxes

Overview

TextBoxes: A Fast Text Detector with a Single Deep Neural Network

Introduction

This paper presents an end-to-end trainable fast scene text detector, named TextBoxes, which detects scene text with both high accuracy and efficiency in a single network forward pass, involving no post-process except for a standard nonmaximum suppression. For more details, please refer to our paper.

Citing TextBoxes

Please cite TextBoxes in your publications if it helps your research:

@inproceedings{LiaoSBWL17,
  author    = {Minghui Liao and
               Baoguang Shi and
               Xiang Bai and
               Xinggang Wang and
               Wenyu Liu},
  title     = {TextBoxes: {A} Fast Text Detector with a Single Deep Neural Network},
  booktitle = {AAAI},
  year      = {2017}
}

Installation
Download
Test
Train
Performance

Installation

Get the code. We will call the directory that you cloned Caffe into $CAFFE_ROOT

git clone https://github.com/MhLiao/TextBoxes.git

cd TextBoxes

make -j8

make py

Download

Models trained on ICDAR 2013: Dropbox link BaiduYun link
Fully convolutional reduced (atrous) VGGNet: Dropbox link BaiduYun link
Compiled mex file for evaluation(for multi-scale test evaluation: evaluation_nms.m): Dropbox link BaiduYun link

Test

Download the ICDAR 2013 DataSet
Download the Models trained on ICDAR 2013
Modify the related paths in the "examples/TextBoxes/test_icdar13.py"
run "python examples/test_icdar13.py"
To multi-scale test, you should use "test_icdar13_multi_scale.py" and "evaluation_nms.m"

Train

Train about 50k iterions on Synthetic data which refered in the paper.
Train about 2k iterions on corresponding training data such as ICDAR 2013 and SVT.
For more information, such as learning rate setting, please refer to the paper.

Performance

Using the given test code, you can achieve an F-measure of about 80% on ICDAR 2013 with a single scale.
Using the given multi-scale test code, you can achieve an F-measure of about 85% on ICDAR 2013 with a non-maximum suppression.
More performance information, please refer to the paper and Task1 and Task4 of Challenge2 on the ICDAR 2015 website: http://rrc.cvc.uab.es/?ch=2&com=evaluation

Please let me know if you encounter any issues.

TextBoxes: A Fast Text Detector with a Single Deep Neural Network https://github.com/MhLiao/TextBoxes 基于SSD改进的文本检测算法，textBoxes_note记录了之前整理的笔记。

Related tags

Overview

TextBoxes: A Fast Text Detector with a Single Deep Neural Network

Introduction

Citing TextBoxes

Contents

Installation

Download

Test

Train

Performance

Owner

zhangjing1

An organized collection of tutorials and projects created for aspriring computer vision students.

Code for the paper: Fusformer: A Transformer-based Fusion Approach for Hyperspectral Image Super-resolution

A simple component to display annotated text in Streamlit apps.

Bu uygulamada Python ve Opencv kullanarak bilgisayar kamerasından yüz tespiti yapıyoruz.

👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

OpenCVを用いたカメラキャリブレーションのサンプルです。2021/06/21時点でPython実装のある3種類(通常カメラ向け、魚眼レンズ向け(fisheyeモジュール)、全方位カメラ向け(omnidirモジュール))について用意しています。

Distilling Knowledge via Knowledge Review, CVPR 2021

Document Layout Analysis Projects

A Python wrapper for Google Tesseract

Educational application aimed at automating user-defined workflows for the mobile game, "Granblue Fantasy", using a variety of CV technologies in the backend such as OpenCV, PyAutoGUI and EasyOCR and a frontend coded in Typescript.

A bot that plays TFT using OCR. Keeps track of bench, board, items, and plays the user defined team comp.

Fatigue Driving Detection Based on Dlib

Generates a message from the infamous Jerma Impostor image

Create single line SVG illustrations from your pictures

Autonomous Driving project for Euro Truck Simulator 2

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

A curated list of awesome synthetic data for text location and recognition

ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.

一键翻译各类图片内文字

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)