Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Last update: Jan 07, 2023

Overview

Handwriting Recognition System

This repository is the Tensorflow implementation of the Handwriting Recognition System described in Handwriting Recognition of Historical Documents with Few Labeled Data (please cite the paper if you use this code in your research paper). This code was also used for the baseline system in Fine-tuning Handwriting Recognition systems with Temporal Dropout.

This code is free for academic and research use. For commercial use of the code please contact Edgard Chammas.

To help run the system, sample images from ICDAR2017 Competition on Handwritten Text Recognition on the READ Dataset are added.

Configuration

General configuration can be found in config.py

CNN-specific architecture configuration can be found in cnn.py

Training

python train.py

This will generate a text log file and a Tensorflow summary.

Decoding

python test.py

This will generate, for each image, the line transcription. The output will be written to decoded.txt by default.

python compute_probs.py

This will generate, for each image, the posterior probabilities at each timestep. Files will be stored in Probs by default.

Dependencies

Tensorflow
OpenCV-Python

Citation

Please cite the following paper if you use this code in your research paper:

@inproceedings{chammas2018handwriting,
  title={Handwriting Recognition of Historical Documents with few labeled data},
  author={Chammas, Edgard and Mokbel, Chafic and Likforman-Sulem, Laurence},
  booktitle={2018 13th IAPR International Workshop on Document Analysis Systems (DAS)},
  pages={43--48},
  year={2018},
  organization={IEEE}
}

Acknowledgment

We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research.

Contributions

Feel free to send your pull request or open issues.

Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Related tags

Overview

Handwriting Recognition System

Configuration

Training

Decoding

Dependencies

Citation

Acknowledgment

Contributions

Owner

Edgard Chammas

a micro OCR network with 0.07mb params.

An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

The project is an official implementation of our paper "3D Human Pose Estimation with Spatial and Temporal Transformers".

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

Usando o Amazon Textract como OCR para Extração de Dados no DynamoDB

"Very simple but works well" Computer Vision based ID verification solution provided by LibraX.

Motion detector, Full body detection, Upper body detection, Cat face detection, Smile detection, Face detection (haar cascade), Silverware detection, Face detection (lbp), and Sending email notifications

scene-linear test images

This can be use to convert text in a file to handwritten text.

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Pre-Recognize Library - library with algorithms for improving OCR quality.

a Deep Learning Framework for Text

Reference Code for AAAI-20 paper "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labels"

Some bits of javascript to transcribe scanned pages using PageXML

A bot that extract text from images using the Tesseract OCR.

OCR engine for all the languages

The Open Source Framework for Machine Vision

This repo contains a script that allows us to find range of colors in images using openCV, and then convert them into geo vectors.

The papers published in top-tier AI conferences in recent years.

Document Layout Analysis Projects