A three-stage detection and recognition pipeline for complex meters in the wild

This is the first released system for detecting and reading complex meters in the wild. The system consists of three modules. First, a YOLO-based detector locates and crops the meter region. Second, a spatial transformer module rectifies the position of the meter. Finally, an end-to-end network reads the meter value through pointer/dial prediction and key number learning.
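The data flow between the three stages can be sketched as follows. This is only an illustration of how the stages chain together, not the repository's code: `read_meters`, `MeterReading`, and the three callables `detect`, `rectify`, and `read` are hypothetical names standing in for the detector, the spatial transformer module, and the reading network.

```python
from dataclasses import dataclass
from typing import Any, Callable, List


@dataclass
class MeterReading:
    crop: Any        # meter region cut out by the detector (stage 1)
    rectified: Any   # the same region after rectification (stage 2)
    value: float     # value predicted by the reading network (stage 3)


def read_meters(
    image: Any,
    detect: Callable[[Any], List[Any]],
    rectify: Callable[[Any], Any],
    read: Callable[[Any], float],
) -> List[MeterReading]:
    """Chain the three stages; each stage is passed in as a plain callable."""
    results = []
    for crop in detect(image):          # stage 1: one crop per detected meter
        rectified = rectify(crop)       # stage 2: rectify the meter position
        value = read(rectified)         # stage 3: read the meter value
        results.append(MeterReading(crop, rectified, value))
    return results
```

Passing the stages in as callables keeps the sketch independent of how each model is actually loaded.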

Visualization results

The left column shows the original image, the middle column shows the meter rectification process, and the right column shows the meter reading result.

ToDo List

Installation

Requirements:

Python3 (Python3.7 is recommended)
PyTorch >= 1.0
torchvision from master
numpy
skimage
OpenCV==3.0.x
CUDA >= 9.0 (10.0 is recommended)
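Before running the demo, it may help to confirm that the Python packages listed above are importable. The sketch below assumes the usual import names (`cv2` for OpenCV, `skimage` for scikit-image); `missing_modules` is a hypothetical helper, not part of this repository, and it checks only that each package is present, not its version.

```python
import importlib.util

# Import names for the packages in the requirements list (assumed, see above).
REQUIRED = ["torch", "torchvision", "numpy", "skimage", "cv2"]


def missing_modules(names):
    """Return the module names that cannot be located in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

Calling `missing_modules(REQUIRED)` returns an empty list when every package can be found.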

Models

Download the trained models.

Put distro_net.pt into meter_distro/weight.
Put textgraph_vgg_450.pth into model/meter_data.
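A small check that both weight files are where the scripts expect them might look like this. The two paths come from the instructions above; `missing_weights` is a hypothetical helper, and the sketch assumes it is run with the repository root as `repo_root`.

```python
from pathlib import Path

# Weight locations as given above, relative to the repository root.
EXPECTED_WEIGHTS = [
    Path("meter_distro/weight/distro_net.pt"),
    Path("model/meter_data/textgraph_vgg_450.pth"),
]


def missing_weights(repo_root="."):
    """Return the expected weight files that are not present under repo_root."""
    root = Path(repo_root)
    return [p.as_posix() for p in EXPECTED_WEIGHTS if not (root / p).is_file()]
```

An empty list from `missing_weights()` means both files are in place.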

Demo

You can run the demo on a single image in two steps.

Run python get_meter_area.py. The detected meter will be stored in scene_image_data/deteced_meter.

Run python predict.py to rectify the distorted meter and get the final reading.
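The two demo steps can also be run in sequence from one Python script. This is a minimal sketch that assumes it is started from the repository root and that both scripts take no arguments, as in the commands above; `demo_commands` and `run_demo` are hypothetical names.

```python
import subprocess
import sys


def demo_commands(python=sys.executable):
    """Return the two demo commands in the order they must be run."""
    return [
        [python, "get_meter_area.py"],  # step 1: detect and store the meter crop
        [python, "predict.py"],         # step 2: rectify the meter and read its value
    ]


def run_demo(python=sys.executable):
    """Run both steps; stop with an error if either script fails."""
    for cmd in demo_commands(python):
        subprocess.run(cmd, check=True)
```

With `check=True`, a failure in the detection step raises `subprocess.CalledProcessError`, so `predict.py` is not run on missing input.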


Owner

Yan Shu
