An interactive document scanner built in Python using OpenCV

Last update: Feb 12, 2022

Document Scanner

The scanner takes a poorly scanned image, finds the corners of the document, applies a perspective transformation to get a top-down view, sharpens the image, and applies an adaptive color threshold to clean up the result.
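The steps after corner detection can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names (order_corners, output_size, scan), the unsharp-mask sharpening, and the threshold parameters (block_size, offset) are all assumptions chosen for the example. The corners are assumed to be already known, whether detected automatically or picked by hand.

```python
import math


def order_corners(pts):
    """Return four (x, y) points ordered top-left, top-right, bottom-right, bottom-left."""
    # Top-left has the smallest x + y, bottom-right the largest.
    # Top-right has the smallest y - x, bottom-left the largest.
    tl = min(pts, key=lambda p: p[0] + p[1])
    br = max(pts, key=lambda p: p[0] + p[1])
    tr = min(pts, key=lambda p: p[1] - p[0])
    bl = max(pts, key=lambda p: p[1] - p[0])
    return [tl, tr, br, bl]


def output_size(ordered):
    """Width and height of the flattened page, taken from the longer opposing edges."""
    tl, tr, br, bl = ordered
    width = max(math.dist(br, bl), math.dist(tr, tl))
    height = max(math.dist(tr, br), math.dist(tl, bl))
    return int(width), int(height)


def scan(image, corners, block_size=21, offset=15):
    """Warp, sharpen, and threshold a BGR image given its four corner points.

    block_size and offset are illustrative defaults, not values from the project.
    """
    # Imported here so the geometry helpers above work without OpenCV installed.
    import cv2
    import numpy as np

    ordered = order_corners(corners)
    width, height = output_size(ordered)
    src = np.array(ordered, dtype="float32")
    dst = np.array(
        [[0, 0], [width - 1, 0], [width - 1, height - 1], [0, height - 1]],
        dtype="float32",
    )

    # Map the document quadrilateral onto an upright rectangle.
    matrix = cv2.getPerspectiveTransform(src, dst)
    warped = cv2.warpPerspective(image, matrix, (width, height))

    gray = cv2.cvtColor(warped, cv2.COLOR_BGR2GRAY)
    # Unsharp mask (an assumed sharpening method): subtract a blurred copy to boost edges.
    blurred = cv2.GaussianBlur(gray, (0, 0), 3)
    sharpened = cv2.addWeighted(gray, 1.5, blurred, -0.5, 0)
    # Adaptive threshold: each pixel is compared against its local neighborhood.
    return cv2.adaptiveThreshold(
        sharpened, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C,
        cv2.THRESH_BINARY, block_size, offset,
    )
```

The sum and difference trick in order_corners works for documents that are not rotated too far from upright; a heavily rotated page would need a more robust ordering.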

On my test dataset of 280 images, the program correctly detected the corners of the document 92.8% of the time.

This project uses the transform and imutils modules from pyimagesearch. The UI code for the interactive mode is adapted from poly_editor.py.

In interactive mode, you can manually click and drag the corners of the document to be perspective transformed.

The scanner can also process an entire directory of images automatically and save the output in an output directory.

Here are some examples of images before and after scan:

Usage

python scan.py (--images <IMG_DIR> | --image <IMG_PATH>) [-i]

The -i flag enables interactive mode, where you will be prompted to click and drag the corners of the document. For example, to scan a single image with interactive mode enabled:

python scan.py --image sample_images/desk.JPG -i

Alternatively, to scan all images in a directory without any input:

python scan.py --images sample_images
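For readers wondering how a command line of this shape is typically parsed, here is a minimal sketch using argparse from the standard library. It is an assumption about how such an interface could be built, not a copy of scan.py: the build_parser name, the help strings, and the interactive attribute name are all hypothetical.

```python
import argparse


def build_parser():
    """Build a parser for: scan.py (--images DIR | --image PATH) [-i]."""
    parser = argparse.ArgumentParser(prog="scan.py")

    # Exactly one of --images / --image must be given.
    source = parser.add_mutually_exclusive_group(required=True)
    source.add_argument("--images", help="directory of images to scan")
    source.add_argument("--image", help="path to a single image to scan")

    # -i is an optional switch; it is False unless passed.
    parser.add_argument(
        "-i", action="store_true", dest="interactive",
        help="click and drag the document corners before scanning",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```

With this setup, passing both --images and --image, or neither, makes argparse print a usage error and exit, which matches the parenthesized either/or in the usage line.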

Owner

Kushal Shingote
