Solution for Problem 1 by team codesquad for AIDL 2020. Uses ML Kit for OCR and OpenCV for image processing

Last update: Nov 27, 2022

Overview

CodeSquad PS1

Solution for Problem Statement 1 for AIDL 2020 conducted by @unifynd technologies.

Problem

Given images of bills/invoices, the task was to perform the following 3 operations:

Edge detection, cropping, flattening, enhancement of cropped image and compression.
Extracting text from the processed image.
The confidence score for the image to text conversion.

Development

Make sure you have react-native cli & the latest Android SDK installed on your system. To get started with React Native, follow here
To install OpenCV for Android, see here
Clone the github repository and install the dependencies using npm

$ git clone https://github.com/burhanuday/codesquad-PS1
$ cd codesquad-PS1
$ npm install

Move the modified versions of the libraries from the modified_open_source_libs to the node_modules folder. Replace in destination when asked
Run development build (Android SDK and adb tools are required to be installed)

$ npx react-native run-android --no-jetifier
$ npx react-native run-ios

Run the flask server from the flask-server folder

$ python app.py

For Mac

Follow the instructions mentioned on Getting Started on React Native documentation
Download the project zip from here
Edit the sdk.dir statement with the SDK path in the <extracted-folder>/android/local.properties file, for your machine.
If getting this error Could not compile settings file 'android\settings.gradle. First run /usr/libexec/java_home -V which will output something like the following:

Matching Java Virtual Machines (2):
    13.0.1, x86_64:	"Java SE 13.0.1"	/Library/Java/JavaVirtualMachines/jdk-13.0.1.jdk/Contents/Home
    1.8.0_242, x86_64:	"AdoptOpenJDK 8"	/Library/Java/JavaVirtualMachines/adoptopenjdk-8.jdk/Contents/Home

/Library/Java/JavaVirtualMachines/jdk-13.0.1.jdk/Contents/Home

Pick the version you want to be the default (1.8.0_242 the version of AdoptOpenJDK 8) then:

export JAVA_HOME=`/usr/libexec/java_home -v 1.8.0_242`

Run the app with npx react-native run-android --no-jetifier

Screens

Build

Create and then copy a keystore file to android/app

$ keytool -genkey -v -keystore mykeystore.keystore -alias mykeyalias -keyalg RSA -keysize 2048 -validity 10000

Setup your gradle variables in android/gradle.properties

MYAPP_RELEASE_STORE_FILE=mykeystore.keystore
MYAPP_RELEASE_KEY_ALIAS=mykeyalias
MYAPP_RELEASE_STORE_PASSWORD=*****
MYAPP_RELEASE_KEY_PASSWORD=*****

Add signing config to android/app/build.gradle

android {
signingConfigs {
release {
storeFile file(MYAPP_RELEASE_STORE_FILE)
storePassword MYAPP_RELEASE_STORE_PASSWORD
keyAlias MYAPP_RELEASE_KEY_ALIAS
keyPassword MYAPP_RELEASE_KEY_PASSWORD
}
}
buildTypes {
release {
signingConfig signingConfigs.release
}
}
}

Setup your gradle variables in android/gradle.properties

cd android && ./gradlew assembleRelease

Your APK will get generated at: android/app/build/outputs/apk/app-release.apk

Credits

Special thanks to react-native-document-scanner & react-native-perspective-image-cropper

NOTE: We are using heavily modified versions of both these libraries to support our usecase. You can find these modified libraries in the modified_open_source_libs/

Solution for Problem 1 by team codesquad for AIDL 2020. Uses ML Kit for OCR and OpenCV for image processing

Related tags

Overview

CodeSquad PS1

Problem

Development

For Mac

Screens

Build

Credits

Owner

Burhanuddin Udaipurwala

Toolbox for OCR post-correction

Introduction to image processing, most used and popular functions of OpenCV

A python program to block out your face

BNF Globalization Code (CVPR 2016)

Polaris is a Face recognition attendance system .

A simple Security Camera created using Opencv in Python where images gets saved in realtime in your Dropbox account at every 5 seconds

A Joint Video and Image Encoder for End-to-End Retrieval

(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

This is a project to detect gestures to zoom in or out, using the real-time distance between the index finger and the thumb. It's based on OpenCV and Mediapipe.

An easy to use an (hopefully useful) captcha solution for pyTelegramBotAPI

PyNeuro is designed to connect NeuroSky's MindWave EEG device to Python and provide Callback functionality to provide data to your application in real time.

Dataset and Code for ICCV 2021 paper "Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme"

[BMVC'21] Official PyTorch Implementation of Grounded Situation Recognition with Transformers

Repository for Scene Text Detection with Supervised Pyramid Context Network with tensorflow.

Framework for the Complete Gaze Tracking Pipeline

Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Satoshi is a discord bot template in python using discord.py that allow you to track some live crypto prices with your own discord bot.

A simple document layout analysis using Python-OpenCV

[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别

Repository of conference publications and source code for first-/ second-authored papers published at NeurIPS, ICML, and ICLR.