Sign-to-Speech for Sign Language Understanding: A case study of Nigerian Sign Language

Last update: Oct 23, 2022

Overview

This repository contains the code, model, and deployment configs for the paper "Sign-to-Speech for Sign Language Understanding: A case study of Nigerian Sign Language", which appeared at the NeurIPS 2021 workshop on Machine Learning for the Developing World (ML4D).

Dataset

Our dataset is a novel dataset for the Nigerian Sign Language comprising 5000 images of 137 sign words/phrases, including the alphabet letters. The data was collected from 20+ individuals, including a TV sign language broadcaster and students and teachers from 2 special education schools in Nigeria. The dataset is not publicly available for now.

Model configs and code

To run the deployed model

Clone the repository and run pip install -r requirements to install the dependencies.
If you are on a Linux OS, TTS engines might not be pre-installed on your platform. Use the command below to install them.

  sudo apt-get update && sudo apt-get install espeak ffmpeg libespeak1

While in the project directory's root, spin up the DeepStack custom model server by running the command below:

  sudo docker run -v ~/path/to/project_folder/deployed_model:/modelstore/detection -p 88:5000 deepquestai/deepstack
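
Once the container is up, the custom model can be queried over HTTP. The following is a minimal sketch and not the repository's own code. It assumes DeepStack's custom-model endpoint layout (`/v1/vision/custom/<model_name>`), the third-party `requests` package, a hypothetical model name `sign_model` (DeepStack names the endpoint after the model file in the mounted folder), and an illustrative confidence cutoff of 0.4.

```python
def custom_model_url(port=88, model_name="sign_model", host="localhost"):
    """Build the URL of a DeepStack custom-model endpoint.

    `sign_model` is a hypothetical name; DeepStack names the endpoint after
    the model file placed in the mounted /modelstore/detection folder.
    """
    return f"http://{host}:{port}/v1/vision/custom/{model_name}"


def parse_predictions(response_json, min_confidence=0.4):
    """Return (label, confidence, box) tuples from a DeepStack-style response.

    The 0.4 cutoff is an assumption of this sketch, not a value from the paper.
    """
    results = []
    for pred in response_json.get("predictions", []):
        if pred["confidence"] < min_confidence:
            continue  # skip low-confidence detections
        box = (pred["x_min"], pred["y_min"], pred["x_max"], pred["y_max"])
        results.append((pred["label"], pred["confidence"], box))
    return results


def detect_signs(image_path, port=88, model_name="sign_model"):
    """Send an image to the running server and return parsed detections."""
    import requests  # third-party; imported here so the helpers above stay stdlib-only

    with open(image_path, "rb") as image_file:
        response = requests.post(
            custom_model_url(port, model_name),
            files={"image": image_file},
            timeout=30,
        )
    response.raise_for_status()
    return parse_predictions(response.json())
```

With the server running, `detect_signs("image_filename.file_extension")` would return one tuple per detected sign, with the box given as pixel coordinates.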

- Detect sign language meanings in image files and generate realistic speech for the detected words.

Run the image_detection script on the image:

  python image_detection.py image_filename.file_extension

The default port number is 88. If the DeepStack server is running on a different port, specify it like this instead:

  python image_detection.py image_filename.file_extension --deepstack-port port_number
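
The command-line interface described above (one positional image file plus an optional `--deepstack-port` that defaults to 88) could be declared with `argparse` roughly as follows. This is a sketch under assumptions: the function name `build_parser`, the argument name `image`, and the help texts are hypothetical, and the actual script may parse its arguments differently.

```python
import argparse


def build_parser():
    """Parser with one positional image path and an optional server port."""
    parser = argparse.ArgumentParser(
        description="Detect a sign in an image via a DeepStack server."
    )
    parser.add_argument("image", help="path to the input image file")
    parser.add_argument(
        "--deepstack-port",
        type=int,
        default=88,  # matches the host port published by the docker command
        help="port on which the DeepStack server is listening",
    )
    return parser


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["sign.jpg", "--deepstack-port", "5000"])
# argparse exposes --deepstack-port as the attribute deepstack_port
print(args.image, args.deepstack_port)
```

Declaring the port with `type=int` means a non-numeric value is rejected at parse time rather than failing later when the request URL is built.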

Running the above command creates two new files in your project root directory:

a copy of the image with a bounding box around the detected sign and the sign's meaning on top of the box,
an audio file speaking the meaning of the detected sign.
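
As an illustration of how the annotated copy might be produced, the sketch below draws a box and places the label just above it. It is not taken from the repository: it assumes OpenCV (`opencv-python`) for drawing, and the helper names `label_origin` and `annotate`, the text height, and the margin are all hypothetical.

```python
def label_origin(box, text_height=20, margin=5):
    """Bottom-left corner for label text placed just above a bounding box.

    Falls back to just inside the box when there is no room above it.
    """
    x_min, y_min, _x_max, _y_max = box
    y = y_min - margin
    if y - text_height < 0:  # label would be cut off by the top edge
        y = y_min + text_height + margin
    return (x_min, y)


def annotate(image, label, box, color=(0, 255, 0)):
    """Draw the box and its label on an OpenCV image (modified in place)."""
    import cv2  # third-party (opencv-python); an assumption of this sketch

    x_min, y_min, x_max, y_max = box
    cv2.rectangle(image, (x_min, y_min), (x_max, y_max), color, 2)
    cv2.putText(image, label, label_origin(box),
                cv2.FONT_HERSHEY_SIMPLEX, 0.8, color, 2)
    return image
```

The fallback in `label_origin` keeps the meaning readable when a sign is detected at the very top of the frame.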

- Detect sign language meanings on a live video (via webcam).

Run the livefeed_detection script:

  python livefeed_detection.py

The default port number is 88. If the DeepStack server is running on a different port, specify it like this instead:

  python livefeed_detection.py --deepstack-port port_number

This starts the webcam and automatically detects any sign language words in view of the camera, displaying each sign's meaning and playing its speech equivalent immediately through the PC's audio system. Press **q** to quit the live video.

Citation

Coming soon!
