PyTorch implementation for Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Last update: Dec 19, 2022

Related tags

Deep Learning stochastic-cslr

Overview

Stochastic CSLR

This is the PyTorch implementation for the ECCV 2020 paper: Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Quick Start

1. Installation

pip install git+https://github.com/zheniu/stochastic-cslr

Also, you need to install sclite for evaluation. Take a look at step 2 for instructions.

2. Prepare the dataset

Download the RWTH-PHOENIX-2014 dataset here.
Unzip it and obtain the path to phoenix-2014-multisigner/ folder for later use.
Install sclite for evaluation. Check phoenix-2014-multisigner/evaluation/NIST-sclite_sctk-2.4.0-20091110-0958.tar.bz2 for detail.
After installing sclite, put it in your PATH.

3. Run a quick test

You can use the script quick_test.py for a quick test.

python3 quick_test.py --data-root your_path_to/phoenix-2014-multisigner

By specifying the model type --model sfl/dfl, the data split --split dev/test, whether to use a language model--use-lm, you can get the following results:

Model	WER (dev)	sub/del/ins (dev)	WER (test)	sub/del/ins (test)
DFL	27.1	12.7/7.4/7.0	27.7	13.8/7.3/6.6
SFL	26.2	12.7/6.9/6.7	26.6	13.7/6.5/6.4
DFL + LM	25.6	11.5/9.2/4.9	26.4	12.4/9.3/4.7
SFL + LM	24.3	11.4/8.5/4.4	25.3	12.4/8.5/4.3

Note that these results are slightly different from the paper as a different random seed is used.

You may also take a look at quick_test.py as it shows how to use the pretrained models.

4. Train your own model

The configuration files for deterministic and stochastic fine-grained labeling are put under config/. The training script is based on a PyTorch experiment runner torchzq, which automatically reads the hyperparameters in the YAML file and passes them to stochastic_cslr/runner.py.

Before running, change the data_root in the YAML configurations to phoenix-2014-multisigner/ first.

Train (for instance, dfl):

tzq config/dfl-fp16.yml train

Test the trained model

tzq config/dfl-fp16.yml test

Citation

You may cite this work by:

@inproceedings{niu2020stochastic,
  title={Stochastic Fine-Grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition},
  author={Niu, Zhe and Mak, Brian},
  booktitle={European Conference on Computer Vision},
  pages={172--186},
  year={2020},
  organization={Springer}
}

PyTorch implementation for Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Related tags

Overview

Stochastic CSLR

Quick Start

1. Installation

2. Prepare the dataset

3. Run a quick test

4. Train your own model

Train (for instance, dfl):

Test the trained model

Citation

Owner

Zhe Niu

ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs

Image Segmentation Evaluation

Simple, efficient and flexible vision toolbox for mxnet framework.

A community run, 5-day PyTorch Deep Learning Bootcamp

The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

A Moonraker plug-in for real-time compensation of frame thermal expansion

Official Datasets and Implementation from our Paper "Video Class Agnostic Segmentation in Autonomous Driving".

Face recognition project by matching the features extracted using SIFT.

Node Editor Plug for Blender

Code for Multimodal Neural SLAM for Interactive Instruction Following

Picasso: a methods for embedding points in 2D in a way that respects distances while fitting a user-specified shape.

Calculates JMA (Japan Meteorological Agency) seismic intensity (shindo) scale from acceleration data recorded in NumPy array

🏎️ Accelerate training and inference of 🤗 Transformers with easy to use hardware optimization tools

When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings

COPA-SSE contains crowdsourced explanations for the Balanced COPA dataset

Linear algebra python - Number of operations and problems in Linear Algebra and Numerical Linear Algebra

A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset.

Simple embedding based text classifier inspired by fastText, implemented in tensorflow

Contrastively Disentangled Sequential Variational Audoencoder

Perturbed Self-Distillation: Weakly Supervised Large-Scale Point Cloud Semantic Segmentation (ICCV2021)