Relative Uncertainty Learning for Facial Expression Recognition

Last update: Dec 28, 2022

Overview

Relative Uncertainty Learning for Facial Expression Recognition

The official implementation of the following paper at NeurIPS2021:
Title: Relative Uncertainty Learning for Facial Expression Recognition
Authors: Yuhang Zhang, Chengrui Wang, Weihong Deng
Institute: BUPT

Abstract

In facial expression recognition (FER), the uncertainties introduced by inherent noises like ambiguous facial expressions and inconsistent labels raise concerns about the credibility of recognition results. To quantify these uncertainties and achieve good performance under noisy data, we regard uncertainty as a relative concept and propose an innovative uncertainty learning method called Relative Uncertainty Learning (RUL). Rather than assuming Gaussian uncertainty distributions for all datasets, RUL builds an extra branch to learn uncertainty from the relative difficulty of samples by feature mixup. Specifically, we use uncertainties as weights to mix facial features and design an add-up loss to encourage uncertainty learning. It is easy to implement and adds little or no extra computation overhead. Extensive experiments show that RUL outperforms state-of-the-art FER uncertainty learning methods in both real-world and synthetic noisy FER datasets. Besides, RUL also works well on other datasets such as CIFAR and Tiny ImageNet.

Pipeline

Feature Visualization

The feature distribution figure shows that RUL encourages intra-class compactness and inter-class seperability of the learned features. (0:Surprise, 1:Fear, 2:Disgust, 3:Happy, 4:Sad, 5:Angry, 6:Neutral)

Train

Torch

We train RUL with Torch 1.8.0 and torchvision 0.9.0.

Dataset

Download RAF-DB, put it into the dataset folder, and make sure that it has the same structure as bellow:

- dataset/raf-basic/
         EmoLabel/
             list_patition_label.txt
         Image/aligned/
	     train_00001_aligned.jpg
             test_0001_aligned.jpg
             ...

Pretrained backbone model

Download the pretrained ResNet18 from this github repository, and then put it into the pretrained_model directory. We thank the authors for providing their pretrained ResNet model.

Train the RUL model

cd src
python main.py --raf_path '../dataset/raf-basic' --label_path '../dataset/raf-basic/EmoLabel/list_patition_label.txt' --pretrained_backbone_path '../pretrained_model/resnet18_msceleb.pth'

Accuracy

Acknowledgments

Our work is based on the following works, thanks for their code and pretrained model:

https://github.com/kaiwang960112/Self-Cure-Network

https://github.com/Ontheway361/dul-pytorch

https://github.com/amirhfarzaneh/dacl

Relative Uncertainty Learning for Facial Expression Recognition

Related tags

Overview

Relative Uncertainty Learning for Facial Expression Recognition

Abstract

Pipeline

Feature Visualization

Train

Owner

Scalable Multi-Agent Reinforcement Learning

Implementation of FSGNN

Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation

A simple implementation of Kalman filter in single object tracking

DeepFaceLab fork which provides IPython Notebook to use DFL with Google Colab

Optimized code based on M2 for faster image captioning training

MTCNN face detection implementation for TensorFlow, as a PIP package.

A Sign Language detection project using Mediapipe landmark detection and Tensorflow LSTM's

FuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space OptimizationFuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space Optimization

Radar-to-Lidar: Heterogeneous Place Recognition via Joint Learning

Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data.

[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning

Improving Compound Activity Classification via Deep Transfer and Representation Learning

PyTorch implementation of Super SloMo by Jiang et al.

Gesture Volume Control Using OpenCV and MediaPipe

This is a Tensorflow implementation of Learning to See in the Dark in CVPR 2018

FNet Implementation with TensorFlow & PyTorch

Direct application of DALLE-2 to video synthesis, using factored space-time Unet and Transformers

STARCH compuets regional extreme storm physical characteristics and moisture balance based on spatiotemporal precipitation data from reanalysis or climate model data.