Unimodal Face Classification with Multimodal Training

This is a PyTorch implementation of the following paper:

Unimodal Face Classification with Multimodal Training

Wenbin Teng (Boston University), Chongyang Bai (Dartmouth College)

Abstract: We propose a Multimodal Training Unimodal Test (MTUT) framework for robust face classification, which exploits the cross-modality relationship during training and uses it to complement the imperfect single-modality input during testing. Technically, during training, the framework (1) builds both intra-modality and cross-modality autoencoders with the aid of facial attributes to learn latent embeddings as multimodal descriptors, and (2) introduces a novel multimodal embedding divergence loss to align the heterogeneous features from different modalities, which also adaptively prevents a useless modality (if any) from confusing the model. This way, the learned autoencoders can generate robust embeddings for single-modality face classification at test time. We evaluate our framework on two face classification datasets and two kinds of test input: (1) poor-condition images and (2) point clouds or 3D face meshes, when both 2D and 3D modalities are available for training.

The proposed method applies a 2D encoder and a 3D encoder to extract an embedding for each modality. The divergence between the two embeddings is minimized adaptively, guided by the classification loss. Depending on the testing modality, a specific decoder is used to reconstruct the 2D and 3D inputs from the feature embeddings. An overview of the proposed network is shown in the following picture:

[Figure: overview of the proposed network]
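
The adaptive divergence idea described above can be illustrated with a small, framework-free sketch. This is not the paper's loss: the exponential gate, the `beta` parameter, and the function names `adaptive_weight` and `divergence_penalty` are assumptions made for illustration only. The sketch shows one plausible way to pull an embedding toward the other modality's embedding only when that other modality currently classifies better, so a weaker modality does not drag a stronger one down.

```python
import math


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length embeddings."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    return sum((x - y) ** 2 for x, y in zip(a, b))


def adaptive_weight(loss_self, loss_other, beta=2.0):
    """Hypothetical gate (an assumption, not the paper's formula).

    Returns a positive weight only when the other modality has the lower
    classification loss, and zero otherwise.
    """
    gap = loss_self - loss_other
    return math.exp(beta * gap) - 1.0 if gap > 0 else 0.0


def divergence_penalty(emb_self, emb_other, loss_self, loss_other, beta=2.0):
    """Weighted divergence between this modality's embedding and the other's."""
    weight = adaptive_weight(loss_self, loss_other, beta)
    return weight * squared_distance(emb_self, emb_other)


# Illustrative values only: toy 2D and 3D embeddings and classification losses.
emb_2d = [0.2, 0.9, -0.4]
emb_3d = [0.1, 1.0, -0.1]

# The 2D branch classifies worse here, so it is pulled toward the 3D embedding.
penalty_2d = divergence_penalty(emb_2d, emb_3d, loss_self=1.2, loss_other=0.7)

# The 3D branch classifies better, so its penalty is zero.
penalty_3d = divergence_penalty(emb_3d, emb_2d, loss_self=0.7, loss_other=1.2)
```

In an actual PyTorch training loop the same arithmetic would be applied to tensors, and the penalty would be added to the classification and reconstruction losses. How those terms are combined in this repository is not stated in this README.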
