Yoga Pose Identification and Icon Matching

Project Goal

Detect yoga poses performed by a user and overlay a corresponding icon image on the video. Running the main script starts the video stream with automatic pose detection.

Part 1: Pose Detection

I use the 33 body landmarks provided by MediaPipe to measure joint angles, then identify each yoga pose from the key joint angles that characterize it. For example, in the star pose, the angle between the shoulder, elbow, and wrist landmarks (elbow flexion) is below 20 degrees, and the angle between the elbow, shoulder, and opposite shoulder (shoulder flexion) is also below 20 degrees.
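The angle check described above can be sketched in a few lines of plain Python. This is a minimal illustration rather than the project's actual code: the function names `joint_angle`, `flexion`, and `arm_is_extended` are hypothetical, landmarks are assumed to be (x, y) pairs, and flexion is assumed to mean 180 degrees minus the interior angle at the joint, so that a straight limb reads as 0 degrees.

```python
import math


def joint_angle(a, b, c):
    """Interior angle in degrees at vertex b, formed by the points a-b-c."""
    raw = math.degrees(
        math.atan2(c[1] - b[1], c[0] - b[0])
        - math.atan2(a[1] - b[1], a[0] - b[0])
    )
    raw = abs(raw)
    # Fold reflex angles back into the 0..180 range.
    return 360.0 - raw if raw > 180.0 else raw


def flexion(a, b, c):
    """Assumed convention: 0 degrees for a straight joint, growing as it bends."""
    return 180.0 - joint_angle(a, b, c)


def arm_is_extended(opposite_shoulder, shoulder, elbow, wrist, threshold=20.0):
    """True if both elbow and shoulder flexion are below the threshold."""
    elbow_flexion = flexion(shoulder, elbow, wrist)
    shoulder_flexion = flexion(elbow, shoulder, opposite_shoulder)
    return elbow_flexion < threshold and shoulder_flexion < threshold
```

With shoulders at (0, 0) and (1, 0) and the arm continuing along the same line through (2, 0) and (3, 0), both flexion values are 0 and the check passes; bending the wrist up to (2, 1) raises the elbow flexion to 90 degrees and the check fails.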

Part 2: Icon Image Transformation

To transform the icon image that will be overlaid on the user, I first preprocess the icon image, then apply an affine transform. To preprocess the icon, I resize it to roughly the same height as the user, a measurement also calculated from MediaPipe's landmarks. I then add a border to the icon image so that its image array has the same dimensions as the video stream frames. These steps help make the affine transform more effective. For the transform itself, I select three key pose landmarks for each pose, then find three key points on the icon that should match them. For example, for the star pose I match the person's nose and ankles to the top tip and bottom two tips of the star.
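Three point pairs determine an affine transform exactly, which is why three landmarks per pose are enough. The sketch below solves for the 2x3 matrix with NumPy so the arithmetic is visible; in OpenCV the same matrix comes from `cv2.getAffineTransform`, and `cv2.warpAffine` applies it to an image. The function names and the sample coordinates are hypothetical, not taken from the project.

```python
import numpy as np


def affine_from_three_points(src, dst):
    """Return the 2x3 matrix M such that M @ [x, y, 1] maps each src point to dst."""
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    # Each row of A is [x, y, 1]; solving A @ X = dst gives X = M transposed.
    A = np.hstack([src, np.ones((3, 1))])
    return np.linalg.solve(A, dst).T


def apply_affine(M, points):
    """Apply a 2x3 affine matrix to an (n, 2) array of points."""
    points = np.asarray(points, dtype=float)
    return points @ M[:, :2].T + M[:, 2]


# Hypothetical example: star tips on the icon (top, bottom-left, bottom-right)
# matched to the nose and both ankles in the video frame.
icon_points = [(50, 0), (0, 100), (100, 100)]
body_points = [(320, 80), (220, 440), (420, 440)]
M = affine_from_three_points(icon_points, body_points)
```

The three source points must not lie on one line, otherwise the system has no unique solution and `np.linalg.solve` raises an error.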

Part 3: Image Overlay

I overlay only the icon pixels (the icon background is ignored) by summing 0.5 of the icon pixel value with 0.5 of the video frame value, resulting in a semi-transparent overlay of just the icon.
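The 50/50 blend can be sketched with a boolean mask so that only icon pixels are mixed into the frame. This is an illustration under assumptions: `blend_icon` is a hypothetical name, the icon is assumed to already match the frame's shape, and background pixels are assumed to be pure black (all channels 0), which may differ from how the project identifies the background.

```python
import numpy as np


def blend_icon(frame, icon, background=0, alpha=0.5):
    """Blend icon pixels into frame at the given alpha; leave other pixels untouched."""
    frame = np.asarray(frame)
    icon = np.asarray(icon)
    # A pixel belongs to the icon if any channel differs from the background value.
    mask = np.any(icon != background, axis=-1)
    out = frame.copy()
    mixed = (alpha * icon[mask].astype(np.float32)
             + (1.0 - alpha) * frame[mask].astype(np.float32))
    out[mask] = mixed.astype(frame.dtype)
    return out
```

Computing the sum in floating point before casting back avoids overflow in 8-bit images, and the input frame is left unmodified because the blend is written into a copy.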

OpenCV, MediaPipe Pose Estimation, Affine Transform for Icon Overlay

Results

Star Pose

Tree Pose

Chair Pose

Owner

Anna Garverick
