Official PyTorch implementation of "Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition" in AAAI2022.

Last update: Dec 17, 2022

Related tags

Deep Learning AimCLR

Overview

AimCLR

This is an official PyTorch implementation of "Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition" in AAAI2022.

Requirements

Data Preparation

Download the raw data of NTU RGB+D and PKU-MMD.
For NTU RGB+D dataset, preprocess data with tools/ntu_gendata.py. For PKU-MMD dataset, preprocess data with tools/pku_part1_gendata.py.
Then downsample the data to 50 frames with feeder/preprocess_ntu.py and feeder/preprocess_pku.py.
If you don't want to process the original data, download the file folder action_dataset.

Installation

# Install torchlight
$ cd torchlight
$ python setup.py install
$ cd ..

# Install other python libraries
$ pip install -r requirements.txt

Unsupervised Pre-Training

Example for unsupervised pre-training of 3s-AimCLR. You can change some settings of .yaml files in config/ntu60/pretext folder.

# train on NTU RGB+D xview joint stream
$ python main.py pretrain_aimclr --config config/ntu60/pretext/pretext_aimclr_xview_joint.yaml

# train on NTU RGB+D xview motion stream
$ python main.py pretrain_aimclr --config config/ntu60/pretext/pretext_aimclr_xview_motion.yaml

# train on NTU RGB+D xview bone stream
$ python main.py pretrain_aimclr --config config/ntu60/pretext/pretext_aimclr_xview_bone.yaml

Linear Evaluation

Example for linear evaluation of 3s-AimCLR. You can change .yaml files in config/ntu60/linear_eval folder.

# Linear_eval on NTU RGB+D xview
$ python main.py linear_evaluation --config config/ntu60/linear_eval/linear_eval_aimclr_xview_joint.yaml

$ python main.py linear_evaluation --config config/ntu60/linear_eval/linear_eval_aimclr_xview_motion.yaml

$ python main.py linear_evaluation --config config/ntu60/linear_eval/linear_eval_aimclr_xview_bone.yaml

Trained models

We release several trained models in released_model. The performance is better than that reported in the paper. You can download them and test them with linear evaluation by changing weights in .yaml files.

Model	NTU 60 xsub (%)	NTU 60 xview (%)	PKU-MMD Part I (%)
AimCLR-joint	74.34	79.68	83.43
AimCLR-motion	68.68	71.83	72.00
AimCLR-bone	71.87	77.02	82.03
3s-AimCLR	79.18	84.02	87.79

Visualization

The t-SNE visualization of the embeddings after AimCLR pre-training on NTU60-xsub.

Citation

Please cite our paper if you find this repository useful in your resesarch:

@inproceedings{guo2022aimclr,
  Title= {Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition},
  Author= {Tianyu, Guo and Hong, Liu and Zhan, Chen and Mengyuan, Liu and Tao, Wang  and Runwei, Ding},
  Booktitle= {AAAI},
  Year= {2022}
}

Acknowledgement

The framework of our code is extended from the following repositories. We sincerely thank the authors for releasing the codes.

The framework of our code is based on CrosSCLR.
The encoder is based on ST-GCN.

Licence

This project is licensed under the terms of the MIT license.

Official PyTorch implementation of "Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition" in AAAI2022.

Related tags

Overview

AimCLR

Requirements

Data Preparation

Installation

Unsupervised Pre-Training

Linear Evaluation

Trained models

Visualization

Citation

Acknowledgement

Licence

Owner

Gty

1st-in-MICCAI2020-CPM - Combined Radiology and Pathology Classification

This is my codes that can visualize the psnr image in testing videos.

ObjDetApp deploys a pytorch model for object detection

Complete system for facial identity system. Include one-shot model, database operation, features visualization, monitoring

source code of “Visual Saliency Transformer” (ICCV2021)

A simple, high level, easy-to-use open source Computer Vision library for Python.

KSAI Lite is a deep learning inference framework of kingsoft, based on tensorflow lite

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。

Python Auto-ML Package for Tabular Datasets

Code for the paper "TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks"

3D AffordanceNet is a 3D point cloud benchmark consisting of 23k shapes from 23 semantic object categories, annotated with 56k affordance annotations and covering 18 visual affordance categories.

Supplementary materials to "Spin-optomechanical quantum interface enabled by an ultrasmall mechanical and optical mode volume cavity" by H. Raniwala, S. Krastanov, M. Eichenfield, and D. R. Englund, 2022

A PyTorch-based Semi-Supervised Learning (SSL) Codebase for Pixel-wise (Pixel) Vision Tasks

Python KNN model: Predicting a probability of getting a work visa. Tableau: Non-immigrant visas over the years.

《LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation》(AAAI 2021) GitHub:

MLPs for Vision and Langauge Modeling (Coming Soon)

Reinforcement learning algorithms in RLlib

This code reproduces the results of the paper, "Measuring Data Leakage in Machine-Learning Models with Fisher Information"

An Object Oriented Programming (OOP) interface for Ontology Web language (OWL) ontologies.

Generate high quality pictures. GAN. Generative Adversarial Networks

Official PyTorch implementation of "Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition" in AAAI2022.

Related tags

Overview

AimCLR

Requirements

Data Preparation

Installation

Unsupervised Pre-Training

Linear Evaluation

Trained models

Visualization

Citation

Acknowledgement

Licence

Owner

Gty

1st-in-MICCAI2020-CPM - Combined Radiology and Pathology Classification

This is my codes that can visualize the psnr image in testing videos.

*ObjDetApp* deploys a pytorch model for object detection

Complete system for facial identity system. Include one-shot model, database operation, features visualization, monitoring

source code of “Visual Saliency Transformer” (ICCV2021)

A simple, high level, easy-to-use open source Computer Vision library for Python.

KSAI Lite is a deep learning inference framework of kingsoft, based on tensorflow lite

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。

Python Auto-ML Package for Tabular Datasets

Code for the paper "TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks"

3D AffordanceNet is a 3D point cloud benchmark consisting of 23k shapes from 23 semantic object categories, annotated with 56k affordance annotations and covering 18 visual affordance categories.

Supplementary materials to "Spin-optomechanical quantum interface enabled by an ultrasmall mechanical and optical mode volume cavity" by H. Raniwala, S. Krastanov, M. Eichenfield, and D. R. Englund, 2022

A PyTorch-based Semi-Supervised Learning (SSL) Codebase for Pixel-wise (Pixel) Vision Tasks

Python KNN model: Predicting a probability of getting a work visa. Tableau: Non-immigrant visas over the years.

《LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation》(AAAI 2021) GitHub:

MLPs for Vision and Langauge Modeling (Coming Soon)

Reinforcement learning algorithms in RLlib

This code reproduces the results of the paper, "Measuring Data Leakage in Machine-Learning Models with Fisher Information"

An Object Oriented Programming (OOP) interface for Ontology Web language (OWL) ontologies.

Generate high quality pictures. GAN. Generative Adversarial Networks

ObjDetApp deploys a pytorch model for object detection