This is the code for the paper "Motion-Focused Contrastive Learning of Video Representations" (ICCV'21).

Last update: Sep 23, 2022

Overview

Motion-Focused Contrastive Learning of Video Representations

Introduction

This is the code for the paper "Motion-Focused Contrastive Learning of Video Representations" (ICCV'21).

Requirements

torch == 1.5.1
torchvision == 0.6.1
liblinear
joblib

Data Preparation

You can refer to data_prepare

MCL Pretraining and Linear Evaluation

This implementation only supports multi-gpu, DistributedDataParallel training, which is faster and simpler; single-gpu or DataParallel training is not supported.

Following SeCo, try to download the weights MoCo v2 (200epochs) and put it into the pretrain folder, and run:

for UCF101 pretraining and linear evaluation
```
bash main_ucf101.sh
```
for Kinetics400 pretraining and linear evaluation
```
bash main_kinetics.sh
```

The checkpoint will be saved in the output/checkpoints entry defined in the configuration file. Besides, the linear evaluation result can be found in output/eval_output_linear.

Downstream task evaluation

finetune for UCF101

cd evaluate/downstream_finetune
bash run_ucf101.sh

finetune for HMDB51

cd evaluate/downstream_finetune
bash run_hmdb51.sh

The finetune result can be found in output/eval_output_finetune

This is the code for the paper "Motion-Focused Contrastive Learning of Video Representations" (ICCV'21).

Related tags

Overview

Motion-Focused Contrastive Learning of Video Representations

Introduction

Requirements

Data Preparation

MCL Pretraining and Linear Evaluation

Downstream task evaluation

Owner

A GPT, made only of MLPs, in Jax

A facial recognition doorbell system using a Raspberry Pi

A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation

[IEEE TPAMI21] MobileSal: Extremely Efficient RGB-D Salient Object Detection [PyTorch & Jittor]

A PyTorch Reimplementation of TecoGAN: Temporally Coherent GAN for Video Super-Resolution

A Home Assistant custom component for Lobe. Lobe is an AI tool that can classify images.

COVID-Net Open Source Initiative

My implementation of Image Inpainting - A deep learning Inpainting model

SHRIMP: Sparser Random Feature Models via Iterative Magnitude Pruning

A toolkit for developing and comparing reinforcement learning algorithms.

Codeflare - Scale complex AI/ML pipelines anywhere

LQM - Improving Object Detection by Estimating Bounding Box Quality Accurately

the code for our CVPR 2021 paper Bilateral Grid Learning for Stereo Matching Network [BGNet]

Python scripts form performing stereo depth estimation using the high res stereo model in PyTorch .

Neurolab is a simple and powerful Neural Network Library for Python

Pytorch implementation for DFN: Distributed Feedback Network for Single-Image Deraining.

Landmarks Recogntion Web application using Streamlit.

Official Pytorch Code for the paper TransWeather

The official codes for the ICCV2021 Oral presentation "Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework"