Codes_APN

Official codes of CVPR21 paper: Normal Learning in Videos with Attention Prototype Network (https://arxiv.org/abs/2108.11055)

Overview of our approach based on APU and CAU model:

Introduction

Frame reconstruction (current or future frame) based on Auto-Encoder (AE) is a popular method for video anomaly detection. With models trained on the normal data, the reconstruction errors of anomalous scenes are usually much larger than those of normal ones. Previous methods introduced the memory bank into AE, for encoding diverse normal patterns across the training videos. However, they are memory consuming and cannot cope with unseen new scenarios in the testing data. In this work, we propose a self-attention prototype unit (APU) to encode the normal latent space as prototypes in real time, free from extra memory cost. In addition, we introduce circulative attention mechanism to our backbone to form a novel feature extracting learner, namely Circulative Attention Unit(CAU). It enables the fast adaption capability on new scenes by only consuming a few iterations of update. Extensive experiments are conducted on various benchmarks. The superior performance over the state-of-the-art demonstrates the effectiveness of our method.

Performance

We achieved SOTA on many video anomaly detection datasets.

Unsupervised Anomaly Detection Model Training

bash train.sh

Unsupervised Anomaly Detection Model Testing

bash test.sh

If you find this work helpful, please cite:

@inproceedings{Nv2021APN,
  author    = {Chao Hu and
	       Fan Wu and
               Weijie Wu and
               Weibin Qiu and
               Shengxin Lai},
  title     = {Normal Learning in Videos with Attention Prototype Network},
  booktitle = {Computer Vision and Pattern Recognition},
  year      = {2021}
}

Normal Learning in Videos with Attention Prototype Network

Related tags

Overview

Codes_APN

Introduction

Performance

Unsupervised Anomaly Detection Model Training

Unsupervised Anomaly Detection Model Testing

Owner

The Body Part Regression (BPR) model translates the anatomy in a radiologic volume into a machine-interpretable form.

Differentiable simulation for system identification and visuomotor control

Kaggle | 9th place single model solution for TGS Salt Identification Challenge

YOLOX-RMPOLY

Pytorch implementation of PCT: Point Cloud Transformer

Apply Graph Self-Supervised Learning methods to graph-level task(TUDataset, MolculeNet Datset)

Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."

Latent Network Models to Account for Noisy, Multiply-Reported Social Network Data

Intro-to-dl - Resources for "Introduction to Deep Learning" course.

Molecular Sets (MOSES): A benchmarking platform for molecular generation models

[CVPR'21] Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view Transformation

Source code to accompany Defunctland's video "FASTPASS: A Complicated Legacy"

A PyTorch-Based Framework for Deep Learning in Computer Vision

x-transformers-paddle 2.x version

Split Variational AutoEncoder

Official PyTorch code for the paper: "Point-Based Modeling of Human Clothing" (ICCV 2021)

FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks

MINOS: Multimodal Indoor Simulator

Pure python implementations of popular ML algorithms.

[CVPR'22] Official PyTorch Implementation of Collaborative Transformers for Grounded Situation Recognition