Official implementation of CATs: Cost Aggregation Transformers for Visual Correspondence NeurIPS'21

Last update: Jan 04, 2023

Overview

CATs: Cost Aggregation Transformers for Visual Correspondence NeurIPS'21

For more information, check out the paper on [arXiv].

Training and evaluation with different backbones will be added soon.

Check out our new paper! [arXiv]

Network

Our model, CATs, is illustrated below:

[Figure: overview of the CATs network architecture]

Environment Settings

git clone https://github.com/SunghwanHong/CATs
cd CATs

conda create -n CATs python=3.6
conda activate CATs

pip install torch==1.8.0+cu111 torchvision==0.9.0+cu111 torchaudio==0.8.0 -f https://download.pytorch.org/whl/torch_stable.html
pip install -U scikit-image
pip install git+https://github.com/albumentations-team/albumentations
pip install tensorboardX termcolor timm tqdm requests pandas
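
After installation, a quick check that every package can be found may save a failed run later. The sketch below is not part of the repository: the helper name `missing_packages` is hypothetical, and the module names listed (for example `skimage` for scikit-image) are the usual import names of the packages installed above.

```python
from importlib.util import find_spec

# Top-level import names for the packages installed above.
# Note: scikit-image is imported as "skimage".
REQUIRED = [
    "torch", "torchvision", "torchaudio", "skimage", "albumentations",
    "tensorboardX", "termcolor", "timm", "tqdm", "requests", "pandas",
]

def missing_packages(names):
    """Return the top-level module names that cannot be located."""
    return [name for name in names if find_spec(name) is None]

if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages were found.")
```

This only checks that each module can be located in the active environment; it does not verify versions or CUDA support.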

Evaluation

Download the pre-trained weights from Link.
All datasets are downloaded automatically into the directory specified by the datapath argument.

Result on SPair-71k: (PCK 49.9%)

  python test.py --pretrained "/path_to_pretrained_model/spair" --benchmark spair

Result on SPair-71k, feature backbone frozen: (PCK 42.4%)

  python test.py --pretrained "/path_to_pretrained_model/spair_frozen" --benchmark spair

Results on PF-PASCAL: (PCK 75.4%, 92.6%, 96.4%)

  python test.py --pretrained "/path_to_pretrained_model/pfpascal" --benchmark pfpascal

Results on PF-PASCAL, feature backbone frozen: (PCK 67.5%, 89.1%, 94.9%)

  python test.py --pretrained "/path_to_pretrained_model/pfpascal_frozen" --benchmark pfpascal
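
The PCK (percentage of correct keypoints) figures above count a transferred keypoint as correct when it lands within a threshold distance of its ground-truth location. The following is a minimal sketch of that metric, not the repository's evaluation code: the function name `pck`, its argument names, and the threshold `alpha * ref_size` (with `ref_size` taken as the larger side of the image or of the object bounding box, depending on the benchmark) are assumptions for illustration.

```python
import math

def pck(pred_kps, gt_kps, ref_size, alpha=0.1):
    """Fraction of predicted keypoints within alpha * ref_size of ground truth.

    pred_kps, gt_kps: equal-length sequences of (x, y) pairs.
    ref_size: reference length in pixels (assumed: larger image or bbox side).
    alpha: tolerance as a fraction of ref_size.
    """
    if len(pred_kps) != len(gt_kps):
        raise ValueError("pred_kps and gt_kps must have the same length")
    if not pred_kps:
        raise ValueError("at least one keypoint is required")

    threshold = alpha * ref_size
    correct = 0
    for (px, py), (gx, gy) in zip(pred_kps, gt_kps):
        # Euclidean distance between the predicted and true keypoint.
        if math.hypot(px - gx, py - gy) <= threshold:
            correct += 1
    return correct / len(pred_kps)

if __name__ == "__main__":
    # Hypothetical keypoints, for illustration only.
    pred = [(10, 10), (50, 50), (90, 95)]
    gt = [(12, 10), (50, 70), (90, 90)]
    print(f"PCK: {pck(pred, gt, ref_size=100, alpha=0.1):.3f}")
```

A smaller `alpha` makes the criterion stricter, so the same predictions yield a lower or equal score.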

Acknowledgement

We borrow code from public projects, mainly DHPF and GLU-Net. Huge thanks to all of them.

BibTeX

If you find this research useful, please consider citing:

@inproceedings{cho2021cats,
  title={CATs: Cost Aggregation Transformers for Visual Correspondence},
  author={Cho, Seokju and Hong, Sunghwan and Jeon, Sangryul and Lee, Yunsung and Sohn, Kwanghoon and Kim, Seungryong},
  booktitle={Thirty-Fifth Conference on Neural Information Processing Systems},
  year={2021}
}
