Rotation Robust Descriptors

Last update: Nov 15, 2022

Overview

RoRD

Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching

Project Page | Paper link

Evaluation and Datasets

MMA : Training on PhotoTourism and testing on HPatches and proposed Rotated HPatches
Pose Estimation : Training on same PhotoTourism datasets as used for MMA and testing on proposed DiverseView
Visual Place Recognition : Oxford RobotCar training sequence and testing sequence

Pretrained Models

Download models from Google Drive (73.9 MB) in the base directory.

Evaluating RoRD

You can evaluate RoRD on demo images or replace it with your custom images.

Dependencies can be installed in a conda of virtualenv by running:
1. pip install -r requirements.txt
python extractMatch.py <rgb_image1> <rgb_image2> --model_file <path to the model file RoRD>
Example:
python extractMatch.py demo/rgb/rgb1_1.jpg demo/rgb/rgb1_2.jpg --model_file models/rord.pth
This should give you output like this:

RoRD

SIFT

DiverseView Dataset

Download dataset from Google Drive (97.8 MB) in the base directory (only needed if you want to evaluate on DiverseView Dataset).

Evaluation on DiverseView Dataset

The DiverseView Dataset is a custom dataset consisting of 4 scenes with images having high-angle camera rotations and viewpoint changes.

Pose estimation on single image pair of DiverseView dataset:
1. cd demo
2. python register.py --rgb1 <path to rgb image 1> --rgb2 <path to rgb image 2> --depth1 <path to depth image 1> --depth2 <path to depth image 2> --model_rord <path to the model file RoRD>
3. Example:
  python register.py --rgb1 rgb/rgb2_1.jpg --rgb2 rgb/rgb2_2.jpg --depth1 depth/depth2_1.png --depth2 depth/depth2_2.png --model_rord ../models/rord.pth
4. This should give you output like this:

RoRD matches in perspective view

RoRD matches in orthographic view

To visualize the registered point cloud, use --viz3d command:
1. python register.py --rgb1 rgb/rgb2_1.jpg --rgb2 rgb/rgb2_2.jpg --depth1 depth/depth2_1.png --depth2 depth/depth2_2.png --model_rord ../models/rord.pth --viz3d

PointCloud registration using correspondences

Pose estimation on a sequence of DiverseView dataset:
1. cd evaluation/DiverseView/
2. python evalRT.py --dataset <path to DiverseView dataset> --sequence <sequence name> --model_rord <path to RoRD model> --output_dir <name of output dir>
3. Example:
  1. python evalRT.py --dataset /path/to/preprocessed/ --sequence data1 --model_rord ../../models/rord.pth --output_dir out
4. This would generate out folder containing predicted transformations and matching results in out/vis folder, containing images like below:

RoRD

Training RoRD on PhotoTourism Images

Training using rotation homographies with initialization from D2Net weights (Download base models as mentioned in Pretrained Models).
Download branderburg_gate dataset that is used in the configs/train_scenes_small.txt from here(5.3 Gb) in phototourism folder.

Folder stucture should be:

phototourism/  
___ brandenburg_gate  
___ ___ dense  
___ ___	___ images  
___ ___	___ stereo  
___ ___	___ sparse

python trainPT_ipr.py --dataset_path <path_to_phototourism_folder> --init_model models/d2net.pth --plot

TO-DO

Provide VPR code
Provide combine training of RoRD + D2Net
Provide code for calculating error in Diverseview Dataset

Credits

Our base model is borrowed from D2-Net.

BibTex

If you use this code in your project, please cite the following paper:

@misc{rord2021,
      title={RoRD: Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching}, 
      author={Udit Singh Parihar and Aniket Gujarathi and Kinal Mehta and Satyajit Tourani and Sourav Garg and Michael Milford and K. Madhava Krishna},
      year={2021},
      eprint={2103.08573},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Rotation Robust Descriptors

Related tags

Overview

RoRD

Evaluation and Datasets

Pretrained Models

Evaluating RoRD

RoRD

SIFT

DiverseView Dataset

Evaluation on DiverseView Dataset

RoRD matches in perspective view

RoRD matches in orthographic view

PointCloud registration using correspondences

RoRD

Training RoRD on PhotoTourism Images

TO-DO

Credits

BibTex

Owner

Udit Singh Parihar

A Python framework for conversational search

PyTorch implementation of NeurIPS 2021 paper: "CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration"

A Human-in-the-Loop workflow for creating HD images from text

DumpSMBShare - A script to dump files and folders remotely from a Windows SMB share

E-Ink Magic Calendar that automatically syncs to Google Calendar and runs off a battery powered Raspberry Pi Zero

Real-time Object Detection for Streaming Perception, CVPR 2022

Image Segmentation with U-Net Algorithm on Carvana Dataset using AWS Sagemaker

Code for the CVPR 2021 paper "Triple-cooperative Video Shadow Detection"

An open-source project for applying deep learning to medical scenarios

Robot Reinforcement Learning on the Constraint Manifold

A task-agnostic vision-language architecture as a step towards General Purpose Vision

Layered Neural Atlases for Consistent Video Editing

Towards Part-Based Understanding of RGB-D Scans

Implementation of character based convolutional neural network

DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition, TPAMI 2021

TensorFlow CNN for fast style transfer

[CVPR 2022] Back To Reality: Weak-supervised 3D Object Detection with Shape-guided Label Enhancement

:boar: :bear: Deep Learning based Python Library for Stock Market Prediction and Modelling

TDmatch is a Python library developed to perform matching tasks in three categories:

Pytorch implementation of "Geometrically Adaptive Dictionary Attack on Face Recognition" (WACV 2022)