Code for LIGA-Stereo Detector, ICCV'21

Last update: Dec 09, 2022

Related tags

Overview

LIGA-Stereo

Introduction

This is the official implementation of the paper LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector, In ICCV'21, Xiaoyang Guo, Shaoshuai Shi, Xiaogang Wang and Hongsheng Li.

[project page] [paper] [code]

Installation

Requirements

All the codes are tested in the following environment:

Linux (tested on Ubuntu 14.04 / 16.04)
Python 3.7
PyTorch 1.6.0
Torchvision 0.7.0
CUDA 9.2 / 10.1
spconv (commit f22dd9)

Installation Steps

a. Clone this repository.

git clone https://github.com/xy-guo/LIGA.git

b. Install the dependent libraries as follows:

Install the dependent python libraries:

pip install -r requirements.txt

Install the SparseConv library, we use the implementation from [spconv].

git clone https://github.com/traveller59/spconv
git reset --hard f22dd9
git submodule update --recursive
python setup.py bdist_wheel
pip install ./dist/spconv-1.2.1-cp37-cp37m-linux_x86_64.whl

Install modified mmdetection from [mmdetection_kitti]

git clone https://github.com/xy-guo/mmdetection_kitti
python setup.py develop

c. Install this library by running the following command:

python setup.py develop

Getting Started

The dataset configs are located within configs/stereo/dataset_configs, and the model configs are located within configs/stereo for different datasets.

Dataset Preparation

Currently we only provide the dataloader of KITTI dataset.

Please download the official KITTI 3D object detection dataset and organize the downloaded files as follows (the road planes are provided by OpenPCDet [road plane], which are optional for training LiDAR models):

LIGA_PATH
├── data
│   ├── kitti
│   │   │── ImageSets
│   │   │── training
│   │   │   ├──calib & velodyne & label_2 & image_2 & (optional: planes)
│   │   │── testing
│   │   │   ├──calib & velodyne & image_2
├── configs
├── liga
├── tools

You can also choose to link your KITTI dataset path by

YOUR_KITTI_DATA_PATH=~/data/kitti_object
ln -s $YOUR_KITTI_DATA_PATH/training/ ./data/kitti/
ln -s $YOUR_KITTI_DATA_PATH/testing/ ./data/kitti/

Generate the data infos by running the following command:

python -m liga.datasets.kitti.kitti_dataset create_kitti_infos
python -m liga.datasets.kitti.kitti_dataset create_gt_database_only

Training & Testing

Test and evaluate the pretrained models

To test with multiple GPUs:

./scripts/dist_test_ckpt.sh ${NUM_GPUS} ./configs/stereo/kitti_models/liga.yaml ./ckpt/pretrained_liga.pth

Train a model

Train with multiple GPUs

./scripts/dist_train.sh ${NUM_GPUS} 'exp_name' ./configs/stereo/kitti_models/liga.yaml

Pretrained Models

Google Drive

Citation

@InProceedings{Guo_2021_ICCV,
    author = {Guo, Xiaoyang and Shi, Shaoshuai and Wang, Xiaogang and Li, Hongsheng},
    title = {LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month = {October},
    year = {2021}
}

Acknowledgements

Part of codes are migrated from OpenPCDet and DSGN.

Code for LIGA-Stereo Detector, ICCV'21

Related tags

Overview

LIGA-Stereo

Introduction

Overview

Installation

Requirements

Installation Steps

Getting Started

Dataset Preparation

Training & Testing

Test and evaluate the pretrained models

Train a model

Pretrained Models

Citation

Acknowledgements

Owner

Xiaoyang Guo

Codes and pretrained weights for winning submission of 2021 Brain Tumor Segmentation (BraTS) Challenge

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Self-Supervised Image Denoising via Iterative Data Refinement

Unofficial Implementation of MLP-Mixer, gMLP, resMLP, Vision Permutator, S2MLPv2, RaftMLP, ConvMLP, ConvMixer in Jittor and PyTorch.

PyTorch implementation of deep GRAph Contrastive rEpresentation learning (GRACE).

You can draw the corresponding bounding box into the image and save it according to the result file (txt format) run by the tracker.

Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data

Official Pytorch Implementation of Relational Self-Attention: What's Missing in Attention for Video Understanding

Face2webtoon - Despite its importance, there are few previous works applying I2I translation to webtoon.

The code of Zero-shot learning for low-light image enhancement based on dual iteration

Task-related Saliency Network For Few-shot learning

Omnidirectional camera calibration in python

The implemention of Video Depth Estimation by Fusing Flow-to-Depth Proposals

High level network definitions with pre-trained weights in TensorFlow

"Segmenter: Transformer for Semantic Segmentation" reproduced via mmsegmentation

Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

CAPRI: Context-Aware Interpretable Point-of-Interest Recommendation Framework

Python interface for SmartRF Sniffer 2 Firmware

Implementation of Continuous Sparsification, a method for pruning and ticket search in deep networks

This is a Deep Leaning API for classifying emotions from human face and human audios.