MonoRCNN is a monocular 3D object detection method for automonous driving

Last update: Dec 27, 2022

Related tags

Overview

MonoRCNN

MonoRCNN is a monocular 3D object detection method for automonous driving, published at ICCV 2021. This project is an implementation of MonoRCNN.

Visualization

Methodology

Installation

Python 3.6
PyTorch 1.5.0
Detectron2 0.1.3

Please use the Detectron2 included in this project. To ignore fully occluded objects during training, build.py, rpn.py, and roi_heads.py have been modified.

Dataset Preparation

KITTI

Model & Log

KITTI val1 split

Organize the downloaded files as follows:

├── projects
│   ├── MonoRCNN
│   │   ├── output
│   │   │   ├── model
│   │   │   ├── log.txt
│   │   │   ├── ...

Test

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1 --resume --eval-only

Set VISUALIZE as True to visualize 3D object detection results (saved in output/evaluation/test/visualization).

Training

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1

Citation

If you find this project useful in your research, please cite:

@inproceedings{MonoRCNN_ICCV21,
    title = {Geometry-based Distance Decomposition for Monocular 3D Object Detection},
    author = {Xuepeng Shi and Qi Ye and 
              Xiaozhi Chen and Chuangrong Chen and 
              Zhixiang Chen and Tae-Kyun Kim},
    booktitle = {ICCV},
    year = {2021},
}

Contact

[email protected]

MonoRCNN is a monocular 3D object detection method for automonous driving

Related tags

Overview

MonoRCNN

Visualization

Methodology

Related Link

Installation

Dataset Preparation

Model & Log

Test

Training

Citation

Contact

Acknowledgement

Owner

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation

Official Implementation of "Tracking Grow-Finish Pigs Across Large Pens Using Multiple Cameras"

[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias

A forwarding MPI implementation that can use any other MPI implementation via an MPI ABI

A pytorch implementation of Pytorch-Sketch-RNN

Config files for my GitHub profile.

A flag generation AI created using DeepAIs API

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones Code

Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity

Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations

MPI-IS Mesh Processing Library

social humanoid robots with GPGPU and IoT

Implementation of ICCV21 paper: PnP-DETR: Towards Efficient Visual Analysis with Transformers

DirectVoxGO reconstructs a scene representation from a set of calibrated images capturing the scene.

Use AI to generate a optimized stock portfolio

FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset (CVPR2022)

Files for a tutorial to train SegNet for road scenes using the CamVid dataset

Computer Vision is an elective course of MSAI, SCSE, NTU, Singapore

Code repository for paper `Skeleton Merger: an Unsupervised Aligned Keypoint Detector`.

Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier