Code for CVPR 2022 paper "SoftGroup for Instance Segmentation on 3D Point Clouds"

Last update: Dec 27, 2022

Related tags

Overview

SoftGroup

We provide code for reproducing results of the paper SoftGroup for 3D Instance Segmentation on Point Clouds (CVPR 2022)

Author: Thang Vu, Kookhoi Kim, Tung M. Luu, Xuan Thanh Nguyen, and Chang D. Yoo.

Introduction

Existing state-of-the-art 3D instance segmentation methods perform semantic segmentation followed by grouping. The hard predictions are made when performing semantic segmentation such that each point is associated with a single class. However, the errors stemming from hard decision propagate into grouping that results in (1) low overlaps between the predicted instance with the ground truth and (2) substantial false positives. To address the aforementioned problems, this paper proposes a 3D instance segmentation method referred to as SoftGroup by performing bottom-up soft grouping followed by top-down refinement. SoftGroup allows each point to be associated with multiple classes to mitigate the problems stemming from semantic prediction errors and suppresses false positive instances by learning to categorize them as background. Experimental results on different datasets and multiple evaluation metrics demonstrate the efficacy of SoftGroup. Its performance surpasses the strongest prior method by a significant margin of +6.2% on the ScanNet v2 hidden test set and +6.8% on S3DIS Area 5 of AP_50.

Feature

State of the art performance on the ScanNet benchmark and S3DIS dataset (3/Mar/2022).
High speed of 345 ms per scan on ScanNet dataset, which is comparable with the existing fastest methods (HAIS).
Reproducibility code for both ScanNet and S3DIS datasets.

Installation

Please refer to installation guide.

Data Preparation

Please refer to data preparation for preparing the S3DIS and ScanNet v2 dataset.

Pretrained models

Dataset	AP	AP_50	AP_25	Download
S3DIS	51.4	66.5	75.4	model
ScanNet v2	46.0	67.6	78.9	model

Training

We use the checkpoint of HAIS as pretrained backbone. Download the pretrained HAIS model at here at put it in SoftGroup/ directory.

Training S3DIS dataset

First, finetune the pretrained HAIS point-wise prediction network (backbone) on S3DIS.

python train.py --config config/softgroup_fold5_backbone_s3dis.yaml

Then, train model from frozen backbone.

python train.py --config config/softgroup_fold5_default_s3dis.yaml

Training ScanNet V2 dataset

Training on ScanNet doesnot require finetuning the backbone. Just freeze pretrained backbone and train the model.

python train.py --config config/softgroup_default_scannet.yaml

Inference

Testing for S3DIS dataset.

CUDA_VISIBLE_DEVICES=0 python test_s3dis.py --config config/softgroup_fold5_phase2_s3dis.yaml --pretrain $PATH_TO_PRETRAIN_MODEL$

Testing for ScanNet V2 dataset.

CUDA_VISIBLE_DEVICES=0 python test.py --config config/softgroup_default_scannet.yaml --pretrain $PATH_TO_PRETRAIN_MODEL$

Visualization

We provide visualization tools based on Open3D (tested on Open3D 0.8.0).

pip install open3D==0.8.0
python visualize_open3d.py --data_path {} --prediction_path {} --data_split {} --room_name {} --task {}

Please refer to visualize_open3d.py for more details.

Citation

If you find our work helpful for your research. Please consider citing our paper.

@inproceedings{vu2022softgroup,
  title={SoftGroup for 3D Instance Segmentation on 3D Point Clouds},
  author={Vu, Thang and Kim, Kookhoi and Luu, Tung M. and Nguyen, Xuan Thanh and Yoo, Chang D.},
  booktitle={CVPR},
  year={2022}
}

Code for CVPR 2022 paper "SoftGroup for Instance Segmentation on 3D Point Clouds"

Related tags

Overview

SoftGroup

Introduction

Feature

Installation

Data Preparation

Pretrained models

Training

Training S3DIS dataset

Training ScanNet V2 dataset

Inference

Visualization

Citation

Owner

Thang Vu

Detecting Text in Natural Image with Connectionist Text Proposal Network (ECCV'16)

Code for the ACL2021 paper "Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction"

Document Layout Analysis Projects

Python library to extract tabular data from images and scanned PDFs

Recognizing the text contents from a scanned visiting card

A python screen recorder for low-end computers, provides high quality video output.

fishington.io bot with OpenCV and NumPy

Code for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition

A python scripts that uses 3 different feature extraction methods such as SIFT, SURF and ORB to find a book in a video clip and project trailer of a movie based on that book, on to it.

Course material for the Multi-agents and computer graphics course

A webcam-based 3x3x3 rubik's cube solver written in Python 3 and OpenCV.

かの有名なあの東方二次創作ソング、「bad apple!」のMVをPythonでやってみたって話

BoxToolBox is a simple python application built around the openCV library

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

Face Detection with DLIB

Natural language detection

Multi-choice answer sheet correction system using computer vision with opencv & python.

BNF Globalization Code (CVPR 2016)

Thresholding-and-masking-using-OpenCV - Image Thresholding is used for image segmentation

keras复现场景文本检测网络CPTN: 《Detecting Text in Natural Image with Connectionist Text Proposal Network》；欢迎试用，关注，并反馈问题...