[CVPR 2021] Few-shot 3D Point Cloud Semantic Segmentation

Last update: Dec 27, 2022

Related tags

Overview

Few-shot 3D Point Cloud Semantic Segmentation

Created by Na Zhao from National University of Singapore

Introduction

This repository contains the PyTorch implementation for our CVPR 2021 Paper "Few-shot 3D Point Cloud Semantic Segmentation" by Na Zhao, Tat-Seng Chua, Gim Hee Lee.

Many existing approaches for point cloud semantic segmentation are fully supervised. These fully supervised approaches heavily rely on a large amount of labeled training data that is difficult to obtain and can not generalize to unseen classes after training. To mitigate these limitations, we propose a novel attention-aware multi-prototype transductive few-shot point cloud semantic segmentation method to segment new classes given a few labeled examples. Specifically, each class is represented by multiple prototypes to model the complex data distribution of 3D point clouds. Subsequently, we employ a transductive label propagation method to exploit the affinities between labeled multi-prototypes and unlabeled query points, and among the unlabeled query points. Furthermore, we design an attention-aware multi-level feature learning network to learn the discriminative features that capture the semantic correlations and geometric dependencies between points. Our proposed method shows significant and consistent improvements compared to the baselines in different few-shot point cloud segmentation settings (i.e. 2/3-way 1/5-shot) on two benchmark datasets.

Installation

Install python --This repo is tested with python 3.6.8.
Install pytorch with CUDA -- This repo is tested with torch 1.4.0, CUDA 10.1. It may work with newer versions, but that is not gauranteed.
Install faiss with cpu version

Install 'torch-cluster' with the corrreponding torch and cuda version

 pip install torch-cluster==latest+cu101 -f https://pytorch-geometric.com/whl/torch-1.5.0.html

Install dependencies

pip install tensorboard h5py transforms3d

Usage

Data preparation

S3DIS

Download S3DIS Dataset Version 1.2.
Re-organize raw data into npy files by running
```
cd ./preprocess
python collect_s3dis_data.py --data_path $path_to_S3DIS_raw_data
```
The generated numpy files are stored in ./datasets/S3DIS/scenes/ by default.
To split rooms into blocks, run

python ./preprocess/room2blocks.py --data_path ./datasets/S3DIS/scenes/

One folder named blocks_bs1_s1 will be generated under ./datasets/S3DIS/ by default.

ScanNet

Download ScanNet V2.
Re-organize raw data into npy files by running
```
cd ./preprocess
python collect_scannet_data.py --data_path $path_to_ScanNet_raw_data
```
The generated numpy files are stored in ./datasets/ScanNet/scenes/ by default.
To split rooms into blocks, run

python ./preprocess/room2blocks.py --data_path ./datasets/ScanNet/scenes/ --dataset scannet

One folder named blocks_bs1_s1 will be generated under ./datasets/ScanNet/ by default.

Running

Training

First, pretrain the segmentor which includes feature extractor module on the available training set:

cd scripts
bash pretrain_segmentor.sh

Second, train our method:

bash train_attMPTI.sh

Evaluation

bash eval_attMPTI.sh

Note that the above scripts are used for 2-way 1-shot on S3DIS (S^0). You can modified the corresponding hyperparameters to conduct experiments on other settings.

Citation

Please cite our paper if it is helpful to your research:

@inproceedings{zhao2021few,
  title={Few-shot 3D Point Cloud Semantic Segmentation},
  author={Zhao, Na and Chua, Tat-Seng and Lee, Gim Hee},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2021}
}

Acknowledgement

We thank DGCNN (pytorch) for sharing their source code.

[CVPR 2021] Few-shot 3D Point Cloud Semantic Segmentation

Related tags

Overview

Few-shot 3D Point Cloud Semantic Segmentation

Introduction

Installation

Usage

Data preparation

S3DIS

ScanNet

Running

Training

Evaluation

Citation

Acknowledgement

Owner

Art Project "Schrödinger's Game of Life"

Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering (NAACL 2021)

Fast RFC3339 compliant Python date-time library

A distributed deep learning framework that supports flexible parallelization strategies.

Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

Prototypical Networks for Few shot Learning in PyTorch

QuakeLabeler is a Python package to create and manage your seismic training data, processes, and visualization in a single place — so you can focus on building the next big thing.

The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Nature Gradient.

This is the code of paper ``Contrastive Coding for Active Learning under Class Distribution Mismatch'' with python.

Deep learning toolbox based on PyTorch for hyperspectral data classification.

AI-Fitness-Tracker - AI Fitness Tracker With Python

Using Streamlit to host a multi-page tool with model specs and classification metrics, while also accepting user input values for prediction.

POCO: Point Convolution for Surface Reconstruction

Audio Visual Emotion Recognition using TDA

Quickly and easily create / train a custom DeepDream model

Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions.

FuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space OptimizationFuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space Optimization

Scikit-event-correlation - Event Correlation and Forecasting over High Dimensional Streaming Sensor Data algorithms

The pure and clear PyTorch Distributed Training Framework.

Code for the paper "Combining Textual Features for the Detection of Hateful and Offensive Language"