Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Last update: Jun 27, 2022

Related tags

Overview

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Abstract

Analyzing complex scenes with DNN is a challenging task, particularly when images contain multiple objects that partially occlude each other. Existing approaches to image analysis mostly process objects independently and do not take into account the relative occlusion of nearby objects. We propose a deep network for multi-object instance segmentation that is robust to occlusion and can be trained from bounding box supervision only.

We also introduce an Occlusion Challenge dataset generated from real-world segmented objects with accurate annotations and propose a taxonomy of occlusion scenarios that pose a particular challenge for computer vision.

NOTICE

dataset links and model will be released in a few days. Update: 18 June

Requirments

The code uses Python 3.6 and it is tested on PyTorch GPU version 1.2, with CUDA-10.0 and cuDNN-7.5.

Installation

Clone the repository with:

git clone https://github.com/XD7479/Multi-Object-Occlusion.git
cd Multi-Object-Occlusion

Install requirments:

pip install -r requirements.txt

Datasets

Download the KINS dataset here and the Occlusion Challenge dataset here.
Enter the project folder and make links for the datasets:

ln -s  kins
ln -s  occ_challenge

Download the pre-trained model here.
Make links for the pre-trained model:

ln -s  models

Check the configuration file configs.py for the dataset and backbone you're using:

dataset_eval = 'occ_challenge'      # kins, occ_challenge
nn_type = 'resnext'             # vgg, resnext

Run the evaluation code with:

python3 eval_meanIoU.py

Segmentation Demo

Citation

@misc{yuan2021robust,
      title={Robust Instance Segmentation through Reasoning about Multi-Object Occlusion}, 
      author={Xiaoding Yuan and Adam Kortylewski and Yihong Sun and Alan Yuille},
      booktitle = {Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR)},
      month = jun,
      year = {2021},
      month_numeric = {6}
}

Contact

If you have any questions you can contact Xiaoding Yuan by [email protected].

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Related tags

Overview

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Abstract

NOTICE

Requirments

Installation

Datasets

Segmentation Demo

Citation

Contact

Owner

Irene Yuan

The Pytorch implementation for "Video-Text Pre-training with Learned Regions"

Code for Private Recommender Systems: How Can Users Build Their Own Fair Recommender Systems without Log Data? (SDM 2022)

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search, accepted by IJCAI 2021.

Image process framework based on plugin like imagej, it is esay to glue with scipy.ndimage, scikit-image, opencv, simpleitk, mayavi...and any libraries based on numpy

Official implementation of "A Shared Representation for Photorealistic Driving Simulators" in PyTorch.

Language Models Can See: Plugging Visual Controls in Text Generation

Simple ray intersection library similar to coldet - succedeed by libacc

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Data-depth-inference - Data depth inference with python

Data labels and scripts for fastMRI.org

Finetuner allows one to tune the weights of any deep neural network for better embeddings on search tasks

10x faster matrix and vector operations

A clean and scalable template to kickstart your deep learning project 🚀 ⚡ 🔥

Make your AirPlay devices as TTS speakers

This repository contains the source code for the paper "DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks",

Hierarchical User Intent Graph Network for Multimedia Recommendation

[CVPR'22] Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast

Stochastic Extragradient: General Analysis and Improved Rates

Source code of our TTH paper: Targeted Trojan-Horse Attacks on Language-based Image Retrieval.