Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).

Last update: Dec 13, 2022

Overview

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR2018)

By Zilong Huang, Xinggang Wang, Jiasi Wang, Wenyu Liu and Jingdong Wang.

This code is a implementation of the weakly-supervised semantic segmentation experiments in the paper DSRG. The code is developed based on the Caffe framework.

Introduction

Overview of the proposed approach. The Deep Seeded Region Growing module takes the seed cues and segmentation map as input to produces latent pixel-wise supervision which is more accurate and more complete than seed cues. Our method iterates between reﬁning pixel-wise supervision and optimizing the parameters of a segmentation network.

License

DSRG is released under the MIT License (refer to the LICENSE file for details).

Citing DSRG

If you find DSRG useful in your research, please consider citing:

@inproceedings{huang2018dsrg,
    title={Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing},
    author={Huang, Zilong and Wang, Xinggang and Wang, Jiasi and Liu, Wenyu and Wang, Jingdong},
    booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
    pages={7014--7023},
    year={2018}
}

Installing dependencies

Python packages:

      $ pip install -r python-dependencies.txt

caffe (deeplabv2 version): deeplabv2 caffe installation instructions are available at https://bitbucket.org/aquariusjay/deeplab-public-ver2. Note, you need to compile caffe with python wrapper and support for python layers. Then add the caffe python path into training/tools/findcaffe.py.
Fully connected CRF wrapper (requires the Eigen3 package).

      $ pip install CRF/

Training the DSRG model

Go into the training directory:

      $ cd training
      $ mkdir localization_cues

Download the initial VGG16 model pretrained on Imagenet and put it in training/ folder.
Download CAM seed and put it in training/localization_cues folder. We use CAM for localizing the foreground seed classes and utilize the saliency detection technology DRFI for localizing background seed. We provide the python interface to DRFI here for convenience if you want to generate the seed by yourself.

      $ cd training/experiment/seed_mc
      $ mkdir models

Set root_folder parameter in train-s.prototxt, train-f.prototxt and PASCAL_DIR in run-s.sh to the directory with PASCAL VOC 2012 images
Run:

      $ bash run.sh

The trained model will be created in models

Acknowledgment

This code is heavily borrowed from SEC.

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).

Related tags

Overview

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR2018)

Introduction

License

Citing DSRG

Installing dependencies

Training the DSRG model

Acknowledgment

Owner

Zilong Huang

Code for Multinomial Diffusion

Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022

PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.

classification task on dataset-CIFAR10,by using Tensorflow/keras

CAPRI: Context-Aware Interpretable Point-of-Interest Recommendation Framework

Semantic Segmentation for Aerial Imagery using Convolutional Neural Network

Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

Transformer in Computer Vision

Outlier Exposure with Confidence Control for Out-of-Distribution Detection

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

"Segmenter: Transformer for Semantic Segmentation" reproduced via mmsegmentation

Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Find-Lane-Line - Use openCV library and Python to detect the road-lane-line

Misc YOLOL scripts for use in the Starbase space sandbox videogame

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

MutualGuide is a compact object detector specially designed for embedded devices

The personal repository of the work: DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer.

RAMA: Rapid algorithm for multicut problem

A PyTorch implementation of EfficientDet.

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).

Related tags

Overview

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR2018)

Introduction

License

Citing DSRG

Installing dependencies

Training the DSRG model

Acknowledgment

Owner

Zilong Huang

Code for Multinomial Diffusion

Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022

PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.

classification task on dataset-CIFAR10,by using Tensorflow/keras

CAPRI: Context-Aware Interpretable Point-of-Interest Recommendation Framework

Semantic Segmentation for Aerial Imagery using Convolutional Neural Network

Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

Transformer in Computer Vision

Outlier Exposure with Confidence Control for Out-of-Distribution Detection

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

"Segmenter: Transformer for Semantic Segmentation" reproduced via mmsegmentation

Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Find-Lane-Line - Use openCV library and Python to detect the road-lane-line

Misc YOLOL scripts for use in the Starbase space sandbox videogame

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

MutualGuide is a compact object detector specially designed for embedded devices

The personal repository of the work: *DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer*.

RAMA: Rapid algorithm for multicut problem

A PyTorch implementation of EfficientDet.

The personal repository of the work: DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer.