Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Last update: Sep 12, 2022

Related tags

Deep Learning NRD_decoder

Overview

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Requirements

This repository needs mmsegmentation

Training

To train the model(s) in the paper, run this command:

python tools/train.py ./configs/NRD/ade20k/NRD_r101_512x512_164k_ade20k.py

The batch size is 16 in this work. Please change the 'samples_per_gpu' in configs/base/datasets/.. accordingly

Evaluation

To evaluate my model at single-scale inference, run:

python tools/eval.py ./configs/NRD/ade20k/NRD_r101_512x512_164k_ade20k.py  {path-to-checkpoint-file}   --eval mIoU

Pre-trained Models

Results

Our model achieves the following performance on :

[Semantic segmentation results]

Model name	datasets	mIoU	mIoU (ms)
NRD-r101	ade20k (val)	44.01	45.62
NRD-x101	ade20k (val)	44.34	46.35
NRD-r101	pascal-context(val)	52.31 (59 classes)	54.1 (59 classes)
NRD-r101	pascal-context(val)	47.5 (60 classes)	40.9 (60 classes)
NRD-r50	Cityscapes (val)	79.8	80.8
NRD-r101	Cityscapes (val)	80.7	82.0

Contributing

The code is mostly taken from mmsegmentation mmsegmentation is released under the Apache 2.0 license.

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Related tags

Overview

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Requirements

Training

Evaluation

Pre-trained Models

Results

[Semantic segmentation results]

Contributing

Owner

Adelaide Intelligent Machines (AIM) Group

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Vision-Language Pre-training for Image Captioning and Question Answering

Official implementation of paper "Query2Label: A Simple Transformer Way to Multi-Label Classification".

Husein pet projects in here!

Deep Illuminator is a data augmentation tool designed for image relighting. It can be used to easily and efficiently generate a wide range of illumination variants of a single image.

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)

Simple Linear 2nd ODE Solver GUI - A 2nd constant coefficient linear ODE solver with simple GUI using euler's method

Code for Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights

Diabet Feature Engineering - Predict whether people have diabetes when their characteristics are specified

PyTorch implementation of our ICCV 2021 paper Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer.

Summary of related papers on visual attention

This program uses trial auth token of Azure Cognitive Services to do speech synthesis for you.

Deep Learning applied to Integral data analysis

PRTR: Pose Recognition with Cascade Transformers

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

A small fun project using python OpenCV, mediapipe, and pydirectinput

code for our ECCV 2020 paper "A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation"

Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation

Source code for Fixed-Point GAN for Cloud Detection