A Strong Baseline for Image Semantic Segmentation

Introduction

This project is an open source semantic segmentation toolbox based on PyTorch. It is based on the codes of our Tianchi competition in 2021 (https://tianchi.aliyun.com/competition/entrance/531860/introduction).
In the competition, our team won the third place (please see Tianchi_README.md).

Overview

The master branch works with PyTorch 1.6+.The project now supports popular and contemporary semantic segmentation frameworks, e.g. UNet, DeepLabV3+, HR-Net etc.

Requirements

Support

Backbone

ResNet (CVPR'2016)
SeNet (CVPR'2018)
IBN-Net (CVPR'2018)
EfficientNet (CVPR'2020)

Methods

Tricks

Tools

large image inference (cut and merge)
post process (crf/superpixels)

Quick Start

Train a model

python train.py --config_file ${CONFIG_FILE}

CONFIG_FILE: File of training config about model

Examples:
We trained our model in Tianchi competition according to the following script:
Stage 1 (160e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_160e.yml

Stage 2 (swa 24e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_swa.yml

Inference with pretrained models

python inference.py --config_file ${CONFIG_FILE}

CONFIG_FILE: File of inference config about model

Predict large image with pretrained models

python predict_demo.py --config_file ${CONFIG_FILE} --rs_img_file ${IMAGE_FILE_PATH} --temp_img_save_path ${TEMP_CUT_PATH} -temp_seg_map_save_path ${TEMP_SAVE_PATH} --save_seg_map_file ${SAVE_SEG_FILE}

CONFIG_FILE: File of inference config about model
IMAGE_FILE_PATH: File of large input image to predict
TEMP_CUT_PATH: Temp folder of small cutting samples
TEMP_SAVE_PATH: Temp folder of predict results of cutting samples
SAVE_SEG_FILE: Predict result of the large image

A Strong Baseline for Image Semantic Segmentation

Related tags

Overview

A Strong Baseline for Image Semantic Segmentation

Introduction

Overview

Requirements

Support

Backbone

Methods

Tricks

Tools

Quick Start

Train a model

Inference with pretrained models

Predict large image with pretrained models

Owner

Clark He

PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation (TPAMI).

Code for our paper "MG-GAN: A Multi-Generator Model Preventing Out-of-Distribution Samples in Pedestrian Trajectory Prediction" published at ICCV 2021.

MediaPipe is a an open-source framework from Google for building multimodal

KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021

Classifying cat and dog images using Kaggle dataset

exponential adaptive pooling for PyTorch

PyTorch implementation of TSception V2 using DEAP dataset

High performance distributed framework for training deep learning recommendation models based on PyTorch.

Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.

This is the official implementation of VaxNeRF (Voxel-Accelearated NeRF).

Implementation of a protein autoregressive language model, but with autoregressive infilling objective (editing subsequences capability)

ObjectDrawer-ToolBox: a graphical image annotation tool to generate ground plane masks for a 3D object reconstruction system

smc.covid is an R package related to the paper A sequential Monte Carlo approach to estimate a time varying reproduction number in infectious disease models: the COVID-19 case by Storvik et al

Python implementation of Wu et al (2018)'s registration fusion

Official PyTorch Implementation of Learning Architectures for Binary Networks

TorchX: A PyTorch Extension Library for More Efficient Deep Learning

Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability

Example of a Quantum LSTM