PyTorch implementation of PSPNet

Last update: Nov 16, 2022

Overview

PSPNet with PyTorch

Unofficial implementation of "Pyramid Scene Parsing Network" (https://arxiv.org/abs/1612.01105). This repository is just for caffe to pytorch model conversion and evaluation.

Requirements

pytorch
click
addict
pydensecrf
protobuf

Preparation

Instead of building the author's caffe implementation, you can convert off-the-shelf caffemodels to pytorch models via the caffe.proto.

1. Compile the `caffe.proto` for Python API

This step can be skipped. FYI.
Download the author's caffe.proto into the libs, not the one in the original caffe.

# For protoc command
pip install protobuf
# This generates ./caffe_pb2.py
protoc --python_out=. caffe.proto

2. Model conversion

Find the caffemodels on the author's page (e.g. pspnet50_ADE20K.caffemodel) and store them to the data/models/ directory.
Convert the caffemodels to .pth file.

python convert.py -c <PATH TO YAML>

Demo

python demo.py -c <PATH TO YAML> -i <PATH TO IMAGE>

With a --no-cuda option, this runs on CPU.
With a --crf option, you can perform a CRF postprocessing.

Evaluation

PASCAL VOC2012 only. Please set the dataset path in config/voc12.yaml.

python eval.py -c config/voc12.yaml

88.1% mIoU (SS) and 88.6% mIoU (MS) on validation set.
NOTE: 3 points lower than caffe implementation. WIP

SS: averaged prediction with flipping (2x)
MS: averaged prediction with multi-scaling (6x) and flipping (2x)
Both: No CRF post-processing

References

Official implementation: https://github.com/hszhao/PSPNet
Chainer implementation: https://github.com/mitmul/chainer-pspnet

PyTorch implementation of PSPNet

Related tags

Overview

PSPNet with PyTorch

Requirements

Preparation

1. Compile the `caffe.proto` for Python API

2. Model conversion

Demo

Evaluation

References

Owner

Kazuto Nakashima

Code in conjunction with the publication 'Contrastive Representation Learning for Hand Shape Estimation'

[IEEE Transactions on Computational Imaging] Self-Gated Memory Recurrent Network for Efficient Scalable HDR Deghosting

Repository features UNet inspired architecture used for segmenting lungs on chest X-Ray images

PyTorch implementation of NIPS 2017 paper Dynamic Routing Between Capsules

OpenMMLab Pose Estimation Toolbox and Benchmark.

Joint Learning of 3D Shape Retrieval and Deformation, CVPR 2021

Source code for The Power of Many: A Physarum Swarm Steiner Tree Algorithm

MADT: Offline Pre-trained Multi-Agent Decision Transformer

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Extreme Lightwegith Portrait Segmentation

SAMO: Streaming Architecture Mapping Optimisation

A multi-entity Transformer for multi-agent spatiotemporal modeling.

Adjust Decision Boundary for Class Imbalanced Learning

PN-Net a neural field-based framework for depth estimation from single-view RGB images.

Multiple Object Extraction from Aerial Imagery with Convolutional Neural Networks

Simple converter for deploying Stable-Baselines3 model to TFLite and/or Coral

Arabic Car License Recognition. A solution to the kaggle competition Machathon 3.0.

Application of K-means algorithm on a music dataset after a dimensionality reduction with PCA

DeepStochlog Package For Python

JORLDY an open-source Reinforcement Learning (RL) framework provided by KakaoEnterprise

PyTorch implementation of PSPNet

Related tags

Overview

PSPNet with PyTorch

Requirements

Preparation

1. Compile the caffe.proto for Python API

2. Model conversion

Demo

Evaluation

References

Owner

Kazuto Nakashima

Code in conjunction with the publication 'Contrastive Representation Learning for Hand Shape Estimation'

[IEEE Transactions on Computational Imaging] Self-Gated Memory Recurrent Network for Efficient Scalable HDR Deghosting

Repository features UNet inspired architecture used for segmenting lungs on chest X-Ray images

PyTorch implementation of NIPS 2017 paper Dynamic Routing Between Capsules

OpenMMLab Pose Estimation Toolbox and Benchmark.

Joint Learning of 3D Shape Retrieval and Deformation, CVPR 2021

Source code for The Power of Many: A Physarum Swarm Steiner Tree Algorithm

MADT: Offline Pre-trained Multi-Agent Decision Transformer

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Extreme Lightwegith Portrait Segmentation

SAMO: Streaming Architecture Mapping Optimisation

A multi-entity Transformer for multi-agent spatiotemporal modeling.

Adjust Decision Boundary for Class Imbalanced Learning

PN-Net a neural field-based framework for depth estimation from single-view RGB images.

Multiple Object Extraction from Aerial Imagery with Convolutional Neural Networks

Simple converter for deploying Stable-Baselines3 model to TFLite and/or Coral

Arabic Car License Recognition. A solution to the kaggle competition Machathon 3.0.

Application of K-means algorithm on a music dataset after a dimensionality reduction with PCA

DeepStochlog Package For Python

JORLDY an open-source Reinforcement Learning (RL) framework provided by KakaoEnterprise

1. Compile the `caffe.proto` for Python API