Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Last update: Dec 16, 2022

Overview

SETR - Pytorch

Since the original paper (Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.) has no official code,I implemented SETR-Progressive UPsampling(SETR-PUP) using pytorch.

Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Vit

The Vit model is also implemented, and you can use it for image classification.

Usage SETR

from SETR.transformer_seg import SETRModel
import torch 

if __name__ == "__main__":
    net = SETRModel(patch_size=(32, 32), 
                    in_channels=3, 
                    out_channels=1, 
                    hidden_size=1024, 
                    num_hidden_layers=8, 
                    num_attention_heads=16, 
                    decode_features=[512, 256, 128, 64])
    t1 = torch.rand(1, 3, 256, 256)
    print("input: " + str(t1.shape))
    
    # print(net)
    print("output: " + str(net(t1).shape))

If the output size is (1, 1, 256, 256), the code runs successfully.

Usage Vit

from SETR.transformer_seg import Vit
import torch 

if __name__ == "__main__":
    model = Vit(patch_size=(7, 7), 
                    in_channels=1, 
                    out_class=10, 
                    hidden_size=1024, 
                    num_hidden_layers=1, 
                    num_attention_heads=16)
    print(model)
    t1 = torch.rand(1, 1, 28, 28)
    print("input: " + str(t1.shape))

    print("output: " + str(model(t1).shape))

The output shape is (1, 10).

current examples

task_mnist: The simplest example, using the Vit model to classify the minst dataset.
task_car_seg: The example is sample segmentation task. data download: https://www.kaggle.com/c/carvana-image-masking-challenge/data

More examples will be updated later.

Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Related tags

Overview

SETR - Pytorch

Vit

Usage SETR

Usage Vit

current examples

more

Owner

zhaohu xing

Scalable Multi-Agent Reinforcement Learning

Data reduction pipeline for KOALA on the AAT.

ICCV2021 Expert-Goal Trajectory Prediction

Discord Multi Tool that focuses on design and easy usage

Automatic differentiation with weighted finite-state transducers.

Implementation of ECCV20 paper: the devil is in classification: a simple framework for long-tail object detection and instance segmentation

Differentiable Neural Computers, Sparse Access Memory and Sparse Differentiable Neural Computers, for Pytorch

学习 python3 以来写的一些垃圾玩具……

Pixel-level Crack Detection From Images Of Levee Systems : A Comparative Study

LRBoost is a scikit-learn compatible approach to performing linear residual based stacking/boosting.

Tensorflow implementation of "Learning Deep Features for Discriminative Localization"

Exploiting a Zoo of Checkpoints for Unseen Tasks

Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources

Codebase for Inducing Causal Structure for Interpretable Neural Networks

A free, multiplatform SDK for real-time facial motion capture using blendshapes, and rigid head pose in 3D space from any RGB camera, photo, or video.

Revisting Open World Object Detection

Implementation of Fast Transformer in Pytorch

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

The best solution of the Weather Prediction track in the Yandex Shifts challenge

1st place solution in CCF BDCI 2021 ULSEG challenge