This repo holds code for TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

Last update: Jan 04, 2023

Related tags

Deep Learning TransUNet

Overview

TransUNet

This repo holds code for TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

Usage

1. Download Google pre-trained ViT models

Get models in this link: R50-ViT-B_16, ViT-B_16, ViT-L_16...

wget https://storage.googleapis.com/vit_models/imagenet21k/{MODEL_NAME}.npz &&
mkdir ../model/vit_checkpoint/imagenet21k &&
mv {MODEL_NAME}.npz ../model/vit_checkpoint/imagenet21k/{MODEL_NAME}.npz

2. Prepare data

Please go to "./datasets/README.md" for details, or please send an Email to jienengchen01 AT gmail.com to request the preprocessed data. If you would like to use the preprocessed data, please use it for research purposes and do not redistribute it.

3. Environment

Please prepare an environment with python=3.7, and then use the command "pip install -r requirements.txt" for the dependencies.

4. Train/Test

Run the train script on synapse dataset. The batch size can be reduced to 12 or 6 to save memory (please also decrease the base_lr linearly), and both can reach similar performance.

CUDA_VISIBLE_DEVICES=0 python train.py --dataset Synapse --vit_name R50-ViT-B_16

Run the test script on synapse dataset. It supports testing for both 2D images and 3D volumes.

python test.py --dataset Synapse --vit_name R50-ViT-B_16

Reference

Citations

@article{chen2021transunet,
  title={TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation},
  author={Chen, Jieneng and Lu, Yongyi and Yu, Qihang and Luo, Xiangde and Adeli, Ehsan and Wang, Yan and Lu, Le and Yuille, Alan L., and Zhou, Yuyin},
  journal={arXiv preprint arXiv:2102.04306},
  year={2021}
}

This repo holds code for TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

Related tags

Overview

TransUNet

Usage

1. Download Google pre-trained ViT models

2. Prepare data

3. Environment

4. Train/Test

Reference

Citations

Owner

A data-driven approach to quantify the value of classifiers in a machine learning ensemble.

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection, AAAI 2021.

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives

SMPL-X: A new joint 3D model of the human body, face and hands together

Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)

Offical implementation for "Trash or Treasure? An Interactive Dual-Stream Strategy for Single Image Reflection Separation".

DeepLab2: A TensorFlow Library for Deep Labeling

A dataset for online Arabic calligraphy

Wordplay, an artificial Intelligence based crossword puzzle solver.

TensorFlow CNN for fast style transfer

DROPO: Sim-to-Real Transfer with Offline Domain Randomization

Neural network for recognizing the gender of people in photos

A scikit-learn-compatible module for estimating prediction intervals.

To prepare an image processing model to classify the type of disaster based on the image dataset

Adversarial Color Enhancement: Generating Unrestricted Adversarial Images by Optimizing a Color Filter

Mosaic of Object-centric Images as Scene-centric Images (MosaicOS) for long-tailed object detection and instance segmentation.

A note taker for NVDA. Allows the user to create, edit, view, manage and export notes to different formats.

This is an example of a reproducible modelling project