Dual Path Learning for Domain Adaptation of Semantic Segmentation

Official PyTorch implementation of "Dual Path Learning for Domain Adaptation of Semantic Segmentation".

Accepted by ICCV 2021. Paper

Requirements

Pytorch 3.6
torch==1.5
torchvision==0.6
Pillow==7.1.2

Dataset Preparations

For GTA5->Cityscapes scenario, download:

Source dataset: GTA5 Dataset
Target dataset: Cityscapes Dataset

For further evaluation on SYNTHIA->Cityscapes scenario, download:

Source dataset: SYNTHIA dataset (SYNTHIA-RAND-CITYSCAPES)

The folder should be structured as:

|DPL
|—— DPL_master/
|—— CycleGAN_DPL/
|—— data/
│   ├—— Cityscapes/  
|   |   ├—— data/
|   |       ├—— gtFine/
|   |       ├—— leftImg8bit/
│   ├—— GTA5/
|   |   ├—— images/
|   |   ├—— labels/
|   |   ├—— ...
│   ├—— synthia/ 
|   |   ├—— RGB/
|   |   ├—— GT/
|   |   ├—— Depth/
|   |   ├—— ...

Evaluation

Download pre-trained models from Pretrained_Resnet_GTA5 [Google_Drive, BaiduYun(Code:t7t8)] and save the unzipped models in ./DPL_master/DPL_pretrained, download translated target images from DPI2I_City2GTA_Resnet [Google_Drive, BaiduYun(Code:cf5a)] and save the unzipped images in ./DPL_master/DPI2I_images/DPI2I_City2GTA_Resnet/val. Then you can evaluate DPL and DPL-Dual as following:

Evaluation of DPL

cd DPL_master
python evaluation.py --init-weights ./DPL_pretrained/Resnet_GTA5_DPLst4_T.pth --save path_to_DPL_results/results --log-dir path_to_DPL_results

Evaluation of DPL-Dual

python evaluation_DPL.py --data-dir-targetB ./DPI2I_images/DPI2I_City2GTA_Resnet --init-weights_S ./DPL_pretrained/Resnet_GTA5_DPLst4_S.pth --init-weights_T ./DPL_pretrained/Resnet_GTA5_DPLst4_T.pth --save path_to_DPL_dual_results/results --log-dir path_to_DPL_dual_results

More pretrained models and translated target images on other settings can be downloaded from:

GTA5->Cityscapes, FCN-8s with VGG16: GTA5_VGG_chpt [Google_Drive, BaiduYun(Code:fanp)]
SYNTHIA->Cityscapes, DeepLab-V2 with ResNet-101: SYN_Resnet_chpt [Google_Drive, BaiduYun(Code:drvo)]
SYNTHIA->Cityscapes, FCN-8s with VGG16: SYN_VGG_chpt [Google_Drive, BaiduYun(Code:9vio)]

Training

The training process of DPL consists of two phases: single-path warm-up and DPL training. The training example is given on default setting: GTA5->Cityscapes, DeepLab-V2 with ResNet-101.

Quick start for DPL training

Downlad pretrained and [Google_Drive, BaiduYun(Code: 3ndm)], save to path_to_model_S, save to path_to_model_T, then you can train DPL as following:

Train dual path image generation module.

cd ../CycleGAN_DPL
python train.py --dataroot ../data --name dual_path_I2I --A_setroot GTA5/images --B_setroot Cityscapes/leftImg8bit/train --model cycle_diff --lambda_semantic 1 --init_weights_S path_to_model_S --init_weights_T path_to_model_T

Generate transferred images with dual path image generation module.

Generate transferred GTA5->Cityscapes images.

python test.py --name dual_path_I2I --no_dropout --load_size 1024 --crop_size 1024 --preprocess scale_width --dataroot ../data/GTA5/images --model_suffix A  --results_dir DPI2I_path_to_GTA52cityscapes

Generate transferred Cityscapes->GTA5 images.

 python test.py --name dual_path_I2I --no_dropout --load_size 1024 --crop_size 1024 --preprocess scale_width --dataroot ../data/Cityscapes/leftImg8bit/train --model_suffix B  --results_dir DPI2I_path_to_cityscapes2GTA5/train
 
 python test.py --name dual_path_I2I --no_dropout --load_size 1024 --crop_size 1024 --preprocess scale_width --dataroot ../data/Cityscapes/leftImg8bit/val --model_suffix B  --results_dir DPI2I_path_to_cityscapes2GTA5/val

Train dual path adaptive segmentation module

3.1. Generate dual path pseudo label.

cd ../DPL_master
python DP_SSL.py --save path_to_dual_pseudo_label_stepi --init-weights_S path_to_model_S --init-weights_T path_to_model_T --thresh 0.9 --threshlen 0.3 --data-list-target ./dataset/cityscapes_list/train.txt --set train --data-dir-targetB DPI2I_path_to_cityscapes2GTA5 --alpha 0.5

3.2. Train and with dual path pseudo label respectively.

python DPL.py --snapshot-dir snapshots/DPL_modelS_step_i --data-dir-target DPI2I_path_to_cityscapes2GTA5 --data-label-folder-target path_to_dual_pseudo_label_stepi --init-weights path_to_model_S --domain S

python DPL.py --snapshot-dir snapshots/DPL_modelT_step_i --data-dir DPI2I_path_to_GTA52cityscapes --data-label-folder-target path_to_dual_pseudo_label_stepi --init-weights path_to_model_T

3.3. Update path_to_model_Swith path to best model, update path_to_model_Twith path to best model, adjust parameter threshenlen to 0.25, then repeat 3.1-3.2 for 3 more rounds.

Single path warm up

If you want to train DPL from the very begining, training example of single path warm up is also provided as below:

Single Path Warm-up

Download trained with labeled source dataset Source_only [Google_Drive, BaiduYun(Code:fjdw)].

Train original cycleGAN (without Dual Path Image Translation).

cd CycleGAN_DPL
python train.py --dataroot ../data --name ori_cycle --A_setroot GTA5/images --B_setroot Cityscapes/leftImg8bit/train --model cycle_diff --lambda_semantic 0

Generate transferred GTA5->Cityscapes images with original cycleGAN.

python test.py --name ori_cycle --no_dropout --load_size 1024 --crop_size 1024 --preprocess scale_width --dataroot ../data/GTA5/images --model_suffix A  --results_dir path_to_ori_cycle_GTA52cityscapes

Before warm up, pretrain without SSL and restore the best checkpoint in path_to_pretrained_T:

cd ../DPL_master
python DPL.py --snapshot-dir snapshots/pretrain_T --init-weights path_to_initialization_S --data-dir path_to_ori_cycle_GTA52cityscapes

Warm up .

4.1. Generate labels on source dataset with label correction.

python SSL_source.py --set train --data-dir path_to_ori_cycle_GTA52cityscapes --init-weights path_to_pretrained_T --threshdelta 0.3 --thresh 0.9 --threshlen 0.65 --save path_to_corrected_label_step1_or_step2

4.2. Generate pseudo labels on target dataset.

python SSL.py --set train --data-list-target ./dataset/cityscapes_list/train.txt --init-weights path_to_pretrained_T  --thresh 0.9 --threshlen 0.65 --save path_to_pseudo_label_step1_or_step2

4.3. Train with label correction.

python DPL.py --snapshot-dir snapshots/label_corr_step1_or_step2 --data-dir path_to_ori_cycle_GTA52cityscapes --source-ssl True --source-label-dir path_to_corrected_label_step1_or_step2 --data-label-folder-target path_to_pseudo_label_step1_or_step2 --init-weights path_to_pretrained_T

4.4 Update path_to_pretrained_T with path to best model in 4.3, repeat 4.1-4.3 for one more round.

More Experiments

For SYNTHIA to Cityscapes scenario, please train DPL with "--source synthia" and change the data path.
For training on "FCN-8s with VGG16", please train DPL with "--model VGG".

Citation

If you find our paper and code useful in your research, please consider giving a star and citation.

@inproceedings{cheng2021dual,
  title={Dual Path Learning for Domain Adaptation of Semantic Segmentation},
  author={Cheng, Yiting and Wei, Fangyun and Bao, Jianmin and Chen, Dong and Wen, Fang and Zhang, Wenqiang},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={9082--9091},
  year={2021}
}

Acknowledgment

This code is heavily borrowed from BDL.

Official PyTorch implementation of "Dual Path Learning for Domain Adaptation of Semantic Segmentation".

Related tags

Overview

Dual Path Learning for Domain Adaptation of Semantic Segmentation

Requirements

Dataset Preparations

Evaluation

Training

Quick start for DPL training

Single path warm up

More Experiments

Citation

Acknowledgment

Owner

BiNE: Bipartite Network Embedding

UniSpeech - Large Scale Self-Supervised Learning for Speech

A Japanese tokenizer based on recurrent neural networks

Honor's thesis project analyzing whether the GPT-2 model can more effectively generate free-verse or structured poetry.

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Bot to connect a real Telegram user, simulating responses with OpenAI's davinci GPT-3 model.

Extract rooms type, door, neibour rooms, rooms corners nad bounding boxes, and generate graph from rplan dataset

基于百度的语音识别，用python实现，pyaudio+pyqt

Beyond Paragraphs: NLP for Long Sequences

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

BERT score for text generation

Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form.

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers

CorNet Correlation Networks for Extreme Multi-label Text Classification

Tools, wrappers, etc... for data science with a concentration on text processing

Bu Chatbot, Konya Bilim Merkezi Yen için tasarlanmış olan bir projedir.

edge-SR: Super-Resolution For The Masses

Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.

Geometry-Consistent Neural Shape Representation with Implicit Displacement Fields