PyTorch implementation of SimSiam: Exploring Simple Siamese Representation Learning

Last update: Dec 30, 2022

SimSiam: Exploring Simple Siamese Representation Learning

This is a PyTorch implementation of the SimSiam paper:

@Article{chen2020simsiam,
  author  = {Xinlei Chen and Kaiming He},
  title   = {Exploring Simple Siamese Representation Learning},
  journal = {arXiv preprint arXiv:2011.10566},
  year    = {2020},
}

Preparation

Install PyTorch and download the ImageNet dataset following the official PyTorch ImageNet training code. As with MoCo, this release makes only minimal modifications to that code for both unsupervised pre-training and linear classification.

In addition, install apex for the LARS implementation needed for linear classification.

Unsupervised Pre-Training

Only multi-GPU DistributedDataParallel training is supported; single-GPU and DataParallel training are not.

To do unsupervised pre-training of a ResNet-50 model on ImageNet on an 8-GPU machine, run:

python main_simsiam.py \
  -a resnet50 \
  --dist-url 'tcp://localhost:10001' --multiprocessing-distributed --world-size 1 --rank 0 \
  --fix-pred-lr \
  [your imagenet-folder with train and val folders]
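
The flags in the command above describe nodes, not GPUs: `--world-size 1 --rank 0` means one machine with node rank 0, and `--multiprocessing-distributed` starts one process per GPU on that machine. The following is a minimal sketch of that bookkeeping, assuming it works the way the official PyTorch ImageNet training code does. The function name `expand_distributed_args` and the dictionary keys are hypothetical, and the batch size of 512 is taken from the first row of the table under Models and Logs.

```python
def expand_distributed_args(num_nodes, node_rank, gpus_per_node, total_batch_size):
    """Return one settings dict per GPU process on this node (illustrative only)."""
    # The effective world size counts processes (one per GPU), not machines.
    world_size = num_nodes * gpus_per_node
    # The batch size given on the command line is split across this node's GPUs.
    per_gpu_batch = total_batch_size // gpus_per_node
    procs = []
    for local_gpu in range(gpus_per_node):
        procs.append({
            # Global rank is unique across all processes on all nodes.
            "global_rank": node_rank * gpus_per_node + local_gpu,
            "local_gpu": local_gpu,
            "world_size": world_size,
            "batch_size": per_gpu_batch,
        })
    return procs


procs = expand_distributed_args(
    num_nodes=1, node_rank=0, gpus_per_node=8, total_batch_size=512
)
for p in procs:
    print(p)
```

Under these assumptions, an 8-GPU machine ends up with eight processes with global ranks 0 to 7, each handling 64 images per step.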

The script uses all the default hyper-parameters as described in the paper, and uses the default augmentation recipe from MoCo v2.

The above command performs pre-training with a non-decaying predictor learning rate for 100 epochs, corresponding to the last row of Table 1 in the paper.
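
A "non-decaying predictor learning rate" can be pictured as two optimizer parameter groups: the encoder follows a decay schedule while the predictor stays at its initial value. The sketch below assumes a cosine schedule for the encoder and uses plain dictionaries in place of real optimizer parameter groups, so it runs without PyTorch. The `fix_lr` key, the group names, and the initial learning rate of 0.1 are illustrative assumptions, not values read from the script.

```python
import math


def adjust_learning_rate(param_groups, init_lr, epoch, total_epochs):
    """Cosine-decay the learning rate, except for groups flagged with fix_lr."""
    cur_lr = init_lr * 0.5 * (1.0 + math.cos(math.pi * epoch / total_epochs))
    for group in param_groups:
        # Groups marked fix_lr (the predictor here) keep the initial rate.
        group["lr"] = init_lr if group.get("fix_lr", False) else cur_lr
    return param_groups


# Hypothetical stand-ins for optimizer.param_groups.
groups = [
    {"name": "encoder", "fix_lr": False},
    {"name": "predictor", "fix_lr": True},
]

for epoch in (0, 50, 99):
    adjust_learning_rate(groups, init_lr=0.1, epoch=epoch, total_epochs=100)
    print(epoch, {g["name"]: round(g["lr"], 4) for g in groups})
```

With a real optimizer, the same loop would run over `optimizer.param_groups`, which are dictionaries that accept extra keys such as `fix_lr`.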

Linear Classification

With a pre-trained model, to train a supervised linear classifier on frozen features/weights on an 8-GPU machine, run:

python main_lincls.py \
  -a resnet50 \
  --dist-url 'tcp://localhost:10001' --multiprocessing-distributed --world-size 1 --rank 0 \
  --pretrained [your checkpoint path]/checkpoint_0099.pth.tar \
  --lars \
  [your imagenet-folder with train and val folders]

The above command uses the LARS optimizer and a default batch size of 4096.
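
Training a classifier on frozen features means only the backbone weights of the pre-trained checkpoint are reused; the projection head and the predictor are dropped and a new linear layer is trained on top. The following is a rough sketch of that filtering step on a checkpoint's state dict. The key prefix `module.encoder.`, the head name `fc`, and the function name `extract_encoder_weights` are assumptions for illustration; the keys in a real checkpoint may differ.

```python
def extract_encoder_weights(state_dict, prefix="module.encoder.", head="fc"):
    """Keep backbone entries only and strip the wrapper prefix (illustrative)."""
    backbone = {}
    for key, value in state_dict.items():
        is_encoder = key.startswith(prefix)
        is_head = key.startswith(prefix + head)
        if is_encoder and not is_head:
            # Drop the prefix so keys match a plain ResNet-style model.
            backbone[key[len(prefix):]] = value
    return backbone


# Toy checkpoint with string placeholders instead of tensors.
checkpoint = {
    "module.encoder.conv1.weight": "w0",
    "module.encoder.layer1.0.conv1.weight": "w1",
    "module.encoder.fc.0.weight": "projection-head",
    "module.predictor.0.weight": "predictor",
}

backbone = extract_encoder_weights(checkpoint)
print(sorted(backbone))
```

The filtered dictionary could then be loaded into a fresh ResNet-50 whose final layer is left newly initialized and is the only part that receives gradients.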

Models and Logs

Our pre-trained ResNet-50 models and logs:

pre-train epochs	batch size	pre-train ckpt	pre-train log	linear cls. ckpt	linear cls. log	top-1 acc.
100	512	link	link	link	link	68.1
100	256	link	link	link	link	68.3

Settings for the above: 8 NVIDIA V100 GPUs, CUDA 10.1/cuDNN 7.6.5, PyTorch 1.7.0.

Transferring to Object Detection

Object detection transfer is the same as for MoCo; please see moco/detection.

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

Owner

Facebook Research
