UVO_Challenge

Team Alpes_runner Solutions

This is an official repo for our UVO Challenge solutions for Image/Video-based open-world segmentation. Our team "Alpes_runner" achieved the best performance on both Image/Video-based benchmarks. More details about the workshop can be found here.

Technical Reports

For Track 1: paper
For Track 2: paper

Models

Detection

Model	Pretrained datasets	Finetuned datasets	links
UVO_Detector	COCO	-	config/weights
UVO_Detector	COCO	UVO	config/weights

Segmentation

Model	Pretrained datasets	Finetuned datasets	links
UVO_Segementor	COCO	-	weights
UVO_Segmentor	COCO, PASCAL, OpenImage	-	config/weights
UVO_Segmentor	COCO, PASCAL, OpenImage	UVO	config/weights

Citation

If you find this project useful in your research, please consider cite:

@article{du20211st,
  title={1st Place Solution for the UVO Challenge on Image-based Open-World Segmentation 2021},
  author={Du, Yuming and Guo, Wen and Xiao, Yang and Lepetit, Vincent},
  journal={arXiv preprint arXiv:2110.10239},
  year={2021}
}

@article{du20211st,
  title={1st Place Solution for the UVO Challenge on Video-based Open-World Segmentation 2021},
  author={Du, Yuming and Guo, Wen and Xiao, Yang and Lepetit, Vincent},
  journal={arXiv preprint arXiv:2110.11661},
  year={2021}
}

Contact

Feel free to contact me or open a new issue if you have any questions.

Video-based open-world segmentation

Related tags

Overview

UVO_Challenge

Team Alpes_runner Solutions

Technical Reports

Models

Citation

Contact

Owner

Yuming Du

GEA - Code for Guided Evolution for Neural Architecture Search

Data Augmentation Using Keras and Python

Transparent Transformer Segmentation

Pytorch implementation of SimSiam Architecture

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding (CVPR2022)

Geometric Vector Perceptrons --- a rotation-equivariant GNN for learning from biomolecular structure

HybVIO visual-inertial odometry and SLAM system

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

Code accompanying the paper Shared Independent Component Analysis for Multi-subject Neuroimaging

Instant Real-Time Example-Based Style Transfer to Facial Videos

This repository contains source code for the Situated Interactive Language Grounding (SILG) benchmark

This repository provides an efficient PyTorch-based library for training deep models.

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Pytorch implementation for the paper: Contrastive Learning for Cold-start Recommendation

This tool uses Deep Learning to help you draw and write with your hand and webcam.

Code for 'Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning' (AAAI 2022)

PyTorch implementation of CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition

Spontaneous Facial Micro Expression Recognition using 3D Spatio-Temporal Convolutional Neural Networks

SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

Pytorch implementation of few-shot semantic image synthesis