[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution

Last update: Dec 28, 2022

Related tags

Overview

TTSR

Official PyTorch implementation of the paper Learning Texture Transformer Network for Image Super-Resolution accepted in CVPR 2020.

Introduction
Requirements and dependencies
Model
Quick test
Dataset prepare
Evaluation
Train
Citation
Contact

Introduction

We proposed an approach named TTSR for RefSR task. Compared to SISR, RefSR has an extra high-resolution reference image whose textures can be utilized to help super-resolve low-resolution input.

Contribution

We are one of the first to introduce the transformer architecture into image generation tasks. More specifically, we propose a texture transformer with four closely-related modules for image SR which achieves significant improvements over SOTA approaches.
We propose a novel cross-scale feature integration module for image generation tasks which enables our approach to learn a more powerful feature representation by stacking multiple texture transformers.

Approach overview

Main results

Requirements and dependencies

python 3.7 (recommend to use Anaconda)
python packages: pip install opencv-python imageio
pytorch >= 1.1.0
torchvision >= 0.4.0

Model

Pre-trained models can be downloaded from onedrive, baidu cloud(0u6i), google drive.

TTSR-rec.pt: trained with only reconstruction loss
TTSR.pt: trained with all losses

Quick test

Clone this github repo

git clone https://github.com/FuzhiYang/TTSR.git
cd TTSR

Download pre-trained models and modify "model_path" in test.sh
Run test

sh test.sh

The results are in "save_dir" (default: ./test/demo/output)

Dataset prepare

Download CUFED train set and CUFED test set
Make dataset structure be:

CUFED
- train
  - input
  - ref
- test
  - CUFED5

Evaluation

Prepare CUFED dataset and modify "dataset_dir" in eval.sh
Download pre-trained models and modify "model_path" in eval.sh
Run evaluation

sh eval.sh

The results are in "save_dir" (default: ./eval/CUFED/TTSR)

Train

Prepare CUFED dataset and modify "dataset_dir" in train.sh
Run training

sh train.sh

The training results are in "save_dir" (default: ./train/CUFED/TTSR)

Citation

@InProceedings{yang2020learning,
author = {Yang, Fuzhi and Yang, Huan and Fu, Jianlong and Lu, Hongtao and Guo, Baining},
title = {Learning Texture Transformer Network for Image Super-Resolution},
booktitle = {CVPR},
year = {2020},
month = {June}
}

Contact

If you meet any problems, please describe them in issues or contact:

Fuzhi Yang: [email protected]

[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution

Related tags

Overview

TTSR

Contents

Introduction

Contribution

Approach overview

Main results

Requirements and dependencies

Model

Quick test

Dataset prepare

Evaluation

Train

Citation

Contact

Owner

Multimedia Research

NAACL2021 - COIL Contextualized Lexical Retriever

Agent-based model simulator for air quality and pandemic risk assessment in architectural spaces

A Python library for working with arbitrary-dimension hypercomplex numbers following the Cayley-Dickson construction of algebras.

This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".

[EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations

Tensorflow2 Keras-based Semantic Segmentation Models Implementation

Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

EZ graph is an easy to use AI solution that allows you to make and train your neural networks without a single line of code.

DockStream: A Docking Wrapper to Enhance De Novo Molecular Design

Le dataset des images du projet d'IA de 2021

RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Marks Lab at Harvard.

Self-driving car env with PPO algorithm from stable baseline3

Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Repo for our ICML21 paper Unsupervised Learning of Visual 3D Keypoints for Control

🔮 Execution time predictions for deep neural network training iterations across different GPUs.

2.86% and 15.85% on CIFAR-10 and CIFAR-100

A new test set for ImageNet

Neural Nano-Optics for High-quality Thin Lens Imaging