Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)

Last update: Dec 01, 2022

Overview

Tianyu Wang*, Xin Yang*, Ke Xu, Shaozhe Chen, Qiang Zhang, Rynson W.H. Lau † (* Joint first author. † Rynson Lau is the corresponding author.)

[Arxiv]

Abstract

Removing rain streaks from a single image has been drawing considerable attention as rain streaks can severely degrade the image quality and affect the performance of existing outdoor vision tasks. While recent CNN-based derainers have reported promising performances, deraining remains an open problem for two reasons. First, existing synthesized rain datasets have only limited realism, in terms of modeling real rain characteristics such as rain shape, direction and intensity. Second, there are no public benchmarks for quantitative comparisons on real rain images, which makes the current evaluation less objective. The core challenge is that real world rain/clean image pairs cannot be captured at the same time. In this paper, we address the single image rain removal problem in two ways. First, we propose a semi-automatic method that incorporates temporal priors and human supervision to generate a high-quality clean image from each input sequence of real rain images. Using this method, we construct a large-scale dataset of ∼29.5K rain/rain-free image pairs that cover a wide range of natural rain scenes. Second, to better cover the stochastic distributions of real rain streaks, we propose a novel SPatial Attentive Network (SPANet) to remove rain streaks in a local-to-global manner. Extensive experiments demonstrate that our network performs favorably against the state-of-the-art deraining methods.

Citation

If you use this code or our dataset (including the test set), please cite:

@InProceedings{Wang_2019_CVPR,
  author = {Wang, Tianyu and Yang, Xin and Xu, Ke and Chen, Shaozhe and Zhang, Qiang and Lau, Rynson W.H.},
  title = {Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2019}
}

Dataset

See my personal site

UPDATE: We have released the code for clean image generation. We also provide some synthesized and real video examples for researchers to try. Note that the released implementation uses only 8 threads.
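The idea of recovering a clean image from a sequence of rainy frames of a static scene can be illustrated with a per-pixel temporal median. This is only a simplified sketch and not the released generation code, which also relies on human supervision; the function name `temporal_median_clean` and the toy frames are assumptions made for illustration.

```python
import numpy as np


def temporal_median_clean(frames):
    """Estimate a rain-free image as the per-pixel median over time.

    Assumes a static scene and a fixed camera, so that each pixel is
    covered by a rain streak in only a minority of the frames.
    """
    # Stack frames along a new leading time axis: (T, H, W) or (T, H, W, C).
    stack = np.stack([np.asarray(f, dtype=np.float64) for f in frames], axis=0)
    # The median along time discards the bright outliers caused by streaks.
    return np.median(stack, axis=0)


if __name__ == "__main__":
    # Toy sequence: constant background of 50 with one bright "streak"
    # pixel per frame, placed at a different column in each frame.
    frames = [np.full((3, 3), 50.0) for _ in range(5)]
    for i, frame in enumerate(frames):
        frame[0, i % 3] = 255.0
    print(temporal_median_clean(frames))
```

A median works here because no pixel is hit by a streak in more than half of the frames; real sequences with dense rain or scene motion would need a more careful estimator.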

Requirements

PyTorch == 0.4.1 (1.0.x may not work for training)
cupy (Installation Guide)
opencv-python
TensorBoardX
Python 3.6
progressbar2
scikit-image
ffmpeg >= 4.0.1
python-ffmpeg

Setup

Clone this repo:

$ git clone ...
$ cd SPANet

Train & Test

Train:

Download the dataset (~44 GB) and unpack it into the code folder (see details in Train_Dataset_README.md). Then run:

$ python main.py -a train -m latest

Test:

Download the test dataset (~455 MB) and unpack it into the code folder (see details in Test_Dataset_README.md). Then run:

$ python main.py -a test -m latest

Performance Change

PSNR 38.02 -> 38.53

SSIM 0.9868 -> 0.9875

For better generalization, we stop training at 40K steps.

All PSNR and SSIM results are computed using skimage.measure. Please use the same module when evaluating your own work.
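As a rough illustration of the evaluation, the sketch below computes PSNR directly with NumPy and shows how the scikit-image functions could be looked up. It is not the repository's evaluation script: the helper names `psnr` and `load_skimage_metrics` are hypothetical. Older scikit-image releases expose `compare_psnr` and `compare_ssim` in `skimage.measure`, while newer ones provide `peak_signal_noise_ratio` and `structural_similarity` in `skimage.metrics`, so the lookup tries both.

```python
import numpy as np


def psnr(reference, restored, data_range=255.0):
    """Peak signal-to-noise ratio in dB between two images of equal shape."""
    reference = np.asarray(reference, dtype=np.float64)
    restored = np.asarray(restored, dtype=np.float64)
    mse = np.mean((reference - restored) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(data_range ** 2 / mse))


def load_skimage_metrics():
    """Return (psnr_fn, ssim_fn) from the installed scikit-image, or None."""
    try:
        # Newer scikit-image releases.
        from skimage.metrics import peak_signal_noise_ratio, structural_similarity
        return peak_signal_noise_ratio, structural_similarity
    except ImportError:
        pass
    try:
        # Older releases, matching the skimage.measure module named above.
        from skimage.measure import compare_psnr, compare_ssim
        return compare_psnr, compare_ssim
    except ImportError:
        return None


if __name__ == "__main__":
    clean = np.full((8, 8), 100, dtype=np.uint8)
    derained = clean + 10  # constant error of 10 gray levels, so MSE = 100
    print("PSNR (NumPy):", psnr(clean, derained))
    metrics = load_skimage_metrics()
    if metrics is not None:
        sk_psnr, sk_ssim = metrics
        print("PSNR (skimage):", sk_psnr(clean, derained, data_range=255))
        print("SSIM (skimage):", sk_ssim(clean, derained, data_range=255))
```

Reported numbers depend on details such as the data range and color handling, so results are only comparable when the same settings are used throughout.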

License

Please see the License.txt file.

Acknowledgement

The code borrows from RESCAN by Xia Li. The CUDA extension references pyinn by Sergey Zagoruyko and DSC (CF-Caffe) by Xiaowei Hu. Thanks for sharing!

Contact

E-Mail: [email protected]
