[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Last update: Dec 12, 2022

Related tags

Overview

Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Code for Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion. To acquire dataset, please contact [email protected].

Introduction

We proposed a unified network called CorrFusionNet for scene change detection. The proposed CorrFusionNet firstly extracts the features of the bi-temporal inputs with deep convolutional networks. Then the extracted features will be projected into a lower dimension space to computed the instance level canonical correlation. The cross-temporal fusion will be performed based on the computed correlation in the CorrFusion module. The final scene classification and scene change results are obtained with softmax activation layers. In the objective function, we introduced a new formulation for calculating the temporal correlation. The visual results and quantitative assessments both demonstrated that our proposed CorrFusionNet could outperform other scene change detection methods and some state-of-the-art methods for image classification.

CorrFusion Module

The proposed CorrFusion module:

The proposed CorrFusionNet:

Requirements

scipy==1.1.0
matplotlib==3.0.3
h5py==2.8.0
numpy==1.16.3
tensorflow_gpu==1.8.0
Pillow==6.2.1
scikit_learn==0.21.3

Data

Overview of our Wuhan dataset

The images are stored in npz format.

├─trn
│      0-5000.npz
│      10000-15000.npz
│      15000-16488.npz
│      5000-10000.npz
│
├─tst
│      0-4712.npz
│
└─val
       0-2355.npz

Usage

Install the requirements

pip install -r requirements.txt

Run the training code

python train_cnn.py [-h] [-g GPU] [-b BATCH_SIZE] [-e EPOCHES]
                    [-n NUM_CLASSES] [-tb USE_TFBOARD] [-sm SAVE_MODEL]
                    [-log SAVE_LOG] [-trn TRN_DIR] [-tst TST_DIR]
                    [-val VAL_DIR] [-lpath LOG_PATH] [-mpath MODEL_PATH]
                    [-tbpath TB_PATH] [-rpath RESULT_PATH]

(see parser.py)

Evaluate on a trained model:

Download a trained model here.
Evaluation

python evaluate_model.py [-h] [-g GPU] [-m MODEL_DIR] [-tst TST_DIR]
                         [-val VAL_DIR]

optional arguments:
  -h, --help            show this help message and exit
  -g GPU, --gpu GPU     gpu device ID
  -m MODEL_DIR, --model_dir MODEL_DIR
                        model directory
  -tst TST_DIR, --tst_dir TST_DIR
                        testing file dir
  -val VAL_DIR, --val_dir VAL_DIR
                        validation file dir

Results

The results of quantitative assessments:

Predictions on our dataset:

Contact

For any questions, you're welcomed to contact Lixiang Ru.

[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Related tags

Overview

Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Introduction

CorrFusion Module

Requirements

Data

Usage

Install the requirements

Run the training code

Evaluate on a trained model:

Results

Contact

Owner

Lixiang Ru

CVPR 2021 - Official code repository for the paper: On Self-Contact and Human Pose.

An Extendible (General) Continual Learning Framework based on Pytorch - official codebase of Dark Experience for General Continual Learning

Generate indoor scenes with Transformers

Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning

Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation"

Data for "Driving the Herd: Search Engines as Content Influencers" paper

Project of 'TBEFN: A Two-branch Exposure-fusion Network for Low-light Image Enhancement '

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

This is the official implementation of 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection, built on SECOND.

Some tentative models that incorporate label propagation to graph neural networks for graph representation learning in nodes, links or graphs.

[CVPR'21] DeepSurfels: Learning Online Appearance Fusion

StyleGAN of All Trades: Image Manipulation withOnly Pretrained StyleGAN

Square Root Bundle Adjustment for Large-Scale Reconstruction

Audio Source Separation is the process of separating a mixture into isolated sounds from individual sources

Multiple-criteria decision-making (MCDM) with Electre, Promethee, Weighted Sum and Pareto

A pytorch implementation of Reading Wikipedia to Answer Open-Domain Questions.

The repository contains source code and models to use PixelNet architecture used for various pixel-level tasks. More details can be accessed at .

Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian (CVPR 2022)

Codes for our paper The Stem Cell Hypothesis: Dilemma behind Multi-Task Learning with Transformer Encoders published to EMNLP 2021.

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.