Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Last update: Oct 12, 2022

Deep 3D Mask Volume for View Synthesis of Dynamic Scenes

Kai-En Lin¹, Lei Xiao², Feng Liu², Guowei Yang¹, Ravi Ramamoorthi¹

¹University of California, San Diego, ²Facebook Reality Labs

Requirements

Install required packages

Make sure you have up-to-date NVIDIA drivers supporting CUDA 11.1 (CUDA 10.2 may also work, but you will need to change the cudatoolkit package accordingly).

Run

conda env create -f environment.yml
conda activate video_viewsynth
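
After activating the environment, it can help to confirm that PyTorch is importable and can see the GPU. The following is a minimal sketch, not part of the repository: `cuda_report` is a hypothetical helper name, and it relies only on the standard `torch.cuda.is_available()` call and the `torch.version.cuda` attribute.

```python
import importlib.util


def cuda_report():
    """Summarize whether PyTorch is importable and whether it can see a GPU."""
    report = {"torch_installed": False, "cuda_available": False, "cuda_version": None}
    if importlib.util.find_spec("torch") is None:
        # Environment not activated, or the install did not complete.
        return report
    import torch

    report["torch_installed"] = True
    report["cuda_available"] = torch.cuda.is_available()
    # CUDA version PyTorch was built against (None for CPU-only builds).
    report["cuda_version"] = torch.version.cuda
    return report


if __name__ == "__main__":
    for key, value in cuda_report().items():
        print(f"{key}: {value}")
```

If `cuda_available` is False while `torch_installed` is True, the driver or the cudatoolkit version is the likely place to look.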

Usage

Rendering

1. Download our pretrained checkpoint and testing data. Extract the content to [path_to_data_directory]. It contains the frames and background folders, as well as poses_bounds.npy.
2. In configs, set up the data paths by editing render_video.txt:
   - root_dir should point to the frames folder from step 1, and bg_dir should point to the background folder.
   - out_dir can be your desired output folder.
   - ckpt_path should be the pretrained checkpoint path.
3. Run python render_llff_video.py --config [config_file_path]

   e.g. python render_llff_video.py --config ../configs/render_video.txt
4. (Optional) For your own data, please run prepare_data.sh

   sh render.sh [frame_folder] [starting_frame] [ending_frame] [output_folder_name]

Make sure your data is in this structure before running
```
[frame_folder] --- cam00 --- 00000.jpg
                |         |- 00001.jpg
                |         ...
                |- cam01
                |- cam02
                ...
                |- poses_bounds.npy
```
e.g. sh render.sh ~/deep_3d_data/frames 0 20 qual
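
Before running the script on your own data, the folder structure above can be checked with a short script. This is a sketch under stated assumptions rather than part of the repository: `check_frame_folder` is a hypothetical helper, frame files are assumed to be five-digit zero-padded `.jpg` names as in the diagram, and the ending frame is assumed to be inclusive.

```python
from pathlib import Path


def check_frame_folder(frame_folder, start_frame, end_frame):
    """Return a list of problems found in the expected layout (empty if none)."""
    root = Path(frame_folder)
    problems = []
    if not (root / "poses_bounds.npy").is_file():
        problems.append("missing poses_bounds.npy")
    # Camera folders are named cam00, cam01, ... in the diagram above.
    cam_dirs = sorted(p for p in root.glob("cam*") if p.is_dir())
    if not cam_dirs:
        problems.append("no cam* folders found")
    for cam in cam_dirs:
        # Assumption: end_frame is inclusive.
        for idx in range(start_frame, end_frame + 1):
            frame = cam / f"{idx:05d}.jpg"
            if not frame.is_file():
                problems.append(f"missing {cam.name}/{frame.name}")
    return problems
```

For the example above, `check_frame_folder(Path.home() / "deep_3d_data" / "frames", 0, 20)` should return an empty list when every camera has frames 00000.jpg through 00020.jpg.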

Training

Train MPI

1. Download the RealEstate10K dataset and extract the frames. The scripts in the preprocessing folder can be used to generate the data.

   Run them in this order: download_data.py -> extract_frames.py -> compress_data.py.

   Remember to change the path in compress_data.py.
2. Change the paths in the config file train_realestate10k.txt
3. Run

cd train_mpi
python train.py --config ../configs/train_realestate10k.txt
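
The preprocessing order described for the RealEstate10K data can be captured in a small driver so the three scripts always run in sequence. This is only a sketch: `build_commands` and `run_in_order` are hypothetical helpers, the folder is assumed to be named `preprocessing`, and the scripts' command-line arguments are not documented here, so none are assumed (pass your own through `extra_args`).

```python
import subprocess
import sys
from pathlib import Path

# Order stated above: download -> extract frames -> compress.
PREPROCESSING_ORDER = ["download_data.py", "extract_frames.py", "compress_data.py"]


def build_commands(preprocessing_dir="preprocessing", extra_args=None):
    """Build one command per script, in the required order.

    extra_args maps a script name to a list of CLI arguments; by default
    no arguments are passed because the real ones are not known here.
    """
    extra_args = extra_args or {}
    commands = []
    for script in PREPROCESSING_ORDER:
        cmd = [sys.executable, str(Path(preprocessing_dir) / script)]
        cmd += extra_args.get(script, [])
        commands.append(cmd)
    return commands


def run_in_order(commands, dry_run=True):
    """Print each command; unless dry_run is set, also run it and stop on failure."""
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError if a script exits non-zero.
            subprocess.run(cmd, check=True)
```

Calling `run_in_order(build_commands())` prints the three commands without executing them; pass `dry_run=False` once the paths inside the scripts have been set.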

Train Mask

Once the MPI model is trained, we can use its checkpoint to train the 3D mask network.

1. Download the dataset
2. Change the paths in the config file train_mask.txt
3. Run

cd train_mask
python train.py --config ../configs/train_mask.txt
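
Since each run depends on paths set by hand in a config file, a quick existence check before launching can save a failed start. The sketch below is an illustration under assumptions, not the repository's own loader: it assumes the config files use plain `key = value` lines, and `parse_config` and `missing_paths` are hypothetical helpers. The key names you pass in should be whatever your config file actually contains.

```python
from pathlib import Path


def parse_config(text):
    """Parse 'key = value' lines, skipping blanks and '#' comments (assumed format)."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        values[key.strip()] = value.strip()
    return values


def missing_paths(config, keys):
    """Return the keys that are absent from config or whose path does not exist."""
    return [k for k in keys if k not in config or not Path(config[k]).exists()]
```

For example, `missing_paths(parse_config(Path("../configs/render_video.txt").read_text()), ["root_dir", "bg_dir", "ckpt_path"])` lists the rendering paths that still need fixing. An output folder that has not been created yet would also be reported, so leave such keys out of the check.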

Citation

@inproceedings{lin2021deep,
    title = {Deep 3D Mask Volume for View Synthesis of Dynamic Scenes},
    author = {Kai-En Lin and Lei Xiao and Feng Liu and Guowei Yang and Ravi Ramamoorthi},
    booktitle = {ICCV},
    year = {2021},
}
