A study project using the AA-RMVSNet to reconstruct buildings from multiple images

Last update: Oct 17, 2022

Overview

3d-building-reconstruction

This is part of a study project using the AA-RMVSNet to reconstruct buildings from multiple images.

Introduction

It is exciting to connect the 2D world with the 3D world using Multi-view Stereo (MVS) methods. In this project, we aim to reconstruct several buildings on our campus. Since this is outdoor reconstruction, we chose AA-RMVSNet for its strong performance on outdoor datasets, after comparing it with similar models such as CasMVSNet and D2HC-RMVSNet. The code is retrieved from here with some modifications.

Reproduction

Here we summarize the main steps we took in this project. Following them should let you reproduce our results.

Installation

First, you need to create a virtual environment and install the necessary dependencies.

conda create -n test python=3.6
conda activate test
conda install pytorch==1.1.0 torchvision==0.3.0 cudatoolkit=10.0 -c pytorch
conda install -c conda-forge py-opencv plyfile tensorboardx

Other CUDA versions can be found here.
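
After installation, a quick check that the packages can be found may save a failed run later. The following is a minimal sketch using only the standard library. The import names (`torch`, `torchvision`, `cv2`, `plyfile`, `tensorboardX`) are the usual ones for the packages installed above, and `check_modules` is a hypothetical helper that is not part of the repository.

```python
import importlib.util
from typing import Dict, Iterable

# Import names that usually correspond to the conda packages installed above
# (assumption: py-opencv is imported as cv2, tensorboardx as tensorboardX).
REQUIRED = ["torch", "torchvision", "cv2", "plyfile", "tensorboardX"]


def check_modules(names: Iterable[str]) -> Dict[str, bool]:
    """Return {module_name: bool} telling whether each module can be located."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in check_modules(REQUIRED).items():
        print("{:<14} {}".format(name, "ok" if found else "MISSING"))
```

Run it inside the activated environment. It only checks that the modules can be located, not that the CUDA build matches your driver.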

Structure from Motion

Camera parameters are required to run MVSNet-based methods. Please first download the open-source software COLMAP.

The workflow is as follows:

Open COLMAP, then click Reconstruction - Automatic reconstruction to open the reconstruction options.
Select your Workspace folder and Image folder.
(Optional) Uncheck Dense model to speed up the reconstruction.
Click Run.
After the reconstruction completes, you should see the sparse reconstruction result as well as the camera positions. (Fig )
Click File - Export model as text. There should be a cameras.txt in the output folder, in which each line represents a photo. If some photos remain unmatched, delete them and run the matching again. Repeat this process until all the photos are matched.
Move the three .txt files to the sparse folder.
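
Checking by hand which photos were matched gets tedious with many images. The sketch below is one possible way to list the photos that COLMAP did not register, assuming the standard COLMAP text export in which `images.txt` stores two lines per image and the first of them ends with the image file name. The helper names `registered_image_names` and `unregistered` are hypothetical and not part of COLMAP or this repository.

```python
from typing import List, Set


def registered_image_names(images_txt: str) -> List[str]:
    """Extract image file names from the text of a COLMAP images.txt export."""
    # Drop comment lines only. Blank lines must stay, because an image with
    # no 2D points has an empty second line.
    lines = [ln for ln in images_txt.splitlines() if not ln.startswith("#")]
    names = []
    # Each image takes two lines; the first one holds
    # IMAGE_ID QW QX QY QZ TX TY TZ CAMERA_ID NAME
    for header in lines[0::2]:
        fields = header.split(None, 9)
        if len(fields) == 10:
            names.append(fields[9].strip())
    return names


def unregistered(all_images: Set[str], images_txt: str) -> List[str]:
    """Return the sorted names of photos that do not appear in images.txt."""
    return sorted(all_images - set(registered_image_names(images_txt)))
```

A typical call would pass `set(os.listdir("images"))` and the contents of the exported `images.txt`. Any name it returns is a candidate for deletion before matching again.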

AA-RMVSNet

To use AA-RMVSNet to reconstruct the building, please follow the steps listed below.

Clone this repository to a local folder.
The custom testing folder should be placed in the root directory of the cloned folder. This folder should have two subfolders named images and sparse. The images folder holds the photos, and the sparse folder should contain the three txt files recording the camera parameters.
Find the file list-dtu-test.txt and write in it the name of the folder you wish to test.
Run colmap2mvsnet.py with:
```
python ./sfm/colmap2mvsnet.py --dense_folder name --interval_scale 1.06 --max_d 512
```
The dense_folder parameter is required; the others are optional. You can also change the default values in the shell scripts below.
Once the previous step has finished, run the following commands:
```
sh ./scripts/eval_dtu.sh
sh ./scripts/fusion_dtu.sh
```
You should then see the output .ply files in the outputs_dtu folder.

Here dtu means that the data is organized in the format of the DTU dataset.
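
Before running the conversion, it can help to confirm that the custom testing folder has the layout described in the steps above. This is a minimal sketch, assuming the three text files carry the names COLMAP uses for its text export (`cameras.txt`, `images.txt`, `points3D.txt`). `check_scene_folder` is a hypothetical helper, not part of the repository.

```python
import os
from typing import List

# File names written by COLMAP's "Export model as text" (assumed unchanged).
SPARSE_FILES = ("cameras.txt", "images.txt", "points3D.txt")


def check_scene_folder(scene_dir: str) -> List[str]:
    """Return a list of problems found in the custom testing folder."""
    problems = []
    images_dir = os.path.join(scene_dir, "images")
    sparse_dir = os.path.join(scene_dir, "sparse")
    if not os.path.isdir(images_dir):
        problems.append("missing folder: images")
    elif not os.listdir(images_dir):
        problems.append("images folder is empty")
    if not os.path.isdir(sparse_dir):
        problems.append("missing folder: sparse")
    else:
        for name in SPARSE_FILES:
            if not os.path.isfile(os.path.join(sparse_dir, name)):
                problems.append("missing file: sparse/" + name)
    return problems
```

An empty list means the folder looks ready. Otherwise each entry names one thing to fix.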

Results

We reconstructed various spots on our campus. The reconstructed point cloud files are available here (Code: nz1e). You can visualize the files with MeshLab or CloudCompare.
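
To get a rough idea of how dense a reconstructed cloud is without opening a viewer, the number of points can be read from the file header. The sketch below assumes the .ply files follow the standard PLY layout, where the header is plain text and contains an `element vertex N` line. `ply_vertex_count` is a hypothetical helper written for illustration.

```python
from typing import BinaryIO


def ply_vertex_count(stream: BinaryIO) -> int:
    """Read a PLY header from a binary stream and return the vertex count."""
    if stream.readline().strip() != b"ply":
        raise ValueError("not a PLY file")
    count = None
    for raw in stream:
        line = raw.strip()
        if line.startswith(b"element vertex"):
            # Header line has the form: element vertex <count>
            count = int(line.split()[2])
        elif line == b"end_header":
            break  # stop before the (possibly binary) point data
    if count is None:
        raise ValueError("no 'element vertex' line in header")
    return count
```

Open the file in binary mode, for example `with open(path, "rb") as f: print(ply_vertex_count(f))`, where `path` points to one of the files in the outputs_dtu folder.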
