Learning 3D Part Assembly from a Single Image

Last update: Dec 21, 2022

Related tags

Overview

Learning 3D Part Assembly from a Single Image

This repository contains a PyTorch implementation of the paper:

Learning 3D Part Assembly from A Single Image.
Yichen Li*, Kaichun Mo*, Lin Shao, Minhyuk Sung, Leonidas Guibas,
ECCV 2020

Introduction

Autonomous assembly is a crucial capability for robots in many applications. For this task, several problems such as obstacle avoidance, motion planning, and actuator control have been extensively studied in robotics. However, when it comes to task specification, the space of possibilities remains underexplored. Towards this end, we introduce a novel problem, single-image-guided 3D part assembly, along with a learningbased solution. We study this problem in the setting of furniture assembly from a given complete set of parts and a single image depicting the entire assembled object. Multiple challenges exist in this setting, including handling ambiguity among parts (e.g., slats in a chair back and leg stretchers) and 3D pose prediction for parts and part subassemblies, whether visible or occluded. We address these issues by proposing a two-module pipeline that leverages strong 2D-3D correspondences and assembly-oriented graph message-passing to infer part relationships. In experiments with a PartNet-based synthetic benchmark, we demonstrate the effectiveness of our framework as compared with three baseline approaches.

Dependencies

Python 3.6
CUDA 10.0.
PyTorch. code tested with version 1.3.1
Blender. for visualization of results 2.7.9
(Optional) Tensorboard for visualization of the training process.
For the project it has been used TensorboardX

pip install -r requirements.txt

Chamfer Distance

cd exps/utils/cd
python setup.py install

Dataset

Data is available here: link.

wget http://download.cs.stanford.edu/orion/impartass/assembly_data.zip

Training

Training the segmentation stage first

cd exps/exp_segmentation
sh train.sh

modify your parameters including data_path, exp_name and etc. (see closed issues for details info)

Training the assembly stage

cd exps/exp_assemble
sh train.sh

Pre-trained models

Pretrained weights for the chair category is available at link.

wget http://download.cs.stanford.edu/orion/impartass/chair_weights.zip

Cite

Please cite our work if you find it useful:

@article{li2020impartass,
    title={Learning 3D Part Assembly from a Single Image},
    author={Li, Yichen and Mo, Kaichun and Shao, Lin and Sung, Minghyuk and Guibas, Leonidas},
    journal={European conference on computer vision (ECCV 2020)},
    year={2020}
}

Learning 3D Part Assembly from a Single Image

Related tags

Overview

Learning 3D Part Assembly from a Single Image

Introduction

Dependencies

Dataset

Training

Training the segmentation stage first

Training the assembly stage

Pre-trained models

Cite

Owner

Resilient projection-based consensus actor-critic (RPBCAC) algorithm

Official PyTorch Implementation of Embedding Transfer with Label Relaxation for Improved Metric Learning, CVPR 2021

MogFace: Towards a Deeper Appreciation on Face Detection

Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation" by Shizhe Diao et al.

This is a Tensorflow implementation of Learning to See in the Dark in CVPR 2018

Pairwise learning neural link prediction for ogb link prediction

Pytorch reimplement of the paper "A Novel Cascade Binary Tagging Framework for Relational Triple Extraction" ACL2020. The original code is written in keras.

Implementation of QuickDraw - an online game developed by Google, combined with AirGesture - a simple gesture recognition application

Implementation of our paper "DMT: Dynamic Mutual Training for Semi-Supervised Learning"

MPViT:Multi-Path Vision Transformer for Dense Prediction

Official PyTorch implementation of PS-KD

A pre-trained model with multi-exit transformer architecture.

Proof-Of-Concept Piano-Drums Music AI Model/Implementation

DANA paper supplementary materials

[ICRA2021] Reconstructing Interactive 3D Scene by Panoptic Mapping and CAD Model Alignment

PyTorch GPU implementation of the ES-RNN model for time series forecasting

🧮 Matrix Factorization for Collaborative Filtering is just Solving an Adjoint Latent Dirichlet Allocation Model after All

Ensemble Visual-Inertial Odometry (EnVIO)

You Only 👀 One Sequence

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥