Auto-Lama combines object detection and image inpainting to automate object removals

Last update: Dec 09, 2022

Overview

Auto-Lama

Auto-Lama combines object detection and image inpainting to automate object removal. It is built on top of DETR from Facebook Research and LaMa from Samsung Research. The process has three steps:

Objects are detected using the detector.
Masks are generated based on the bounding boxes drawn by the detector.
The original image is sent to the inpainter along with the masks.
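The second step above, turning bounding boxes into a mask, can be sketched as follows. This is a minimal illustration and not the repository's own code: the function name `boxes_to_mask`, the `(x_min, y_min, x_max, y_max)` box layout, and the use of plain nested lists are all assumptions made for the example. A real implementation would more likely use NumPy arrays or PIL images.

```python
from typing import List, Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixels, max edges exclusive.
Box = Tuple[int, int, int, int]


def boxes_to_mask(width: int, height: int, boxes: List[Box]) -> List[List[int]]:
    """Return a height x width grid holding 1 inside any box and 0 elsewhere."""
    mask = [[0] * width for _ in range(height)]
    for x_min, y_min, x_max, y_max in boxes:
        # Clamp the box to the image so out-of-range detections do not fail.
        x0, x1 = max(0, x_min), min(width, x_max)
        y0, y1 = max(0, y_min), min(height, y_max)
        for y in range(y0, y1):
            for x in range(x0, x1):
                mask[y][x] = 1
    return mask


if __name__ == "__main__":
    # A 4x3 image with one 2x2 box in the top middle.
    for row in boxes_to_mask(4, 3, [(1, 0, 3, 2)]):
        print(row)
```

The inpainter would then receive the original image together with a mask like this one, where the marked pixels are the region to fill in.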

Masking

There are currently a few ways of generating masks:

Masking objects with specified indices.
Masking one main object at a time.
Masking all objects other than the main object.
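The three masking modes can be sketched as a single selection function. This is a hypothetical illustration: the mode names (`"indices"`, `"main"`, `"all_but_main"`), the function name, and above all the definition of the "main object" as the box with the largest area are assumptions, since the text does not say how the main object is chosen.

```python
from typing import List, Sequence, Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[int, int, int, int]


def box_area(box: Box) -> int:
    x_min, y_min, x_max, y_max = box
    return max(0, x_max - x_min) * max(0, y_max - y_min)


def select_boxes_to_mask(
    boxes: Sequence[Box], mode: str, indices: Sequence[int] = ()
) -> List[int]:
    """Return the indices of the detected boxes that should be masked."""
    if mode not in ("indices", "main", "all_but_main"):
        raise ValueError(f"unknown masking mode: {mode}")
    if not boxes:
        return []
    if mode == "indices":
        # Keep only the requested indices that refer to an existing box.
        return [i for i in indices if 0 <= i < len(boxes)]
    # Assumption: the "main" object is the one with the largest bounding box.
    main = max(range(len(boxes)), key=lambda i: box_area(boxes[i]))
    if mode == "main":
        return [main]
    return [i for i in range(len(boxes)) if i != main]
```

For example, with three boxes where the second is the largest, `"main"` selects only index 1 and `"all_but_main"` selects indices 0 and 2.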

Future Goals

Use a segmentation method that is more precise than bounding boxes
Implementing a detector that has more

Environment Setup

Prerequisites

docker
make
conda

Building Environment

make build-conda-env
conda activate auto-lama
make build-env

Cleaning Directory

make clean

Detect and Inpaint

Setup

The default config for the detector is:

PARAMETERS = {
    "model_name": "facebook/detr-resnet-50",
    "threshold": 0.9,
    "max_items": 10,
    "save_destination": "./test_images",
    "output_destination": "./output_images",
    "max_width": 2000,
    "max_height": 2000,
    "resize": True,
    "resize_scale": 0.75,
    "excluded_objects": [91],
    "image_format": "PNG",
    "mask_target_items": [],
}
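As a rough guide to how some of these parameters might interact, the sketch below filters a list of detections. It is based on assumptions rather than on the repository's code: `threshold` is read as a minimum confidence score, `max_items` as a cap on the number of detections kept (highest scores first), and `excluded_objects` as class IDs to drop. The detection layout (a dict with `label` and `score`) and the function name are hypothetical.

```python
from typing import Dict, List, Sequence


def filter_detections(
    detections: List[Dict],
    threshold: float = 0.9,
    max_items: int = 10,
    excluded_objects: Sequence[int] = (91,),
) -> List[Dict]:
    """Keep confident, non-excluded detections, at most max_items of them."""
    kept = [
        d
        for d in detections
        # Assumption: a score equal to the threshold is accepted.
        if d["score"] >= threshold and d["label"] not in excluded_objects
    ]
    # Highest-confidence detections first, then cap the list.
    kept.sort(key=lambda d: d["score"], reverse=True)
    return kept[:max_items]


if __name__ == "__main__":
    sample = [
        {"label": 1, "score": 0.95},
        {"label": 91, "score": 0.99},  # dropped: excluded class ID
        {"label": 3, "score": 0.50},   # dropped: below threshold
        {"label": 2, "score": 0.97},
    ]
    print(filter_detections(sample))
```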

Refer to the COCO dataset's category list for the target items that you want to mask, since the default DETR model uses the COCO dataset.

Run

make detect_and_inpaint IMAGE_PATH=path/to/image

or

make detect_and_inpaint IMAGE_PATH={image_url}
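Since IMAGE_PATH may be either a local path or a URL, the tool has to tell the two apart somewhere. One way to do that is sketched below using only the standard library. The function name `is_remote_image` is hypothetical, and treating only `http` and `https` as remote is an assumption; the repository may handle this differently.

```python
from urllib.parse import urlparse


def is_remote_image(image_path: str) -> bool:
    """Return True when image_path looks like an http(s) URL."""
    # Local paths have no scheme; Windows drive letters parse as a
    # one-letter scheme, which is not in the accepted set either.
    scheme = urlparse(image_path).scheme.lower()
    return scheme in ("http", "https")


if __name__ == "__main__":
    for candidate in ("path/to/image", "https://example.com/cat.png"):
        print(candidate, is_remote_image(candidate))
```

A caller could download the image when this returns True and open it directly from disk otherwise.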
