Official code for MPG2: Multi-attribute Pizza Generator: Cross-domain Attribute Control with Conditional StyleGAN

Last update: Sep 01, 2022

Related tags

Overview

This is the official code for Multi-attribute Pizza Generator (MPG2): Cross-domain Attribute Control with Conditional StyleGAN.

Paper	Demo

Setup Environment

can NOT run on CPU

conda create -n mpg python=3.8
conda activate mpg
git clone [email protected]:klory/food_project.git
cd food_project
pip install -r requirements.txt
pip install git+https://github.com/pytorch/[email protected]

Pretrained models

Pretrained models are stored in google-link, files are already in their desired locations, so following the same directory structure will minimize burdens to run the code inside the project (some files are not necessary for the current version of the project as of 2021-03-31).

Pizza10 dataset

Please follow MPG repository.

Ingredient classifier

Please follow MPG repository.

PizzaView dataset

Download PizzaView Dataset from google-link/data/Pizza3D.

cd to datasets/

$ python pizza3d.py

View regressor

cd to view_regressor/

Train

$ CUDA_VISIBLE_DEVICES=0 python train.py --wandb=0

Validate

Download the pretrained model google-link/view_regressor/runs/pizza3d/1ab8hru7/00004999.ckpt:

$ CUDA_VISIBLE_DEVICES=0 python val.py --ckpt_path=/runs/pizza3d/1ab8hru7/00004999.ckpt

MPG2

cd to mpg/,

Train

$ CUDA_VISIBLE_DEVICES=0,1 python train.py --wandb=0

Validate

Download the pretrained model google-linkmpg/runs/30cupu9m/00260000.ckpt.

cd to metrics/:

CUDA_VISIBLE_DEVICES=0 python generate_samples.py --model=mpg

Metrics

cd to metrics/,

For more about FID and mAP, follow MPG repository.

FID (Frechet Inception Distance)

To compute FID, we need to first compute the statistics of the real images.

CUDA_VISIBLE_DEVICES=0 python calc_inception.py

then

$ CUDA_VISIBLE_DEVICES=0 python fid.py --model=mpg

I got FID=6.33 using the provided checkpoint.

mAE (mean Absolute Error) for view attributes

Computing mAE uses the pre-trained view regressor.

$ CUDA_VISIBLE_DEVICES=0 python mAE.py --model=mpg

Demo

cd to metrics/.

CUDA_VISIBLE_DEVICES=0 streamlit run app.py

Official code for MPG2: Multi-attribute Pizza Generator: Cross-domain Attribute Control with Conditional StyleGAN

Related tags

Overview

Setup Environment

Pretrained models

Pizza10 dataset

Ingredient classifier

PizzaView dataset

View regressor

Train

Validate

MPG2

Train

Validate

Metrics

FID (Frechet Inception Distance)

mAE (mean Absolute Error) for view attributes

Demo

Owner

Fangda Han

This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (KDD'21).

Advanced Signal Processing Notebooks and Tutorials

An implementation of quantum convolutional neural network with MindQuantum. Huawei, classifying MNIST dataset

PyTorch implementation DRO: Deep Recurrent Optimizer for Structure-from-Motion

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

Code for ICDM2020 full paper: "Sub-graph Contrast for Scalable Self-Supervised Graph Representation Learning"

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Six - a Python 2 and 3 compatibility library

This repository contains the files for running the Patchify GUI.

LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference

Self-supervised learning algorithms provide a way to train Deep Neural Networks in an unsupervised way using contrastive losses

i3DMM: Deep Implicit 3D Morphable Model of Human Heads

A containerized REST API around OpenAI's CLIP model.

Official implementation of Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models at NeurIPS 2021

Image Super-Resolution by Neural Texture Transfer

Group project for MFIN7036. Our goal is to predict firm profitability with text-based competition measures.

AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations

The repository contains source code and models to use PixelNet architecture used for various pixel-level tasks. More details can be accessed at .

Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom

Unofficial implementation of MUSIQ (Multi-Scale Image Quality Transformer)