A PyTorch implementation of the paper "Semantic Image Synthesis via Adversarial Learning" in ICCV 2017

Last update: Nov 25, 2022

Related tags

Deep Learning dong_iccv_2017

Overview

Semantic Image Synthesis via Adversarial Learning

This is a PyTorch implementation of the paper Semantic Image Synthesis via Adversarial Learning.

Requirements

PyTorch 0.2
Torchvision
Pillow
fastText.py (Note: if you have trouble loading a pretrained model, try my fixed code)
NLTK

Pretrained word vectors for fastText

Download pretrained English word vectors. You can see the list of pretrained vectors on this page.
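
The repository loads the pretrained model through fastText.py. As an independent illustration of what the pretrained vectors contain, the sketch below parses the plain-text `.vec` format that fastText distributes, where the first line holds the vocabulary size and dimension and every later line holds a word followed by its values. The function name `load_vec` and the `limit` parameter are hypothetical and are not part of this repository or of fastText.py.

```python
import io


def load_vec(stream, limit=None):
    """Parse fastText text-format vectors from an open text stream.

    Hypothetical helper for illustration; not part of this repository.
    Returns (dimension, {word: [float, ...]}).
    """
    # The header line is "<vocabulary size> <dimension>".
    header = stream.readline().split()
    dim = int(header[1])
    vectors = {}
    for line in stream:
        if limit is not None and len(vectors) >= limit:
            break
        parts = line.rstrip().split(" ")
        word, values = parts[0], parts[1:]
        # Skip malformed lines whose length does not match the header.
        if len(values) != dim:
            continue
        vectors[word] = [float(v) for v in values]
    return dim, vectors


# Tiny in-memory example standing in for a downloaded .vec file.
sample = io.StringIO("2 3\nbird 0.1 0.2 0.3\nflower -0.4 0.5 0.6\n")
dim, vectors = load_vec(sample)
```

For a real file, the same function could be called on `open(path, encoding="utf-8")`, with `limit` set to keep memory use manageable.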

Datasets

Oxford-102 flowers: images and captions
Caltech-200 birds: images and captions

The caption data is from this repository. After downloading, modify the CONFIG file so that all dataset paths point to the data you downloaded.

Run

scripts/train_text_embedding_[birds/flowers].sh
Train a visual-semantic embedding model using the method of Kiros et al.
scripts/train_[birds/flowers].sh
Train a GAN using a pretrained text embedding model.
scripts/test_[birds/flowers].sh
Generate some examples using original images and semantically relevant texts.
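
The visual-semantic embedding step follows Kiros et al., whose objective is a pairwise ranking loss: a matching image and caption should score higher, by some margin, than any mismatched pair. The sketch below shows that loss in plain Python for readability. It is not the code used by the training scripts, and it rests on assumptions: the margin value `0.2`, the use of the other items in the batch as negatives, and the function names are all illustrative.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def pairwise_ranking_loss(image_embs, text_embs, margin=0.2):
    """Sum of hinge terms over all mismatched pairs in a batch.

    image_embs[i] and text_embs[i] are assumed to be a matching pair;
    every other index in the batch serves as a negative (an assumption).
    """
    n = len(image_embs)
    scores = [[cosine(img, txt) for txt in text_embs] for img in image_embs]
    loss = 0.0
    for i in range(n):
        positive = scores[i][i]
        for k in range(n):
            if k == i:
                continue
            # Image i paired with a wrong caption k.
            loss += max(0.0, margin - positive + scores[i][k])
            # Caption i paired with a wrong image k.
            loss += max(0.0, margin - positive + scores[k][i])
    return loss


# Perfectly aligned pairs incur no loss; swapped captions are penalised.
aligned = pairwise_ranking_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
swapped = pairwise_ranking_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
```

In a PyTorch training loop the same quantity would normally be computed on batched tensors so that gradients can flow to both encoders.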

Results

Acknowledgements

We would like to thank Hao Dong, who is one of the first authors of the paper Semantic Image Synthesis via Adversarial Learning, for providing helpful advice for the implementation.

Owner

Seonghyeon Nam
