Generate Cartoon Images using Generative Adversarial Network

Last update: Dec 29, 2022

Overview

AvatarGAN ✨

Generate Cartoon Images using DC-GAN

Deep Convolutional GAN (DCGAN) is a generative adversarial network architecture built around a few design guidelines:

Replacing any pooling layers with strided convolutions (discriminator) and fractional-strided convolutions (generator).
Using batchnorm in both the generator and the discriminator.
Removing fully connected hidden layers for deeper architectures.
Using ReLU activation in the generator for all layers except the output, which uses tanh.
Using LeakyReLU activation in the discriminator for all layers.
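
The guidelines above can be sketched as a pair of small networks. This is a minimal PyTorch sketch under stated assumptions, not the repository's actual code: the framework, the 64x64 RGB image size, the 100-dimensional noise vector, and the names Generator, Discriminator, LATENT_DIM and features are all chosen here for illustration.

```python
import torch
from torch import nn

LATENT_DIM = 100  # assumed size of the input noise vector


class Generator(nn.Module):
    """Maps noise of shape (N, latent_dim, 1, 1) to images of shape (N, channels, 64, 64)."""

    def __init__(self, latent_dim=LATENT_DIM, features=64, channels=3):
        super().__init__()

        def up(cin, cout, stride, pad):
            # Fractional-strided convolution + batchnorm + ReLU, no pooling
            return [
                nn.ConvTranspose2d(cin, cout, 4, stride, pad, bias=False),
                nn.BatchNorm2d(cout),
                nn.ReLU(inplace=True),
            ]

        self.net = nn.Sequential(
            *up(latent_dim, features * 8, 1, 0),    # 1x1   -> 4x4
            *up(features * 8, features * 4, 2, 1),  # 4x4   -> 8x8
            *up(features * 4, features * 2, 2, 1),  # 8x8   -> 16x16
            *up(features * 2, features, 2, 1),      # 16x16 -> 32x32
            # Output layer: no batchnorm, tanh instead of ReLU
            nn.ConvTranspose2d(features, channels, 4, 2, 1, bias=False),  # -> 64x64
            nn.Tanh(),
        )

    def forward(self, z):
        return self.net(z)


class Discriminator(nn.Module):
    """Maps images of shape (N, channels, 64, 64) to one raw logit per image."""

    def __init__(self, features=64, channels=3):
        super().__init__()

        def down(cin, cout):
            # Strided convolution + batchnorm + LeakyReLU, no pooling
            return [
                nn.Conv2d(cin, cout, 4, 2, 1, bias=False),
                nn.BatchNorm2d(cout),
                nn.LeakyReLU(0.2, inplace=True),
            ]

        self.net = nn.Sequential(
            # Input layer: the DCGAN paper skips batchnorm here
            nn.Conv2d(channels, features, 4, 2, 1, bias=False),  # 64x64 -> 32x32
            nn.LeakyReLU(0.2, inplace=True),
            *down(features, features * 2),      # 32x32 -> 16x16
            *down(features * 2, features * 4),  # 16x16 -> 8x8
            *down(features * 4, features * 8),  # 8x8   -> 4x4
            # Final convolution replaces a fully connected classifier head
            nn.Conv2d(features * 8, 1, 4, 1, 0, bias=False),  # 4x4 -> 1x1
        )

    def forward(self, x):
        return self.net(x).view(-1)
```

The discriminator returns raw logits rather than probabilities, so it would pair with a loss such as nn.BCEWithLogitsLoss. Because the generator ends in tanh, training images would need to be scaled to the range [-1, 1].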

Check out the detailed explanation of AvatarGAN in the article AvatarGAN.

GAN Model

Define the Generator and Discriminator network architectures.
Train the Generator model to generate fake data that can fool the Discriminator.
Train the Discriminator model to distinguish real data from fake data.
Continue training for several epochs and save the Generator model.
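
The steps above can be sketched as a short training loop. This is an illustrative PyTorch sketch, not the project's training script: the function name train_gan, its parameters, the Adam settings (learning rate 2e-4, beta1 0.5) and the binary cross-entropy loss on logits are assumptions. It expects a generator that takes noise of shape (N, latent_dim, 1, 1) and a discriminator that returns one logit per image.

```python
import torch
from torch import nn


def train_gan(generator, discriminator, batches, latent_dim,
              epochs=1, lr=2e-4, device="cpu", save_path=None):
    """Alternate generator and discriminator updates.

    `batches` must be re-iterable (a list or DataLoader) and yield image
    tensors scaled to [-1, 1]. Returns a list with one
    (mean generator loss, mean discriminator loss) pair per epoch.
    """
    generator.to(device).train()
    discriminator.to(device).train()
    criterion = nn.BCEWithLogitsLoss()
    opt_g = torch.optim.Adam(generator.parameters(), lr=lr, betas=(0.5, 0.999))
    opt_d = torch.optim.Adam(discriminator.parameters(), lr=lr, betas=(0.5, 0.999))

    history = []
    for _ in range(epochs):
        g_total, d_total, steps = 0.0, 0.0, 0
        for real in batches:
            real = real.to(device)
            n = real.size(0)
            real_labels = torch.ones(n, device=device)
            fake_labels = torch.zeros(n, device=device)

            # Generator step: reward fakes that the discriminator labels as real
            fake = generator(torch.randn(n, latent_dim, 1, 1, device=device))
            opt_g.zero_grad()
            g_loss = criterion(discriminator(fake), real_labels)
            g_loss.backward()
            opt_g.step()

            # Discriminator step: real -> 1, fake -> 0.
            # detach() keeps this update from reaching the generator.
            opt_d.zero_grad()
            d_loss = (criterion(discriminator(real), real_labels)
                      + criterion(discriminator(fake.detach()), fake_labels))
            d_loss.backward()
            opt_d.step()

            g_total += g_loss.item()
            d_total += d_loss.item()
            steps += 1
        history.append((g_total / steps, d_total / steps))

    if save_path is not None:
        # Only the generator is needed to produce new images later
        torch.save(generator.state_dict(), save_path)
    return history
```

A call might look like train_gan(generator, discriminator, loader, latent_dim=100, epochs=25, save_path="generator.pt"), where loader, the epoch count and the file name are placeholders rather than values taken from the project.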

Dataset Setup

The dataset is Cartoon Set, a collection of random 2D cartoon avatar images. Download it using the shell script:

sh download-dataset.sh

This downloads the dataset into the data/ directory. To train the model in Google Colab, upload the dataset folder to Google Drive. The destination path should be projects/cartoons/.
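
Once the images are in data/, they need to be read into arrays before training. The following is a rough sketch of one way to do that with Pillow and NumPy. The helper name load_images, the 64x64 target size, the *.png pattern and the scaling to [-1, 1] (chosen to match a tanh generator output) are assumptions, not details taken from the project.

```python
from pathlib import Path

import numpy as np
from PIL import Image


def load_images(root="data", size=64, pattern="*.png"):
    """Load every matching image under `root` (searched recursively).

    Returns a float32 array of shape (N, size, size, 3) scaled to [-1, 1].
    """
    paths = sorted(Path(root).rglob(pattern))
    if not paths:
        raise FileNotFoundError(f"no files matching {pattern!r} under {str(root)!r}")

    images = []
    for path in paths:
        with Image.open(path) as img:
            # Drop any alpha channel and shrink to the training resolution
            rgb = img.convert("RGB").resize((size, size))
            images.append(np.asarray(rgb, dtype=np.float32))

    # Map pixel values from [0, 255] to [-1, 1]
    return np.stack(images) / 127.5 - 1.0
```

In Colab, root would instead point at the mounted Drive folder that holds the uploaded dataset.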

Model Training

Check out the model being trained to generate cartoon images.

Owner

Aakash Jhawar
