Paper: De-rendering Stylized Texts

Last update: Dec 18, 2022

Related tags

Overview

Paper: De-rendering Stylized Texts

Wataru Shimoda¹, Daichi Haraguchi², Seiichi Uchida², Kota Yamaguchi¹
¹CyberAgent.Inc, ² Kyushu University
Accepted to ICCV2021. [Publication] [Arxiv] [project-page]

Introduction

This repository contains the codes for "De-rendering stylized texts".

Concept

We propose to parse rendering parameters of stylized texts utilizing a neural net.

Demo

The proposed model parses rendering parameters based on famous 2d graphic engine[Skia.org|python implementation], which has compatibility with CSS in the Web. We can export the estimated rendering parameters and edit texts by an off-the-shelf rendering engine.

Installation

Requirements

Python >= 3.7
Pytorch >= 1.8.1
torchvision >= 0.9.1

pip install -r requiements.txt

Font data

The proposed model is trained with google fonts.
Download google fonts and locate in data/fonts/ as gfonts.

cd data/fonts
git clone https://github.com/google/fonts.git gfonts

Pre-rendered alpha maps

The proposed model parses rendering parameters and refines them through the differentiable rendering model, which uses pre-rendered alpha maps.
Generate pre-rendered alpha maps.

python -m util_lib.gen_pams

Pre-rendered alpha maps would be generated in data/fonts/prerendered_alpha.

Usage

Test

Download the pre-trained weight from this link (weight).
Locate the weight file in weights/font100_unified.pth.

Example usage.

python test.py --imgfile=example/sample.jpg

Note

imgfile option: path of an input image
results would be generated in res/

Data generation

in progress

Train

in progress

Todo

Testing codes
Codes for the text image generator
Training codes
Add notebooks for the guide

Reference

@InProceedings{Shimoda_2021_ICCV,
    author    = {Shimoda, Wataru and Haraguchi, Daichi and Uchida, Seiichi and Yamaguchi, Kota},
    title     = {De-Rendering Stylized Texts},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {1076-1085}
}

Contact

This repository is maintained by Wataru shimoda(wataru_shimoda[at]cyberagent.co.jp).

Paper: De-rendering Stylized Texts

Related tags

Overview

Paper: De-rendering Stylized Texts

Introduction

Concept

Demo

Installation

Requirements

Font data

Pre-rendered alpha maps

Usage

Test

Data generation

Train

Todo

Reference

Contact

Owner

CyberAgent AI Lab

Implementation of ETSformer, state of the art time-series Transformer, in Pytorch

Align and Prompt: Video-and-Language Pre-training with Entity Prompts

Learning from Synthetic Humans, CVPR 2017

Pytorch implementation of Straight Sampling Network For Point Cloud Learning (ICIP2021).

Normalizing Flows with a resampled base distribution

A medical imaging framework for Pytorch

RAMA: Rapid algorithm for multicut problem

PyTorch implementation of "Dataset Knowledge Transfer for Class-Incremental Learning Without Memory" (WACV2022)

Bayesian dessert for Lasagne

Enhancing Column Generation by a Machine-Learning-BasedPricing Heuristic for Graph Coloring

The official implementation of Variable-Length Piano Infilling (VLI).

Degree-Quant: Quantization-Aware Training for Graph Neural Networks.

Removing Inter-Experimental Variability from Functional Data in Systems Neuroscience

Supplemental learning materials for "Fourier Feature Networks and Neural Volume Rendering"

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

This repository provides the code for MedViLL(Medical Vision Language Learner).

Music Classification: Beyond Supervised Learning, Towards Real-world Applications

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

Using pretrained language models for biomedical knowledge graph completion.

Fully Convolutional Refined Auto Encoding Generative Adversarial Networks for 3D Multi Object Scenes