Official implementation of Self-supervised Image-to-text and Text-to-image Synthesis

Last update: Jul 31, 2022

Self-supervised Image-to-text and Text-to-image Synthesis

This is the official implementation of Self-supervised Image-to-text and Text-to-image Synthesis.

[Figures: model architecture diagrams]

Dataset

We use the Caltech-UCSD Birds-200-2011 (CUB-200-2011) and Oxford-102 Flowers datasets in this work.

Download the flower images (102flowers.zip).
Unzip 102flowers.zip, rename the extracted jpg folder to images, and put it inside a 102flowers folder.
Put the 102flowers folder inside the data folder.
Download the birds data and put it inside Data/.
Download the birds image data and extract it to Data/birds/.
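Before training, it can help to confirm that the folders are where the steps above put them. The following is a minimal sketch, not part of the repository: the helper name missing_dataset_dirs is hypothetical, and the exact nesting (data/102flowers/images and Data/birds) is an assumption read off the steps above.

```python
from pathlib import Path

# Folder names come from the dataset steps above; the exact nesting is an
# assumption, so adjust these paths if your checkout is laid out differently.
EXPECTED_DIRS = (
    Path("data") / "102flowers" / "images",
    Path("Data") / "birds",
)


def missing_dataset_dirs(root="."):
    """Return the expected dataset folders that do not exist under root."""
    root = Path(root)
    return [str(p) for p in EXPECTED_DIRS if not (root / p).is_dir()]
```

Calling missing_dataset_dirs() from the repository root returns an empty list when both folders are present.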

Dependencies

pytorch
torchvision
tensorboardX
pickle (included in the Python standard library)
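A quick way to see which of these packages are still missing is to probe for them without importing them. This is a sketch only: the helper name missing_modules is hypothetical, and the import names are an assumption about how the listed packages are imported (pytorch is imported as torch).

```python
import importlib.util

# Import names for the packages listed above; "pytorch" is imported as "torch".
REQUIRED_MODULES = ("torch", "torchvision", "tensorboardX", "pickle")


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

An empty result from missing_modules() means every listed package can be located by the current interpreter.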

Training

Training the image autoencoder

The driver program for training the image autoencoder is main.py

To train the image autoencoder on the flower dataset

python main.py --cfg cfg/flowers_3stages.yml --gpu 0

To train the image autoencoder on the birds dataset

python main.py --cfg cfg/birds_3stages.yml --gpu 0

Models are saved automatically after a fixed number of iterations. To restart from a failed step, edit netG_version in the respective .yml file.
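When launching both runs from a script, the two commands above can be built from a dataset name. This is a minimal sketch under stated assumptions: the helper image_autoencoder_command is hypothetical, and it only reproduces the --cfg and --gpu arguments shown above, not any other options main.py may accept.

```python
import sys

# Config paths are the ones shown in the commands above.
CONFIGS = {
    "flowers": "cfg/flowers_3stages.yml",
    "birds": "cfg/birds_3stages.yml",
}


def image_autoencoder_command(dataset, gpu=0):
    """Build the argument list that runs main.py for the given dataset."""
    if dataset not in CONFIGS:
        raise ValueError(f"unknown dataset {dataset!r}; expected one of {sorted(CONFIGS)}")
    # sys.executable keeps the run on the same Python interpreter as this script.
    return [sys.executable, "main.py", "--cfg", CONFIGS[dataset], "--gpu", str(gpu)]
```

The returned list can be passed to subprocess.run(cmd, check=True) from the repository root.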

Training the text autoencoder

python run_text_test.py dataset_type Input_Folder output_file.txt

For the flower dataset use dataset_type=1; for the birds dataset use dataset_type=2. For example:

python run_text_test.py 2 /home/user/dev/unsup/data_datasets/CUB_200_2011 outbirds_n.txt
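Because dataset_type is a bare number, it is easy to pass the wrong one. A small lookup keeps the meaning visible in calling code. This is a sketch only: the names DATASET_TYPES and text_autoencoder_args are hypothetical, and the positional order follows the usage line above.

```python
# Codes are the ones stated above: 1 for the flower dataset, 2 for the birds dataset.
DATASET_TYPES = {"flowers": 1, "birds": 2}


def text_autoencoder_args(dataset, input_folder, output_file):
    """Return the positional arguments for run_text_test.py, in usage order."""
    if dataset not in DATASET_TYPES:
        raise ValueError(f"unknown dataset {dataset!r}; expected one of {sorted(DATASET_TYPES)}")
    return [str(DATASET_TYPES[dataset]), input_folder, output_file]
```

For the birds example above, text_autoencoder_args("birds", input_folder, "outbirds_n.txt") yields the same three arguments that follow run_text_test.py on the command line.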

Training the mapping networks

Train the GAN-based mapping network

python MappingImageText.py Dataset_folder

e.g.

python MappingImageText.py /home/user/dev/unsup/data_datasets/CUB_200_2011

Train the MMD-based mapping network

python mmd_ganTI.py --dataset /home/das/dev/data_datasets/birds_dataset/CUB_200_2011 --gpu_device 0

python mmd_ganIT.py --dataset /home/das/dev/data_datasets/birds_dataset/CUB_200_2011 --gpu_device 0
