Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016

Last update: May 24, 2022

Segmentation from Natural Language Expressions

This repository contains the code for the following paper:

R. Hu, M. Rohrbach, T. Darrell, Segmentation from Natural Language Expressions. In ECCV, 2016.

@inproceedings{hu2016segmentation,
  title={Segmentation from Natural Language Expressions},
  author={Hu, Ronghang and Rohrbach, Marcus and Darrell, Trevor},
  booktitle={Proceedings of the European Conference on Computer Vision (ECCV)},
  year={2016}
}

Project Page: http://ronghanghu.com/text_objseg

Installation

Install Google TensorFlow (v1.0.0 or higher) following the official installation instructions.
Download this repository or clone with Git, and then cd into the root directory of the repository.
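
To confirm the installed TensorFlow meets the v1.0.0 minimum stated above, a small check like the following may help. This is a minimal sketch: the helper names version_tuple and meets_minimum are hypothetical and not part of this repository, and the comparison only looks at the first three numeric components of the version string.

```python
def version_tuple(version):
    """Parse '1.0.0' or '1.15.0-rc1' into a 3-int tuple such as (1, 15, 0)."""
    numbers = []
    for piece in version.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break  # ignore suffixes such as '-rc1'
            digits += ch
        numbers.append(int(digits) if digits else 0)
    while len(numbers) < 3:
        numbers.append(0)  # pad short versions like '1.0'
    return tuple(numbers)


def meets_minimum(installed, minimum="1.0.0"):
    """True when the installed version is at least the README's minimum."""
    return version_tuple(installed) >= version_tuple(minimum)


if __name__ == "__main__":
    try:
        import tensorflow as tf
    except ImportError:
        print("TensorFlow is not installed in this environment.")
    else:
        status = "OK" if meets_minimum(tf.__version__) else "older than 1.0.0"
        print("TensorFlow", tf.__version__, status)
```

The check only verifies the lower bound given in this README; it says nothing about whether a much newer TensorFlow release is compatible with the code.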

Demo

Download the trained models:
exp-referit/tfmodel/download_trained_models.sh.
Run the language-based segmentation model demo in ./demo/text_objseg_demo.ipynb with Jupyter Notebook (IPython Notebook).

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Download ReferIt dataset:
exp-referit/referit-dataset/download_referit_dataset.sh.
Download VGG-16 network parameters trained on ImageNet 1000 classes:
models/convert_caffemodel/params/download_vgg_params.sh.
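
Both download scripts are given relative to the repository root, so a quick sanity check before running them can catch a wrong working directory. The sketch below only tests that the two script paths named above exist; the function name missing_scripts is hypothetical and not part of this repository.

```python
import os

# Script paths exactly as listed in this README, relative to the repo root.
DOWNLOAD_SCRIPTS = [
    "exp-referit/referit-dataset/download_referit_dataset.sh",
    "models/convert_caffemodel/params/download_vgg_params.sh",
]


def missing_scripts(repo_root=".", scripts=DOWNLOAD_SCRIPTS):
    """Return the script paths that do not exist under repo_root."""
    return [s for s in scripts
            if not os.path.isfile(os.path.join(repo_root, *s.split("/")))]


if __name__ == "__main__":
    missing = missing_scripts()
    if missing:
        print("Not found (is this the repository root?):")
        for path in missing:
            print("  " + path)
    else:
        print("All download scripts found.")
```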

Training

You may need to add the repository root directory to Python's module path: export PYTHONPATH=.:$PYTHONPATH.
Build training batches for bounding boxes:
python exp-referit/build_training_batches_det.py.
Build training batches for segmentation:
python exp-referit/build_training_batches_seg.py.
Select the GPU you want to use during training:
export GPU_ID=<gpu id>. Use 0 for <gpu id> if you only have one GPU on your machine.
Train the language-based bounding box localization model:
python exp-referit/exp_train_referit_det.py $GPU_ID.
Train the low-resolution language-based segmentation model (initialized from the previous bounding box localization model):
python exp-referit/init_referit_seg_lowres_from_det.py && python exp-referit/exp_train_referit_seg_lowres.py $GPU_ID.
Train the high-resolution language-based segmentation model (initialized from the previous low-resolution segmentation model):
python exp-referit/init_referit_seg_highres_from_lowres.py && python exp-referit/exp_train_referit_seg_highres.py $GPU_ID.
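
The training steps above have to run in order, since each stage is initialized from the previous one. As a rough sketch, the sequence could be driven from a single Python script. The function names training_commands and run_all and the --run flag are hypothetical and not part of this repository; the script paths are the ones listed above. By default it only prints the commands, and it assumes it is started from the repository root.

```python
import os
import subprocess
import sys


def training_commands(gpu_id):
    """Return the README's training commands, in order, as argument lists."""
    gpu = str(gpu_id)
    return [
        ["python", "exp-referit/build_training_batches_det.py"],
        ["python", "exp-referit/build_training_batches_seg.py"],
        ["python", "exp-referit/exp_train_referit_det.py", gpu],
        ["python", "exp-referit/init_referit_seg_lowres_from_det.py"],
        ["python", "exp-referit/exp_train_referit_seg_lowres.py", gpu],
        ["python", "exp-referit/init_referit_seg_highres_from_lowres.py"],
        ["python", "exp-referit/exp_train_referit_seg_highres.py", gpu],
    ]


def run_all(commands, repo_root=".", dry_run=True):
    """Print each command; unless dry_run, run it and stop on the first failure."""
    env = dict(os.environ)
    # Mirrors `export PYTHONPATH=.:$PYTHONPATH` from the README.
    env["PYTHONPATH"] = os.pathsep.join(
        p for p in [".", env.get("PYTHONPATH", "")] if p)
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check_call raises on a non-zero exit code, like `&&` in the shell.
            subprocess.check_call(cmd, cwd=repo_root, env=env)


if __name__ == "__main__":
    gpu_id = os.environ.get("GPU_ID", "0")
    run_all(training_commands(gpu_id), dry_run="--run" not in sys.argv)
```

Running the script with no arguments lists the seven commands for review; passing --run executes them, stopping at the first stage that fails so a later stage never starts from a missing checkpoint.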

Alternatively, you may skip the training procedure and download the trained models directly:
exp-referit/tfmodel/download_trained_models.sh.

Evaluation

Select the GPU you want to use during testing: export GPU_ID=<gpu id>. Use 0 for <gpu id> if you only have one GPU on your machine. Also, you may need to add the repository root directory to Python's module path: export PYTHONPATH=.:$PYTHONPATH.
Run evaluation for the high-resolution language-based segmentation model:
python exp-referit/exp_test_referit_seg.py $GPU_ID
This should reproduce the results in the paper.
You may also evaluate the language-based bounding box localization model:
python exp-referit/exp_test_referit_det.py $GPU_ID
The results can be compared to this paper.

Owner

Ronghang Hu
