Which Apple Keeps Which Doctor Away? Colorful Word Representations with Visual Oracles

Last update: Apr 14, 2022

AppleLM

Which Apple Keeps Which Doctor Away? Colorful Word Representations with Visual Oracles (TASLP 2022)

Setup

This implementation is based on the Hugging Face Transformers library.

Preparation

Download GLUE datasets

The datasets can be downloaded automatically with the download script from https://github.com/nyu-mll/GLUE-baselines:

git clone https://github.com/nyu-mll/GLUE-baselines.git
python GLUE-baselines/download_glue_data.py --data_dir glue_data --tasks all

We recommend placing the glue_data folder under data/. The directory layout then looks like:

AppleLM
└───data
│   └───glue_data
│       │   CoLA/
│       │   MRPC/
│       │   ...
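Before launching a run, it can help to confirm that the layout above is in place. The following is a minimal sketch, not part of the repository; the helper name missing_glue_tasks and its default task list are hypothetical and chosen only for illustration.

```python
from pathlib import Path


def missing_glue_tasks(data_root, tasks=("CoLA", "MRPC")):
    """Return the task folders that are absent under <data_root>/glue_data.

    Hypothetical helper: the task list is an illustrative default,
    not something defined by the repository.
    """
    glue_dir = Path(data_root) / "glue_data"
    return [task for task in tasks if not (glue_dir / task).is_dir()]


if __name__ == "__main__":
    # Assumes the script is run from the repository root, next to data/.
    missing = missing_glue_tasks("data")
    print("missing task folders:", missing or "none")
```

An empty result means every listed task folder was found under data/glue_data.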

Visual Features

Pre-extracted visual features can be downloaded from Google Drive; they are borrowed from the Multi30K repo.

The features are used by the image embedding layer for indexing. Extract train-resnet50-avgpool.npy and put it in the data/ folder.
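To sanity-check the extracted file, the sketch below memory-maps it and gathers rows by image index. It assumes the file stores a 2-D array with one feature row per image, which the source does not state; load_visual_features and lookup are hypothetical helper names, not functions from this repository.

```python
import os

import numpy as np


def load_visual_features(path):
    """Memory-map a .npy feature matrix.

    Assumption: the array has shape (num_images, feature_dim).
    """
    feats = np.load(path, mmap_mode="r")
    if feats.ndim != 2:
        raise ValueError(f"expected a 2-D array, got shape {feats.shape}")
    return feats


def lookup(feats, image_ids):
    """Gather the feature rows for the given image indices into memory."""
    idx = np.asarray(image_ids, dtype=np.int64)
    return np.asarray(feats[idx])


if __name__ == "__main__":
    path = "data/train-resnet50-avgpool.npy"
    if os.path.exists(path):
        feats = load_visual_features(path)
        print(feats.shape, lookup(feats, [0, 1, 2]).shape)
```

Memory-mapping avoids reading the whole matrix into RAM just to inspect its shape; only the indexed rows are copied.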

Training & Evaluation

export GLUE_DIR=data/glue_data/
export CUDA_VISIBLE_DEVICES="0"
export TASK_NAME=CoLA
python ./examples/run_glue_visual-tfidf_att.py \
    --model_type bert \
    --model_name_or_path bert-large-uncased-whole-word-masking \
    --task_name $TASK_NAME \
    --do_eval \
    --do_lower_case \
    --data_dir $GLUE_DIR/$TASK_NAME \
    --max_seq_length 128 \
    --per_gpu_eval_batch_size=32 \
    --per_gpu_train_batch_size=16 \
    --learning_rate 1e-5 \
    --eval_all_checkpoints \
    --save_steps 500 \
    --max_steps 5336 \
    --warmup_steps 320 \
    --image_dir data/train.lc.norm.tok.en \
    --image_embedding_file data/train-resnet50-avgpool.npy \
    --num_img 3 \
    --tfidf 5 \
    --image_merge att-gate \
    --stopwords_dir data/stopwords-en.txt \
    --output_dir experiments/CoLA_bert_wwm
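To see how --learning_rate, --warmup_steps, and --max_steps interact, the sketch below computes the learning rate at a given step. It assumes the script keeps the linear warmup followed by linear decay used in the stock Transformers run_glue.py example, which the source does not confirm; linear_warmup_decay_lr is a hypothetical name introduced here.

```python
def linear_warmup_decay_lr(step, base_lr=1e-5, warmup_steps=320, max_steps=5336):
    """Learning rate at `step` under linear warmup, then linear decay to zero.

    Defaults mirror the command above; the schedule itself is an assumption.
    """
    if step < warmup_steps:
        # Ramp linearly from 0 up to base_lr over the warmup period.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr down to 0 at max_steps.
    remaining = max(0.0, (max_steps - step) / max(1, max_steps - warmup_steps))
    return base_lr * remaining


if __name__ == "__main__":
    for step in (0, 160, 320, 2828, 5336):
        print(step, linear_warmup_decay_lr(step))
```

Under this assumption the rate peaks at 1e-5 at step 320 and reaches zero at step 5336.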

Reference

Please kindly cite this paper in your publications if it helps your research:

@ARTICLE{zhang2022which,
  author={Zhang, Zhuosheng and Yu, Haojie and Zhao, Hai and Utiyama, Masao},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, 
  title={Which Apple Keeps Which Doctor Away? Colorful Word Representations With Visual Oracles}, 
  year={2022},
  volume={30},
  number={},
  pages={49-59},
  doi={10.1109/TASLP.2021.3130972}
}

Owner

Zhuosheng Zhang
