This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Last update: Dec 11, 2022

Related tags

Deep Learning prompt_semantics

Overview

This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Usage

To replicate our results in Section 4, run:

python3 prompt_tune.py \
    --save-dir ../runs/prompt_tuned_sec4/ \
    --prompt-path ../data/binary_NLI_prompts.csv \
    --experiment-name sec4 \
    --few-shots 3,5,10,20,30,50,100,250 \
    --production \
    --seeds 1

Add --fully-train if you want to train on the entire training set in addition to few-shot settings.

To replicate Section 5, run:

python3 prompt_tune.py \
    --save-dir ../runs/prompt_tuned_sec5/ \
    --prompt-path ../data/binary_NLI_prompts_permuted.csv \
    --experiment-name sec5 \
    --few-shots 3,5,10,20,30,50,100,250 \
    --production \
    --seeds 1

To get a fine-tuning baseline (Figure 1):

python3 fine_tune.py \
    --save-dir ../runs/fine_tune/ \
    --epochs 5 \
    --few-shots 3,5,10,20,30,50,100,250 \
    --fully-train \
    --production \
    --seeds 1

To replicate our exact results, use --seeds 1,2,3,4,5,6,7,8, which yields starting_example_index of 550,231,974,966,1046,2350,1326,928 respectively. This is important for ensuring that all models trained under the same seed always see exactly the same training examples. See paper Section 3 for more details.

If these seeds do not generate the same starting_example_index for you (which you can check in the output CSV files), you will have to manually specify the few-shot subset of training examples. I plan to add an argparse argument for this to make it easy.

All other hyperparameters are the same as the argparse default.

Miscellaneous Notes

You might notice that the code and output files are set up to produce a fine-grained analysis of HANS (McCoy et al., 2019). We actually run all of our main experiments on HANS as well and got similar results, which we plan to write up in a future version of our paper. Meanwhile, if you’re curious, feel free to add --do-diagnosis which will report the results on HANS.

Requirements

Python 3.9.

3.7 should mostly work too. You’d have to just replace the new built-in type hints and dictionary union operators with their older equivalents.

Activate your preferred virtual envrionment and then run pip install -r requirements.txt. If you want to replicate our exact results, use

torch==1.9.0+cu111
transformers==4.9.2
datasets==1.11.0

This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Related tags

Overview

Usage

Miscellaneous Notes

Requirements

Owner

Albert Webson

A Topic Modeling toolbox

IOT: Instance-wise Layer Reordering for Transformer Structures

Code in PyTorch for the convex combination linear IAF and the Householder Flow, J.M. Tomczak & M. Welling

LETR: Line Segment Detection Using Transformers without Edges

LTR_CrossEncoder: Legal Text Retrieval Zalo AI Challenge 2021

Generating Radiology Reports via Memory-driven Transformer

yolov5 deepsort 行人车辆跟踪检测计数

This repo is to be freely used by ML devs to check the GAN performances without coding from scratch.

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)

Technical experimentations to beat the stock market using deep learning :chart_with_upwards_trend:

Simultaneous Demand Prediction and Planning

Structure-Preserving Deraining with Residue Channel Prior Guidance (ICCV2021)

PyTorch Implementation of Vector Quantized Variational AutoEncoders.

Implementation for "Manga Filling Style Conversion with Screentone Variational Autoencoder" (SIGGRAPH ASIA 2020 issue)

Clustering is a popular approach to detect patterns in unlabeled data

Hypernetwork-Ensemble Learning of Segmentation Probability for Medical Image Segmentation with Ambiguous Labels

[ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets"

A highly efficient, fast, powerful and light-weight anime downloader and streamer for your favorite anime.

Optimized Gillespie algorithm for simulating Stochastic sPAtial models of Cancer Evolution (OG-SPACE)

Clockwork Variational Autoencoder

This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Related tags

Overview

Usage

Miscellaneous Notes

Requirements

Owner

Albert Webson

A Topic Modeling toolbox

IOT: Instance-wise Layer Reordering for Transformer Structures

Code in PyTorch for the convex combination linear IAF and the Householder Flow, J.M. Tomczak & M. Welling

LETR: Line Segment Detection Using Transformers without Edges

LTR_CrossEncoder: Legal Text Retrieval Zalo AI Challenge 2021

Generating Radiology Reports via Memory-driven Transformer

yolov5 deepsort 行人 车辆 跟踪 检测 计数

This repo is to be freely used by ML devs to check the GAN performances without coding from scratch.

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)

Technical experimentations to beat the stock market using deep learning :chart_with_upwards_trend:

Simultaneous Demand Prediction and Planning

Structure-Preserving Deraining with Residue Channel Prior Guidance (ICCV2021)

PyTorch Implementation of Vector Quantized Variational AutoEncoders.

Implementation for "Manga Filling Style Conversion with Screentone Variational Autoencoder" (SIGGRAPH ASIA 2020 issue)

Clustering is a popular approach to detect patterns in unlabeled data

Hypernetwork-Ensemble Learning of Segmentation Probability for Medical Image Segmentation with Ambiguous Labels

[ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets"

A highly efficient, fast, powerful and light-weight anime downloader and streamer for your favorite anime.

Optimized Gillespie algorithm for simulating Stochastic sPAtial models of Cancer Evolution (OG-SPACE)

Clockwork Variational Autoencoder

yolov5 deepsort 行人车辆跟踪检测计数