A Model for Natural Language Attack on Text Classification and Inference

Last update: Dec 16, 2022

Overview

TextFooler

This is the source code for the paper: Jin, Di, et al. "Is BERT Really Robust? Natural Language Attack on Text Classification and Entailment." arXiv preprint arXiv:1907.11932 (2019). If you use the code, please cite the paper:

@article{jin2019bert,
  title={Is BERT Really Robust? Natural Language Attack on Text Classification and Entailment},
  author={Jin, Di and Jin, Zhijing and Zhou, Joey Tianyi and Szolovits, Peter},
  journal={arXiv preprint arXiv:1907.11932},
  year={2019}
}

Data

Our 7 datasets are here.

Prerequisites:

Required packages are listed in the requirements.txt file:

pip install -r requirements.txt

How to use

Run the following code to install the esim package:

cd ESIM
python setup.py install
cd ..

(Optional) Run the following code to pre-compute the cosine similarity scores between word pairs based on the counter-fitting word embeddings.

python comp_cos_sim_mat.py [PATH_TO_COUNTER_FITTING_WORD_EMBEDDINGS]
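
As a rough illustration of what pre-computing these scores involves, the sketch below computes the pairwise cosine similarity between a handful of vectors. It is not the code in comp_cos_sim_mat.py: the function names and the toy vectors are assumptions made for this example, and it uses plain Python lists so that it runs without extra packages. A real run over the full counter-fitting vocabulary would more likely use a vectorized library.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Convention chosen for this sketch: a zero vector is similar to nothing.
        return 0.0
    return dot / (norm_u * norm_v)


def cosine_similarity_matrix(vectors):
    """Pairwise cosine similarity for a list of vectors, returned as a list of rows."""
    return [[cosine_similarity(u, v) for v in vectors] for u in vectors]


# Toy 2-d vectors standing in for word embeddings (hypothetical values).
toy_vectors = [[1.0, 0.0], [2.0, 0.0], [0.0, 3.0]]
sim = cosine_similarity_matrix(toy_vectors)
```

Entry `sim[i][j]` holds the score for the word pair (i, j). Saving such a matrix once and reloading it later is what makes the pre-computation step worthwhile.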

Run the following code to generate the adversaries for text classification:

python attack_classification.py

For natural language inference:

python attack_nli.py

Example invocations of these two scripts are in run_attack_classification.py and run_attack_nli.py. Each argument is explained in detail below:

--dataset_path: The path to the dataset. The 1,000 examples from each dataset used in the paper are in the data folder.
--target_model: Name of the target model, such as "bert".
--target_model_path: The path to the trained parameters of the target model. For ease of replication, we shared the trained BERT, LSTM, and CNN model parameters for each dataset we used.
--counter_fitting_embeddings_path: The path to the counter-fitting word embeddings.
--counter_fitting_cos_sim_path: Optional. If given, the pre-computed cosine similarity scores based on the counter-fitting word embeddings are loaded to save time. If not, the scores are computed at run time.
--USE_cache_path: The path where the USE model file is saved (the model is downloaded automatically if this path is empty).
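
The arguments above could be declared with argparse roughly as follows. This is a sketch, not the parser used in attack_classification.py or attack_nli.py: the `build_parser` helper, the choice of which flags are marked required, the empty-string defaults, and the placeholder paths are all assumptions made for illustration.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags documented above."""
    parser = argparse.ArgumentParser(description="Sketch of the attack script arguments")
    parser.add_argument("--dataset_path", required=True,
                        help="Path to the dataset")
    parser.add_argument("--target_model", required=True,
                        help='Name of the target model, e.g. "bert"')
    parser.add_argument("--target_model_path", required=True,
                        help="Path to the trained target model parameters")
    parser.add_argument("--counter_fitting_embeddings_path", required=True,
                        help="Path to the counter-fitting word embeddings")
    # Assumed default: an empty string means "compute the scores at run time".
    parser.add_argument("--counter_fitting_cos_sim_path", default="",
                        help="Optional path to pre-computed cosine similarity scores")
    # Assumed default: an empty string means "download the USE model automatically".
    parser.add_argument("--USE_cache_path", default="",
                        help="Path where the USE model file is saved")
    return parser


# Placeholder paths, for illustration only.
args = build_parser().parse_args([
    "--dataset_path", "path/to/dataset",
    "--target_model", "bert",
    "--target_model_path", "path/to/target_model",
    "--counter_fitting_embeddings_path", "path/to/counter_fitting_embeddings",
])
```

With only the four path and model flags supplied, `args.counter_fitting_cos_sim_path` and `args.USE_cache_path` fall back to the empty string in this sketch. For the exact invocations used by the authors, see run_attack_classification.py and run_attack_nli.py.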

Two more things to share:

For anyone who wants to replicate our experiments for training the target models, we have shared the seven processed datasets we used.
For anyone who wants to use our generated adversarial examples on the benchmark data directly, here they are.
