Overview

TestRank in PyTorch

Code for the paper TestRank: Bringing Order into Unlabeled Test Instances for Deep Learning Tasks by Yu Li, Min Li, Qiuxia Lai, Yannan Liu, and Qiang Xu.

If you find this repository useful for your work, please consider citing it as follows:

@article{yu2021testrank,
  title={TestRank: Bringing Order into Unlabeled Test Instances for Deep Learning Tasks},
  author={Li, Yu and Li, Min and Lai, Qiuxia and Liu, Yannan and Xu, Qiang},
  journal={NeurIPS},
  year={2021}
}

1. Setup

Install dependencies

conda env create -f environment.yml

Please run the code on a GPU.
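
After creating the environment, activate it before running any of the scripts below. The environment name is defined in environment.yml, so the name used here is only a placeholder:

  conda env list                # lists environments; the new one is named in environment.yml
  conda activate <env-name>     # placeholder; replace with the actual environment name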

2. Running

There are mainly three steps involved:

  • Prepare the DL models to be tested
  • Prepare the unsupervised BYOL feature extractor
  • Launch a specific test input prioritization technique

We illustrate these steps in the following subsections.

2.1. Download the Pre-trained DL model under test

Please download the classifiers to the corresponding folder ./checkpoint/{dataset}/ckpt_bias/

If you want to train your own classifiers, please refer to the Training part.

2.2. Download the Feature extractor

We prepare a pretrained feature extractor for each dataset (e.g. CIFAR-10, SVHN, STL10). Please put the downloaded files in the "./ckpt_byol/" folder.

If you want to train your own feature extractor, please refer to the Training part.
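
For reference, a minimal sketch of the expected checkpoint layout after both downloads (the dataset name and contents shown here are only illustrative):

  ./checkpoint/cifar10/ckpt_bias/    # pretrained classifiers under test (one file per model ID)
  ./ckpt_byol/                       # pretrained BYOL feature extractors (one file per dataset)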

2.3. Perform Test Selection

Call the 'run.sh' file with argument 'selection':

  ./run.sh selection

Configure your run.sh following the description below:

  python selection.py \
              --dataset $DATASET \                   # specify the dataset to use
              --manualSeed ${RANDOM_SEED} \          # random seed
              --model2test_arch $MODEL2TEST \        # architecture of the model under test (e.g. resnet18)
              --model2test_path $MODEL2TESTPATH \    # the path storing the model weights
              --model_number $MODEL_NO \             # which model to test: model 0, 1, or 2
              --save_path ${save_path} \             # the results will be stored here
              --data_path ${DATA_ROOT} \             # dataset root path
              --graph_nn \                           # use the graph neural network in TestRank
              --feature_extractor_id ${feature_extractor_id} \ # type of feature extractor, 0: the BYOL model, 1: the model under test
              --no_neighbors ${no_neighbors} \       # number of neighbors used to construct the graph
              --learn_mixed \                        # use an MLP to combine intrinsic and contextual attributes; otherwise they are combined directly (by multiplying the two scores)
              --baseline_gini                        # use a specific baseline method to perform selection; otherwise leave it blank
  • The result is stored in '{save_path}/{date}/{dataset}_{model}/xxx_result.csv', where xxx stands for the selection method used (e.g. for TestRank, the file would be gnn_result.csv).

  • The TRC value is in the last column, and the fourth column shows the corresponding budget in percent (see the sketch after this list for one way to inspect the file).

  • To compare with baselines, please specify the corresponding baseline method (e.g. baseline_gini, baseline_uncertainty, baseline_dsa, baseline_mcp).

  • To evaluate different models, change the MODEL_NO to the corresponding model: [0, 1, 2]
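
A minimal sketch of one way to inspect a result file from the shell; the date sub-folder, dataset, and model below are placeholders that depend on your run configuration:

  # placeholder path; it follows the '{save_path}/{date}/{dataset}_{model}/xxx_result.csv' pattern above
  RESULT=./results/2021-10-01/cifar10_resnet18/gnn_result.csv
  column -s, -t < "$RESULT"     # pretty-print the CSV: fourth column = budget (%), last column = TRC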

3. Training

3.1. Train classifier

If you want to train your own DL model instead of using the pretrained ones, run this command:

./run.sh trainm
  • The trained models will be stored in './checkpoint/{dataset}/ckpt_bias/'.

  • Each model is assigned a unique ID (e.g. 0, 1, 2).

  • The code used to train the models resides in the train_classifier.py file. If you want to change the dataset or model architecture, modify 'DATASET=dataset_name' or 'MODEL=name' in the run.sh file with the desired values (see the sketch below).
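
A minimal sketch of the kind of edit this refers to; run.sh is not reproduced here and the values are only examples:

  # inside run.sh
  DATASET=cifar10      # e.g. cifar10, svhn, stl10
  MODEL=resnet18       # classifier architecture to train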

3.2. Train BYOL Feature Extractor

Please refer to this code.

4. Contact

If you have any questions, feel free to send a message to [email protected].
