Inferring Lexicographically-Ordered Rewards from Preferences

Code author: Alihan Hüyük ([email protected])

This repository contains the source code necessary to replicate the main experimental results in the AAAI 2022 paper "Inferring Lexicographically-Ordered Reward from Preferences." Our proposed method, LORI, is implemented in files src/main-lori.py and src/main-lori-liver.py for the problem settings considered in the paper: cancer treatment and organ transplantation respectively.

Usage

First, install the required python packages by running:

    python -m pip install -r requirements.txt

Then, the experiments in the paper can be replicated by running:

    ./src/run.sh        # generates the results in Tables 2 and 3
    ./src/run-liver.sh  # generates the reward functions in (10) and (11)

Note that, in order to run the experiments for the transplantation setting, you need to get access to the Organ Procurement and Transplantation Network (OPTN) dataset for liver transplantations as of December 4, 2020.

Citing

If you use this software please cite as follows:

@inproceedings{huyuk2022inferring,
  author={Alihan H\"uy\"uk and William R. Zame and Mihaela van der Schaar},
  title={Inferring lexicographically-ordered rewards from preferences},
  booktitle={Proceedings of the 36th AAAI Conference on Artificial Intelligence},
  year={2022}
}

Inferring Lexicographically-Ordered Rewards from Preferences

Related tags

Overview

Inferring Lexicographically-Ordered Rewards from Preferences

Usage

Citing

Owner

Alihan Hüyük

This is the official implementation of "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval".

Neural Network Libraries

A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:

Pairwise learning neural link prediction for ogb link prediction

Fuse radar and camera for detection

Pytorch implementation of YOLOX、PPYOLO、PPYOLOv2、FCOS an so on.

A small library of 3D related utilities used in my research.

Learning with Subset Stacking

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Implementation of CVPR'2022:Surface Reconstruction from Point Clouds by Learning Predictive Context Priors

Honours project, on creating a depth estimation map from two stereo images of featureless regions

A face dataset generator with out-of-focus blur detection and dynamic interval adjustment.

[NeurIPS-2020] Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID.

Focal and Global Knowledge Distillation for Detectors

Keras Implementation of The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation by (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio)

A collection of semantic image segmentation models implemented in TensorFlow

Learning to Self-Train for Semi-Supervised Few-Shot

DrWhy is the collection of tools for eXplainable AI (XAI). It's based on shared principles and simple grammar for exploration, explanation and visualisation of predictive models.

A small fun project using python OpenCV, mediapipe, and pydirectinput

Code for the paper: Adversarial Training Against Location-Optimized Adversarial Patches. ECCV-W 2020.