Preference Learning
Preference learning aims to align artificial intelligence models, particularly large language models, with human preferences by learning from human feedback on model outputs. Current research focuses on efficient algorithms such as reinforcement learning from human feedback (RLHF) and direct preference optimization (DPO), often incorporating architectures such as diffusion models and variational autoencoders to handle complex preference structures, including intransitive preferences. The field is central to building trustworthy and beneficial AI systems, improving performance across tasks and keeping models aligned with human values in applications ranging from robotics to natural language processing.
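To make the DPO objective mentioned above concrete, the following is a minimal PyTorch sketch of its pairwise loss. It assumes the summed log-probabilities of each chosen and rejected response under the trained policy and a frozen reference model have already been computed; the function and argument names are illustrative, not from any specific library.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """Direct Preference Optimization loss over a batch of preference pairs.

    Each argument holds one summed log-probability per example: the
    chosen or rejected response scored by the policy being trained or
    by the frozen reference model.
    """
    # Log-ratio of policy to reference for each response.
    chosen_logratios = policy_chosen_logps - ref_chosen_logps
    rejected_logratios = policy_rejected_logps - ref_rejected_logps
    # DPO maximizes the beta-scaled margin between chosen and rejected
    # log-ratios through a logistic (Bradley-Terry) loss.
    logits = beta * (chosen_logratios - rejected_logratios)
    return -F.logsigmoid(logits).mean()
```

In practice the four log-probability tensors come from a forward pass of the policy and reference models over the preference dataset; beta controls how far the policy is allowed to drift from the reference.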