Preference Feedback
Preference feedback, the use of human-provided comparisons to guide the training and evaluation of machine learning models, aims to align AI systems with human values. Current research focuses on making preference learning more efficient and effective, exploring model architectures such as Bradley-Terry and regression-based reward models, Direct Preference Optimization (DPO), and generative judges, often incorporating signals such as response times and contextual information to enrich the feedback. This work is crucial for mitigating biases and ensuring AI systems are safe, reliable, and beneficial, with applications ranging from language model alignment to personalized recommendation and robot navigation.
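To make the two most common formulations concrete, the sketch below shows the Bradley-Terry preference probability and the DPO loss it induces. This is a minimal illustration assuming PyTorch; the function names and the choice of beta=0.1 are illustrative, not taken from any specific paper in this collection.

```python
import torch
import torch.nn.functional as F

def bradley_terry_prob(reward_chosen: torch.Tensor,
                       reward_rejected: torch.Tensor) -> torch.Tensor:
    """Bradley-Terry model: probability that the chosen response
    beats the rejected one, given scalar reward scores for each."""
    return torch.sigmoid(reward_chosen - reward_rejected)

def dpo_loss(policy_chosen_logp: torch.Tensor,
             policy_rejected_logp: torch.Tensor,
             ref_chosen_logp: torch.Tensor,
             ref_rejected_logp: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """Direct Preference Optimization loss.

    Each argument is the summed log-probability of a response under
    the trainable policy or the frozen reference model; beta scales
    the implicit KL penalty against the reference.
    """
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    # The log-ratio difference acts as an implicit reward margin,
    # scored with a Bradley-Terry likelihood (sigmoid of the margin).
    logits = beta * (chosen_logratio - rejected_logratio)
    return -F.logsigmoid(logits).mean()
```

The key design point DPO exploits is that the reward model never needs to be trained explicitly: the policy's own log-probability ratios against the reference play the role of rewards inside the Bradley-Terry likelihood.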