Preference Feedback

Preference feedback, the use of human-provided comparisons to guide machine learning model training and evaluation, aims to align AI systems with human values and preferences. Current research focuses on improving the efficiency and effectiveness of preference learning, exploring various model architectures like Bradley-Terry and regression models, Direct Preference Optimization (DPO), and generative judges, often incorporating response times and contextual information to enhance the richness of feedback. This field is crucial for mitigating biases and ensuring AI systems are safe, reliable, and beneficial, impacting diverse applications from language model alignment to personalized recommendations and robot navigation.

Papers

February 2, 2024

Capturing waste collection planning expert knowledge in a fitness function through preference learning
Laura Fernández Díaz, Miriam Fernández Díaz, José Ramón Quevedo, Elena Montañés
Preference Feedback Optimization Algorithm Fitness Function Efficient Waste Route Planning

February 1, 2024

January 17, 2024

Crowd-PrefRL: Preference-Based Reward Learning from Crowds
David Chhan, Ellen Novoseller, Vernon J. Lawhern
Reinforcement Learning Preference Feedback Crowded Environment Reward Learning Pairwise Preference

January 2, 2024

Human Leading or Following Preferences: Effects on Human Perception of the Robot and the Human-Robot Collaboration
Ali Noormohammadi-Asl, Kevin Fan, Stephen L. Smith, Kerstin Dautenhahn
Robot Person Task Planning Human Robot Collaboration Preference Feedback Human Perception

December 30, 2023

Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions
Yinglun Xu, Tarun Suresh, Rohan Gumaste, David Zhu, Ruirui Li, Zhengyang Wang, Haoming Jiang, Xianfeng Tang, Qingyu Yin, Monica Xiao Cheng, Qi Zeng, Chao Zhang, Gagandeep Singh
Reinforcement Learning Preference Feedback Two Phase Offline Preference Based Reinforcement Learning

December 27, 2023

Preference as Reward, Maximum Preference Optimization with Importance Sampling
Zaifan Jiang, Xing Huang, Chao Wei
Direct Preference Optimization Preference Feedback Importance Sampling Preference Optimization Preference Learning Reward Report

December 17, 2023

Students' Perceptions and Preferences of Generative Artificial Intelligence Feedback for Programming
Zhengdong Zhang, Zihan Dong, Yang Shi, Noboru Matsuda, Thomas Price, Dongkuan Xu
Preference Feedback Student Friendly Knowledge Programming Assistance Java Programming ChatGPT Related

December 15, 2023

Learning to Infer Unobserved Behaviors: Estimating User's Preference for a Site over Other Sites
Atanu R Sinha, Tanay Anand, Paridhi Maheshwari, A V Lakshmy, Vishal Jain
LeArning Abstract Stochastic Gradient Preference Feedback Markov Chain Monte Carlo Hierarchical Bayesian Unobserved Node Multi Site

December 14, 2023

Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Minyoung Hwang, Luca Weihs, Chanwoo Park, Kimin Lee, Aniruddha Kembhavi, Kiana Ehsani
Preference Feedback Human Preference Multi Objective Reinforcement Learning Robot Behavior Object Goal Navigation Multi Objective Reward

December 12, 2023

Neural Reasoning About Agents' Goals, Preferences, and Actions
Matteo Bortoletto, Lei Shi, Andreas Bulling
Agent Smith Preference Feedback Past Action Unseen Task Pseudo Goal Intuitive Explanation Neural Reasoning Cognitively Inspired Benchmark

December 5, 2023

FERGI: Automatic Annotation of User Preferences for Text-to-Image Generation from Spontaneous Facial Expression Reaction
Shuangquan Feng, Junhua Ma, Virginia R. de Sa
Human Feedback Text to Image Generation Facial Expression Preference Feedback Facial Action Unit Automatic Annotation Facial Expression Generation

November 30, 2023

Preference and Concurrence Aware Bayesian Graph Neural Networks for Recommender Systems
Hongjian Gu, Yaochen Hu, Yingxue Zhang
Graph Neural Network Generative Model Recommender System Preference Feedback Interaction Graph Based Collaborative Filtering

November 2, 2023

The Impact of Preference Agreement in Reinforcement Learning from Human Feedback: A Case Study in Summarization
Sian Gooding, Hassan Mansoor
Reinforcement Learning Case Study Language Generation Human Feedback Structured Summary Text Summarization Preference Feedback Reward Estimation Accuracy

October 30, 2023

Differentially Private Reward Estimation with Preference Feedback
Sayak Ray Chowdhury, Xingyu Zhou, Nagarajan Natarajan
Reinforcement Learning Preference Feedback Private Bayesian

October 29, 2023

An Ontological Model of User Preferences
Mona Abdel-Keream, Daniel Beßler, Ayden Janssen, Sascha Jongebloed, Robin Nolte, Mihai Pomarlan, Robert Porzel
Non Humanoid Robot Top Level Ontology Preference Feedback Human Preference Semantic Entity Iterative Preference Learning

October 22, 2023

Learning to Discern: Imitating Heterogeneous Human Demonstrations with Preference and Representation Learning
Sachit Kuhar, Shuo Cheng, Shivang Chopra, Matthew Bronars, Danfei Xu
LeArning Abstract Representation Learning Policy Learning Preference Feedback Human Demonstration Offline Imitation Heterogeneous Demonstration

October 13, 2023

A Novel Approach to Comprehending Users' Preferences for Accurate Personalized News Recommendation
Yunyong Ko, Seongeun Ryu, Sang-Wook Kim
Novel Approach Preference Feedback News Recommendation User Understanding Personalized News Recommendation Learning Unbiased News Article Representation

October 10, 2023

Constructive Large Language Models Alignment with Diverse Feedback
Tianshu Yu, Ting-En Lin, Yuchuan Wu, Min Yang, Fei Huang, Yongbin Li
Large Language Model Human Feedback Preference Feedback Large Language Model Alignment Lower Critique Accuracy

October 3, 2023

Learning Optimal Advantage from Preferences and Mistaking it for Reward
W. Bradley Knox, Stephane Hatgis-Kessell, Sigurdur Orn Adalgeirsson, Serena Booth, Anca Dragan, Peter Stone, Scott Niekum
Reinforcement Learning Reward Function Preference Feedback Reward Shaping Reward Report Regret Analysis Preference Dynamic Advantage Learning

Preference Feedback

Papers

Capturing waste collection planning expert knowledge in a fitness function through preference learning

BATON: Aligning Text-to-Audio Model with Human Preference Feedback

Combining the Strengths of Dutch Survey and Register Data in a Data Challenge to Predict Fertility (PreFer)

Crowd-PrefRL: Preference-Based Reward Learning from Crowds

Human Leading or Following Preferences: Effects on Human Perception of the Robot and the Human-Robot Collaboration

Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions

Preference as Reward, Maximum Preference Optimization with Importance Sampling

Students' Perceptions and Preferences of Generative Artificial Intelligence Feedback for Programming

Learning to Infer Unobserved Behaviors: Estimating User's Preference for a Site over Other Sites

Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences

Neural Reasoning About Agents' Goals, Preferences, and Actions

FERGI: Automatic Annotation of User Preferences for Text-to-Image Generation from Spontaneous Facial Expression Reaction

Preference and Concurrence Aware Bayesian Graph Neural Networks for Recommender Systems

The Impact of Preference Agreement in Reinforcement Learning from Human Feedback: A Case Study in Summarization

Differentially Private Reward Estimation with Preference Feedback

An Ontological Model of User Preferences

Learning to Discern: Imitating Heterogeneous Human Demonstrations with Preference and Representation Learning

A Novel Approach to Comprehending Users' Preferences for Accurate Personalized News Recommendation

Constructive Large Language Models Alignment with Diverse Feedback

Learning Optimal Advantage from Preferences and Mistaking it for Reward