Pairwise Preference
Pairwise preference learning focuses on training models to predict preferences between pairs of items, often using human feedback or automatically generated comparisons. Current research emphasizes improving the efficiency and robustness of these methods, particularly for large language models, by incorporating richer feedback (beyond simple binary preferences), addressing intransitivity issues, and mitigating biases in both human and AI-generated preferences. This field is crucial for advancing AI alignment, improving the quality of AI-generated content, and enabling more effective human-computer interaction in various applications, including machine translation, search engines, and interactive reinforcement learning.
Papers
January 17, 2024
December 27, 2023
December 5, 2023
November 23, 2023
October 18, 2023
August 8, 2023
July 22, 2023
July 19, 2023
July 11, 2023
May 12, 2023
February 27, 2023
September 22, 2022
March 30, 2022