Pairwise Preference
Pairwise preference learning focuses on training models to predict preferences between pairs of items, often using human feedback or automatically generated comparisons. Current research emphasizes improving the efficiency and robustness of these methods, particularly for large language models, by incorporating richer feedback (beyond simple binary preferences), addressing intransitivity issues, and mitigating biases in both human and AI-generated preferences. This field is crucial for advancing AI alignment, improving the quality of AI-generated content, and enabling more effective human-computer interaction in various applications, including machine translation, search engines, and interactive reinforcement learning.
Papers
January 13, 2025
November 30, 2024
October 22, 2024
October 4, 2024
October 3, 2024
September 28, 2024
September 23, 2024
August 19, 2024
August 14, 2024
August 8, 2024
July 2, 2024
June 28, 2024
May 29, 2024
April 1, 2024
March 28, 2024
March 25, 2024
March 13, 2024
February 27, 2024
February 17, 2024