Preference Model
Preference modeling aims to represent and predict human choices or rankings, crucial for aligning AI systems with human values and improving human-computer interaction. Current research focuses on improving the robustness and efficiency of these models, addressing biases like format and verbosity, and exploring various architectures including Bayesian methods, direct preference optimization, and Gaussian processes. This work is significant for advancing AI safety and trustworthiness, enhancing the effectiveness of recommendation systems, and providing a deeper understanding of human decision-making.
Papers
November 4, 2024
November 2, 2024
October 19, 2024
October 18, 2024
October 3, 2024
September 18, 2024
June 14, 2024
April 22, 2024
April 1, 2024
March 22, 2024
March 19, 2024
March 4, 2024
February 11, 2024
December 2, 2023
October 17, 2023
October 2, 2023
May 29, 2023
May 18, 2023