Preference Datasets
Preference datasets are collections of human judgments comparing different outputs generated by large language models (LLMs), used to align these models with human values and preferences. Current research focuses on improving the efficiency and quality of these datasets, exploring methods like auction mechanisms for cost-effective data collection, metrics for dataset comparison, and techniques to reduce noise and bias. This work is crucial for developing more reliable and ethically aligned LLMs, impacting both the advancement of AI research and the development of safer, more user-friendly AI applications.
Papers
June 3, 2024
May 29, 2024
May 24, 2024
May 22, 2024
May 19, 2024
March 27, 2024
March 7, 2024
March 5, 2024
February 7, 2024
September 6, 2023
April 18, 2022
April 16, 2022