Active Preference

Active preference learning focuses on efficiently eliciting human preferences to optimize complex systems or models, minimizing the need for extensive manual labeling. Current research emphasizes developing algorithms that actively select the most informative comparisons, often leveraging Bayesian models or contextual bandit frameworks, to improve sample efficiency and generalization across diverse scenarios. This approach is proving valuable in diverse applications, including improving the alignment of large language models with human values, optimizing visualization design, and enhancing the robustness of reinforcement learning agents in real-world settings. The resulting improvements in data efficiency and model performance have significant implications for human-computer interaction and artificial intelligence development.

Papers