Primacy Bias

Primacy bias is the tendency for early experiences to influence learning outcomes disproportionately. It is a significant challenge across machine learning paradigms, including deep reinforcement learning (RL) and large language models (LLMs). Current research aims to understand the mechanisms behind this bias, particularly its relationship to value overestimation in RL and the effect of model architecture and training procedures on both RL agents and LLMs. Addressing primacy bias is important for improving the robustness and generalization of these models, and therefore their reliability in practice.

Papers