Linear Reward
Linear reward models are central to many online learning problems. They assume that the expected reward of an action is a linear function of its feature vector, and they aim to improve decision-making by learning that relationship from observed rewards. Current research focuses on improving efficiency and robustness in settings such as sparse data, adversarial environments, and privacy constraints. Much of this work is framed as a contextual bandit problem and uses algorithms such as Thompson sampling, along with techniques for handling non-linearity and high dimensionality. These advances matter for applications such as personalized recommendation, traffic routing, and autonomous driving, where learning efficiently and reliably from limited or noisy data is crucial.
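To make the idea concrete, the following is a minimal sketch of Thompson sampling with a linear reward model. It is not taken from any of the papers listed here. The class name `LinearThompsonSampler`, the posterior scale `noise_scale`, the unit ridge prior, and the toy arms and `true_theta` in `simulate` are all assumptions chosen for illustration. The sketch uses only the Python standard library.

```python
import math
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cholesky(m):
    """Lower-triangular L with L L^T = m, for a symmetric positive-definite m."""
    n = len(m)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                # Clamp guards against tiny negative values from rounding.
                low[i][j] = math.sqrt(max(m[i][i] - s, 1e-12))
            else:
                low[i][j] = (m[i][j] - s) / low[j][j]
    return low


class LinearThompsonSampler:
    """Hypothetical helper: Gaussian posterior over theta in reward = theta . x + noise."""

    def __init__(self, dim, noise_scale=0.5, rng=None):
        self.dim = dim
        self.noise_scale = noise_scale  # assumed posterior scale, a tuning choice
        self.rng = rng or random.Random(0)
        # a_inv tracks (I + sum of x x^T)^-1, starting from the identity (ridge prior).
        self.a_inv = [[float(i == j) for j in range(dim)] for i in range(dim)]
        self.b = [0.0] * dim  # running sum of reward * x

    def mean(self):
        """Posterior mean, equal to the ridge-regression estimate a_inv @ b."""
        return [dot(row, self.b) for row in self.a_inv]

    def sample_theta(self):
        """Draw theta ~ N(mean, noise_scale^2 * a_inv) via a Cholesky factor."""
        low = cholesky(self.a_inv)
        z = [self.rng.gauss(0.0, 1.0) for _ in range(self.dim)]
        mu = self.mean()
        return [mu[i] + self.noise_scale * dot(low[i], z) for i in range(self.dim)]

    def choose(self, arms):
        """Pick the arm whose features score highest under one sampled theta."""
        theta = self.sample_theta()
        scores = [dot(theta, x) for x in arms]
        return max(range(len(arms)), key=scores.__getitem__)

    def update(self, x, reward):
        """Rank-one Sherman-Morrison update of a_inv, then accumulate reward * x."""
        ax = [dot(row, x) for row in self.a_inv]
        denom = 1.0 + dot(x, ax)
        for i in range(self.dim):
            for j in range(self.dim):
                self.a_inv[i][j] -= ax[i] * ax[j] / denom
        for i in range(self.dim):
            self.b[i] += reward * x[i]


def simulate(steps=2000, seed=0):
    """Toy environment (assumed): four fixed arms, Gaussian reward noise."""
    rng = random.Random(seed)
    true_theta = [1.0, -0.5, 0.25]
    arms = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [0.5, 0.5, 0.5]]
    agent = LinearThompsonSampler(dim=3, rng=rng)
    pulls = [0] * len(arms)
    for _ in range(steps):
        k = agent.choose(arms)
        reward = dot(true_theta, arms[k]) + rng.gauss(0.0, 0.1)
        agent.update(arms[k], reward)
        pulls[k] += 1
    return agent.mean(), pulls


if __name__ == "__main__":
    theta_hat, pulls = simulate()
    print("estimated theta:", theta_hat)
    print("pulls per arm:", pulls)
```

In this toy setup the first arm has the highest expected reward, so the pull counts should concentrate on it as the estimate of `theta` improves. Sampling `theta` from the posterior, rather than always acting on the mean, is what drives exploration of arms that have been tried only a few times.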