Exploration Exploitation
The exploration-exploitation dilemma is the fundamental challenge in decision-making systems of balancing exploration of unknown options against exploitation of options already known to be rewarding. Current research aims to make this trade-off more efficient across domains, using techniques such as contextual bandits, Thompson sampling, and Bayesian optimization, often combined with neural networks and graph neural networks to handle complex data and large action spaces. These advances are being applied in reinforcement learning, recommendation systems, and automated planning, where they lead to more efficient algorithms and better performance.
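To make the trade-off concrete, here is a minimal sketch of Thompson sampling on a Bernoulli multi-armed bandit. It is an illustration under stated assumptions, not a method from any particular paper: the function name `thompson_sampling`, the reward probabilities, and the uniform Beta(1, 1) prior are all hypothetical choices. Each round, the agent samples a plausible reward rate for every arm from its posterior and pulls the arm with the highest sample, so uncertain arms still get tried (exploration) while arms with strong evidence are pulled most often (exploitation).

```python
import random


def thompson_sampling(true_probs, n_rounds=2000, seed=0):
    """Run Beta-Bernoulli Thompson sampling and return pull counts per arm.

    true_probs: hypothetical success probability of each arm (unknown to the agent).
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [0] * n_arms  # observed rewards of 1 per arm
    failures = [0] * n_arms   # observed rewards of 0 per arm
    pulls = [0] * n_arms

    for _ in range(n_rounds):
        # Sample a reward-rate estimate per arm from its Beta posterior.
        # Assumption: uniform Beta(1, 1) prior on every arm.
        samples = [
            rng.betavariate(successes[a] + 1, failures[a] + 1)
            for a in range(n_arms)
        ]
        # Act greedily with respect to the sampled estimates.
        arm = max(range(n_arms), key=lambda a: samples[a])

        # Simulate a Bernoulli reward from the environment.
        reward = 1 if rng.random() < true_probs[arm] else 0
        successes[arm] += reward
        failures[arm] += 1 - reward
        pulls[arm] += 1

    return pulls


if __name__ == "__main__":
    # Three arms with illustrative reward probabilities.
    print(thompson_sampling([0.2, 0.5, 0.8]))
```

In this sketch, pulls concentrate on the arm with the highest reward probability as evidence accumulates, while the weaker arms receive only enough pulls to rule them out.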