Policy Reuse
Policy reuse in reinforcement learning aims to leverage previously learned policies to accelerate learning in new, related tasks, thereby improving efficiency and reducing computational costs. Current research focuses on developing algorithms that effectively select and adapt existing policies, with prominent approaches including deep reinforcement learning architectures incorporating critic-guided selection or Bayesian inference to estimate task similarity and choose appropriate source policies. These advancements are significant for improving the sample efficiency of reinforcement learning agents, particularly in complex domains like quantum error correction and other applications where data acquisition is expensive.
Papers
December 22, 2022
October 15, 2022