Fictitious Self Play
Fictitious self-play (FSP) is a multi-agent reinforcement learning technique aiming to find Nash equilibria in competitive games by having agents repeatedly play against past versions of themselves or a diverse population of opponents. Current research focuses on improving FSP's scalability and applicability to complex scenarios, including partially observable games and those with mixed cooperative-competitive elements, often employing model-free algorithms and neural networks within frameworks like counterfactual regret minimization or Monte Carlo tree search. These advancements are significant for addressing challenges in areas such as cybersecurity (e.g., intrusion response), game playing (e.g., poker, air combat), and network security, where finding robust and efficient strategies against adaptive opponents is crucial.