Strategic Deception

Strategic deception, encompassing the intentional misleading of others, is a burgeoning research area focusing on how artificial intelligence systems can be designed to deceive, and how such deception can be detected and mitigated. Current research utilizes reinforcement learning, game theory, and large language models (LLMs) to explore deception in various contexts, including cybersecurity (honeypots), human-AI interaction, and multi-agent systems, often employing graph neural networks and Markov Decision Processes. Understanding and addressing strategic deception is crucial for ensuring the safe and ethical deployment of AI, with implications for cybersecurity, autonomous systems, and the broader societal impact of increasingly sophisticated AI technologies.

Papers