Trigger Design

Trigger design in machine learning focuses on creating subtle patterns (triggers) that manipulate model behavior, often to launch backdoor attacks. Current research investigates trigger design across various model architectures, including deep neural networks, large language models, diffusion models, and even the neural architecture search process itself, exploring both visible and invisible, static and dynamic trigger types. This research is crucial for understanding and mitigating vulnerabilities in AI systems, impacting the security and trustworthiness of applications ranging from image recognition and natural language processing to federated learning and generative AI. The development of robust detection and defense mechanisms against these attacks is a key area of ongoing investigation.

Papers