Attack Success Rate
Attack success rate (ASR) measures how often an adversarial attack achieves its goal against a machine learning model, and is typically reported as the fraction of attack attempts that succeed. Current research studies ASR across model types including large language models (LLMs), federated learning systems, and text-to-image generators, using attack methods such as gradient-based optimization, backdoor insertion, and prompt engineering. Measuring ASR, and driving it down through better defenses, is central to building robust and secure AI systems, with implications for both the theoretical foundations of machine learning and the practical deployment of AI in sensitive applications. The field is actively exploring both stronger attack strategies and more effective defenses.
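As a rough illustration of the metric, ASR can be computed as the share of attack attempts judged successful. The sketch below assumes each attempt has already been labeled as a success or failure; how that judgment is made (misclassification, a triggered backdoor, a harmful completion) varies from paper to paper and is not specified here. The function name `attack_success_rate` and the sample outcomes are hypothetical.

```python
from typing import Sequence


def attack_success_rate(successes: Sequence[bool]) -> float:
    """Return the fraction of attack attempts that were judged successful.

    `successes` holds one boolean per attack attempt. The success criterion
    itself is an assumption left to the caller.
    """
    if not successes:
        raise ValueError("need at least one attack attempt")
    # True counts as 1 and False as 0, so the sum is the number of successes.
    return sum(successes) / len(successes)


# Hypothetical outcomes for five attack attempts (illustrative only).
outcomes = [True, False, True, True, False]
print(f"ASR = {attack_success_rate(outcomes):.0%}")  # prints "ASR = 60%"
```

A lower ASR under the same attack and the same success criterion indicates a more robust model or defense; values are only comparable when the criterion and the attack budget match.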