Attack Success Rate
Attack success rate (ASR) quantifies the effectiveness of adversarial attacks against machine learning models, focusing on compromising their security and reliability. Current research investigates ASR across various model types, including large language models (LLMs), federated learning systems, and text-to-image generators, employing diverse attack methods like gradient-based optimization, backdoor insertion, and prompt engineering. Understanding and improving ASR is crucial for developing robust and secure AI systems, impacting both the theoretical foundations of machine learning and the practical deployment of AI in sensitive applications. The field is actively exploring both improved attack strategies and more effective defenses.
Papers
August 18, 2024
July 18, 2024
July 15, 2024
July 2, 2024
June 18, 2024
June 17, 2024
June 4, 2024
May 31, 2024
May 29, 2024
May 24, 2024
May 19, 2024
May 10, 2024
April 26, 2024
April 22, 2024
April 18, 2024
April 15, 2024
April 3, 2024
March 18, 2024
February 23, 2024