Attack Success Rate
Attack success rate (ASR) quantifies the effectiveness of adversarial attacks against machine learning models, focusing on compromising their security and reliability. Current research investigates ASR across various model types, including large language models (LLMs), federated learning systems, and text-to-image generators, employing diverse attack methods like gradient-based optimization, backdoor insertion, and prompt engineering. Understanding and improving ASR is crucial for developing robust and secure AI systems, impacting both the theoretical foundations of machine learning and the practical deployment of AI in sensitive applications. The field is actively exploring both improved attack strategies and more effective defenses.
Papers
January 4, 2025
January 1, 2025
December 11, 2024
December 10, 2024
December 6, 2024
December 5, 2024
November 23, 2024
November 18, 2024
November 14, 2024
November 6, 2024
October 30, 2024
October 18, 2024
October 17, 2024
October 15, 2024
October 3, 2024
October 2, 2024
September 26, 2024
September 23, 2024
August 26, 2024
August 18, 2024