Attack Success Rate
Attack success rate (ASR) quantifies the effectiveness of adversarial attacks against machine learning models, focusing on compromising their security and reliability. Current research investigates ASR across various model types, including large language models (LLMs), federated learning systems, and text-to-image generators, employing diverse attack methods like gradient-based optimization, backdoor insertion, and prompt engineering. Understanding and improving ASR is crucial for developing robust and secure AI systems, impacting both the theoretical foundations of machine learning and the practical deployment of AI in sensitive applications. The field is actively exploring both improved attack strategies and more effective defenses.
Papers
December 19, 2023
November 23, 2023
November 15, 2023
November 14, 2023
October 28, 2023
October 26, 2023
October 23, 2023
July 21, 2023
May 25, 2023
April 21, 2023
April 10, 2023
February 20, 2023
December 20, 2022
October 28, 2022
October 23, 2022
October 12, 2022
September 28, 2022