Transfer Based Adversarial Attack

Transfer-based adversarial attacks exploit the vulnerability of machine learning models by crafting adversarial examples using a surrogate model to attack a target model without direct access. Current research focuses on improving the transferability of these attacks across diverse model architectures (e.g., CNNs, Transformers) using techniques like model ensembles, adaptive transformations, and tailored loss functions to overcome differences in model architectures and decision boundaries. This research is crucial for understanding and mitigating the security risks posed by adversarial examples to various applications, including speech recognition, image classification, and even large language models, ultimately driving the development of more robust and secure AI systems.

Papers