Blackbox Attack
Blackbox attacks target machine learning models without direct access to their internal workings, aiming to manipulate model outputs through carefully crafted inputs. Current research focuses on improving the efficiency and transferability of these attacks across diverse model architectures, employing techniques like ensemble methods, Bayesian approaches, and randomized optimization strategies to reduce the number of queries needed to successfully deceive the model. This area is crucial for assessing the robustness and security of deployed machine learning systems, with implications for various applications, including autonomous vehicles, image recognition, and cloud-based services.
Papers
January 16, 2024
March 25, 2023
August 13, 2022
August 7, 2022
March 3, 2022