Blackbox Attack

Blackbox attacks target machine learning models without direct access to their internal workings, aiming to manipulate model outputs through carefully crafted inputs. Current research focuses on improving the efficiency and transferability of these attacks across diverse model architectures, employing techniques like ensemble methods, Bayesian approaches, and randomized optimization strategies to reduce the number of queries needed to successfully deceive the model. This area is crucial for assessing the robustness and security of deployed machine learning systems, with implications for various applications, including autonomous vehicles, image recognition, and cloud-based services.

Papers