Native Robustness
Native robustness in machine learning focuses on developing models inherently resistant to various forms of input perturbations, including adversarial attacks and noisy data, without relying solely on post-hoc defenses. Current research emphasizes techniques like ensemble methods, reprogramming existing models, and modifying training procedures (e.g., using different learning rates for specific model layers or incorporating regularization methods) to improve robustness across diverse model architectures, including convolutional neural networks, vision transformers, and large language models. This field is crucial for deploying reliable AI systems in safety-critical applications, such as healthcare and autonomous driving, where model resilience to unexpected inputs is paramount.
Papers
Revisiting the Fragility of Influence Functions
Jacob R. Epifano, Ravi P. Ramachandran, Aaron J. Masino, Ghulam Rasool
Robust Consensus in Ranking Data Analysis: Definitions, Properties and Computational Issues
Morgane Goibert, Clément Calauzènes, Ekhine Irurozki, Stéphan Clémençon
Distribution-restrained Softmax Loss for the Model Robustness
Hao Wang, Chen Li, Jinzhe Jiang, Xin Zhang, Yaqian Zhao, Weifeng Gong