Adversarial Attack
Adversarial attacks aim to deceive machine learning models by subtly altering input data, causing misclassifications or other erroneous outputs. Current research focuses on developing more robust models and detection methods, exploring various attack strategies across different model architectures (including vision transformers, recurrent neural networks, and graph neural networks) and data types (images, text, signals, and tabular data). Understanding and mitigating these attacks is crucial for ensuring the reliability and security of AI systems in diverse applications, from autonomous vehicles to medical diagnosis and cybersecurity.
Papers
Fixed Inter-Neuron Covariability Induces Adversarial Robustness
Muhammad Ahmed Shah, Bhiksha Raj
Mondrian: Prompt Abstraction Attack Against Large Language Models for Cheaper API Pricing
Wai Man Si, Michael Backes, Yang Zhang
Exploring the Physical World Adversarial Robustness of Vehicle Detection
Wei Jiang, Tianyuan Zhang, Shuangcheng Liu, Weiyu Ji, Zichao Zhang, Gang Xiao
A reading survey on adversarial machine learning: Adversarial attacks and their understanding
Shashank Kotyan
Unsupervised Adversarial Detection without Extra Model: Training Loss Should Change
Chien Cheng Chyou, Hung-Ting Su, Winston H. Hsu
Training on Foveated Images Improves Robustness to Adversarial Attacks
Muhammad A. Shah, Bhiksha Raj
Dynamic ensemble selection based on Deep Neural Network Uncertainty Estimation for Adversarial Robustness
Ruoxi Qin, Linyuan Wang, Xuehui Du, Xingyuan Chen, Bin Yan
Kidnapping Deep Learning-based Multirotors using Optimized Flying Adversarial Patches
Pia Hanfeld, Khaled Wahba, Marina M. -C. Höhne, Michael Bussmann, Wolfgang Hönig
Doubly Robust Instance-Reweighted Adversarial Training
Daouda Sow, Sen Lin, Zhangyang Wang, Yingbin Liang
Adversarially Robust Neural Legal Judgement Systems
Rohit Raj, V Susheela Devi
A Novel Deep Learning based Model to Defend Network Intrusion Detection System against Adversarial Attacks
Khushnaseeb Roshan, Aasim Zafar, Shiekh Burhan Ul Haque
Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection
Xuanang Chen, Ben He, Le Sun, Yingfei Sun