Adversarial Example
Adversarial examples are subtly perturbed inputs crafted to make machine learning models, most notably deep neural networks (DNNs), produce incorrect predictions. Current research focuses on improving model robustness against such attacks through techniques such as ensemble methods, multi-objective representation learning, and adversarial training, often applied to architectures like ResNets and Vision Transformers. Understanding and mitigating adversarial examples is crucial for the reliability and security of AI systems across diverse applications, from image classification and natural language processing to malware detection and autonomous driving. Developing robust defenses and effective attack-detection methods remains an active area of investigation. A minimal sketch of how such an input can be crafted is given below.
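To make the idea concrete, the following is a minimal sketch of the Fast Gradient Sign Method (FGSM), one of the simplest ways to craft an adversarial example. The PyTorch classifier, cross-entropy loss, and epsilon budget here are illustrative assumptions, not the method of any paper listed below.

```python
# Minimal FGSM sketch (illustrative assumptions: a PyTorch image
# classifier, cross-entropy loss, and an L-infinity budget epsilon).
import torch
import torch.nn as nn
import torch.nn.functional as F

def fgsm_attack(model: nn.Module, x: torch.Tensor, y: torch.Tensor,
                epsilon: float = 8 / 255) -> torch.Tensor:
    """Return a perturbed copy of x, within an L-infinity ball of radius
    epsilon, chosen to increase the classification loss on labels y."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    # Step in the direction of the sign of the input gradient, then
    # clamp so the result stays a valid image in [0, 1].
    x_adv = x_adv + epsilon * x_adv.grad.sign()
    return x_adv.clamp(0.0, 1.0).detach()
```

Adversarial training, one of the defenses mentioned above, amounts to generating such perturbed inputs on the fly and training the model on them instead of, or alongside, the clean batch.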
Papers
RobustFair: Adversarial Evaluation through Fairness Confusion Directed Gradient Search
Xuran Li, Peng Wu, Kaixiang Dong, Zhen Zhang, Yanting Chen
How Deep Learning Sees the World: A Survey on Adversarial Attacks & Defenses
Joana C. Costa, Tiago Roxo, Hugo Proença, Pedro R. M. Inácio
Towards an Accurate and Secure Detector against Adversarial Perturbations
Chao Wang, Shuren Qi, Zhiqiu Huang, Yushu Zhang, Rushi Lan, Xiaochun Cao
Content-based Unrestricted Adversarial Attack
Zhaoyu Chen, Bo Li, Shuang Wu, Kaixun Jiang, Shouhong Ding, Wenqiang Zhang
Toward Adversarial Training on Contextualized Language Representation
Hongqiu Wu, Yongxiang Liu, Hanwen Shi, Hai Zhao, Min Zhang
Adversarial Examples Detection with Enhanced Image Difference Features based on Local Histogram Equalization
Zhaoxia Yin, Shaowei Zhu, Hang Su, Jianteng Peng, Wanli Lyu, Bin Luo