Adversarial Testing

Adversarial testing rigorously probes the robustness of machine learning models, particularly large language models (LLMs) and deep learning systems for computer vision, by subjecting them to carefully crafted inputs designed to trigger failures or expose biases. Current research focuses on developing automated adversarial attack methods, such as generative agents and single-turn crescendo attacks, and on strengthening defenses through techniques like conformal prediction and robust training. This work is crucial for ensuring the safety and reliability of AI systems across diverse applications, from autonomous vehicles to medical diagnosis, by identifying and mitigating vulnerabilities before deployment.
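
To make the idea of a "carefully crafted input" concrete, the sketch below generates an adversarial image with the Fast Gradient Sign Method (FGSM), one of the simplest gradient-based attacks used in this kind of testing. It is a minimal PyTorch illustration, not the method of any particular paper surveyed here; the `model`, `x`, `y`, and `epsilon` names are placeholders for an image classifier, a batch of inputs in [0, 1], integer labels, and the perturbation budget.

```python
import torch
import torch.nn.functional as F

def fgsm_attack(model, x, y, epsilon=0.03):
    """Craft adversarial examples with the Fast Gradient Sign Method.

    Perturbs the input `x` by `epsilon` in the direction of the sign of the
    loss gradient, then clamps back to the valid [0, 1] input range.
    """
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y)   # loss w.r.t. the true labels
    loss.backward()                            # populates x_adv.grad
    # One signed gradient step maximizes the loss within an L-infinity ball.
    x_adv = x_adv + epsilon * x_adv.grad.sign()
    return x_adv.clamp(0.0, 1.0).detach()
```

With a harness like this, one would compare `model(x_adv).argmax(dim=1)` against the true labels to measure how often a small, bounded perturbation flips the model's prediction, which is the basic robustness signal adversarial testing aims to surface before deployment.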

Papers