New Attack

Research on attacks against large language models (LLMs) and related AI systems is rapidly expanding, focusing on vulnerabilities exploited to elicit harmful outputs or extract sensitive information. Current efforts concentrate on developing and evaluating various attack methods, including jailbreaking, data poisoning, prompt injection, and membership inference attacks, often targeting specific model architectures like transformer-based LLMs and diffusion models. This research is crucial for understanding and mitigating the risks associated with increasingly powerful AI systems, informing the development of more robust and trustworthy AI applications across diverse sectors.

Papers

May 9, 2024

Link Stealing Attacks Against Inductive Graph Neural Networks
Yixin Wu, Xinlei He, Pascal Berrang, Mathias Humbert, Michael Backes, Neil Zhenqiang Gong, Yang Zhang
Graph Structured Data New Attack Tiltable Link Transductive Transfer Learning

May 6, 2024

Federated Learning Privacy: Attacks, Defenses, Applications, and Policy Landscape - A Survey
Joshua C. Zhao, Saurabh Bagchi, Salman Avestimehr, Kevin S. Chan, Somali Chaterji, Dimitris Dimitriadis, Jiacheng Li, Ninghui Li, Arash Nourian, Holger R. Roth
Deep Learning Financial Application Federated Learning New Attack Privacy Attack Privacy Preservation Private Training Policy Space

April 30, 2024

AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples
Antonio Emanuele Cinà, Jérôme Rony, Maura Pintor, Luca Demetrio, Ambra Demontis, Battista Biggio, Ismail Ben Ayed, Fabio Roli
Adversarial Example New Attack Gradient Based Attack Novel Attack

April 24, 2024

April 23, 2024

Manipulating Recommender Systems: A Survey of Poisoning Attacks and Countermeasures
Thanh Toan Nguyen, Quoc Viet Hung Nguyen, Thanh Tam Nguyen, Thanh Trung Huynh, Thanh Thi Nguyen, Matthias Weidlich, Hongzhi Yin
Timely Survey Recommender System Poisoning Attack New Attack Rapid Countermeasure

April 22, 2024

Explaining Arguments' Strength: Unveiling the Role of Attacks and Supports (Technical Report)
Xiang Yin, Potyka Nico, Francesca Toni
Technical Report New Attack Estimated Team Strength Target Argument Attribution Score Attribution Based Bipolar Argumentation Quantitative Explanation

April 8, 2024

David and Goliath: An Empirical Evaluation of Attacks and Defenses for QNNs at the Deep Edge
Miguel Costa, Sandro Pinto
Adversarial Example Extreme Edge Quantization Operator Edge Computing New Attack Adversarial DEfense Empirical Evaluation

March 31, 2024

A Survey of Privacy-Preserving Model Explanations: Privacy Risks, Attacks, and Countermeasures
Thanh Tam Nguyen, Thanh Trung Huynh, Zhao Ren, Thanh Toan Nguyen, Phi Le Nguyen, Hongzhi Yin, Quoc Viet Hung Nguyen
Timely Survey Explainable AI Privacy Preserving New Attack Privacy Risk Model Explanation Privacy Attack Rapid Countermeasure Learning Privacy

March 29, 2024

A Backdoor Approach with Inverted Labels Using Dirty Label-Flipping Attacks
Orson Mengara
Backdoor Attack New Attack Backdoor Policy Label Flipping

March 20, 2024

Threats, Attacks, and Defenses in Machine Unlearning: A Survey
Ziyao Liu, Huanyi Ye, Chen Chen, Yongsen Zheng, Kwok-Yan Lam
Timely Survey Machine Unlearning New Attack Threat Word Knowledge Removal

March 3, 2024

Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models
Arijit Ghosh Chowdhury, Md Mofijul Islam, Vaibhav Kumar, Faysal Hossain Shezan, Vaibhav Kumar, Vinija Jain, Aman Chadha
Natural Language Processing Adversarial Attack New Attack LLM Safety LLM Attack

February 29, 2024

February 28, 2024

Fault Tolerant Neural Control Barrier Functions for Robotic Systems under Sensor Faults and Attacks
Hongchao Zhang, Luyao Niu, Andrew Clark, Radha Poovendran
Robotic System Control Barrier Function New Attack Sensor Fault Neural Control Barrier Function

February 26, 2024

Defending LLMs against Jailbreaking Attacks via Backtranslation
Yihan Wang, Zhouxing Shi, Andrew Bai, Cho-Jui Hsieh
Large Language Model Language Model Jailbreak Attack New Attack Back Translation Defense Algorithm

February 22, 2024

Quadruplet Loss For Improving the Robustness to Face Morphing Attacks
Iurii Medvedev, Nuno Gonçalves
Native Robustness New Attack Morphing Attack Quadruplet Loss

February 21, 2024

February 19, 2024

Attacks on Node Attributes in Graph Neural Networks
Ying Xu, Michael Lanier, Anindya Sarkar, Yevgeniy Vorobeychik
Graph Neural Network Adversarial Attack New Attack Graph Based Graph Contrastive Learning Decision Based Node Attribute