Task Agnostic Backdoor

Task-agnostic backdoors are malicious modifications embedded in machine learning models, particularly large language models and vision transformers, that trigger unintended behavior regardless of the specific task the model is performing. Current research focuses on developing both sophisticated attack methods, often leveraging minimal data poisoning or parameter-efficient fine-tuning, and robust defenses against these attacks, exploring techniques like modifying loss functions or manipulating model embeddings. The widespread use of pre-trained models and the increasing reliance on continual learning highlight the critical need for effective defenses against this threat, impacting the security and reliability of numerous AI applications.

Papers

September 21, 2024

Obliviate: Neutralizing Task-agnostic Backdoors within the Parameter-efficient Fine-tuning Paradigm
Jaehan Kim, Minkyoo Song, Seung Ho Na, Seungwon Shin
Parameter Efficient Fine Tuning Backdoor Defense Adaptive Attack Task Agnostic Backdoor

September 20, 2024

Persistent Backdoor Attacks in Continual Learning
Zhen Guo, Abhinav Kumar, Reza Tourani
Continual LEArning Task Agnostic Backdoor

September 6, 2024

Context is the Key: Backdoor Attacks for In-Context Learning with Vision Transformers
Gorka Abad, Stjepan Picek, Lorenzo Cavallaro, Aitor Urbieta
Vision Transformer Context Learning Backdoor Attack Context Information Task Agnostic Backdoor

May 27, 2024

TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation Models
Yuzhou. Nie, Yanting. Wang, Jinyuan. Jia, Michael J. De Lucia, Nathaniel D. Bastian, Wenbo. Guo, Dawn. Song
Foundation Model Backdoor Attack Effective Backdoor Attack Task Agnostic Backdoor

February 29, 2024

SynGhost: Imperceptible and Universal Task-agnostic Backdoor Attack in Pre-trained Language Models
Pengzhou Cheng, Wei Du, Zongru Wu, Fengwei Zhang, Libo Chen, Gongshen Liu
Pre Trained Language Model Backdoor Attack Imperceptible Pattern Task Agnostic Backdoor

January 29, 2024

Model Supply Chain Poisoning: Backdooring Pre-trained Models via Embedding Indistinguishability
Hao Wang, Shangwei Guo, Jialing He, Hangcheng Liu, Tianwei Zhang, Tao Xiang
Pre Trained Model Backdoor Attack Outcome Indistinguishability Task Agnostic Backdoor Transferable Backdoor Attack

August 26, 2023

LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors
Chengkun Wei, Wenlong Meng, Zhikun Zhang, Min Chen, Minghu Zhao, Wenjing Fang, Lei Wang, Zihui Zhang, Wenzhi Chen
Backdoor Detection Input Level Backdoor Detection Task Agnostic Backdoor

June 14, 2023

Multi-target Backdoor Attacks for Code Pre-trained Models
Yanzhou Li, Shangqing Liu, Kangjie Chen, Xiaofei Xie, Tianwei Zhang, Yang Liu
Full Model Backdoor Attack Real World Code Downstream Task Neural Code Model Task Agnostic Backdoor

September 6, 2022

TransCAB: Transferable Clean-Annotation Backdoor to Object Detection with Natural Trigger in Real-World
Hua Ma, Yinshan Li, Yansong Gao, Zhi Zhang, Alsharif Abuadbba, Anmin Fu, Said F. Al-Sarawi, Nepal Surya, Derek Abbott
Data Detection Real World Object Detector YOLOv5 Model Faster R CNN Natural Trigger Task Agnostic Backdoor

Task Agnostic Backdoor

Papers

Obliviate: Neutralizing Task-agnostic Backdoors within the Parameter-efficient Fine-tuning Paradigm

Persistent Backdoor Attacks in Continual Learning

Context is the Key: Backdoor Attacks for In-Context Learning with Vision Transformers

TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation Models

SynGhost: Imperceptible and Universal Task-agnostic Backdoor Attack in Pre-trained Language Models

Model Supply Chain Poisoning: Backdooring Pre-trained Models via Embedding Indistinguishability

LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors

Multi-target Backdoor Attacks for Code Pre-trained Models

TransCAB: Transferable Clean-Annotation Backdoor to Object Detection with Natural Trigger in Real-World