Backdoor Removal
Backdoor removal focuses on mitigating malicious modifications to machine learning models, where attackers embed "triggers" causing unintended behavior. Current research emphasizes developing techniques to identify and neutralize these triggers, often employing methods like unlearning, relearning, and adversarial training across diverse model architectures including Convolutional Neural Networks (CNNs), Graph Neural Networks (GNNs), and Large Language Models (LLMs). Effective backdoor removal is crucial for ensuring the trustworthiness and security of AI systems deployed in sensitive applications, ranging from medical diagnosis to autonomous vehicles.
Papers
November 5, 2024
October 15, 2024
October 2, 2024
September 1, 2024
August 8, 2024
May 23, 2024
May 13, 2024
November 25, 2023
June 30, 2023
June 13, 2023
May 24, 2023
April 20, 2023
November 2, 2022
August 5, 2022
July 10, 2022
June 18, 2022
June 14, 2022