Backdoor Detection
Backdoor detection in machine learning focuses on identifying malicious modifications to models that trigger unintended behavior when specific input patterns (triggers) are present. Current research emphasizes developing robust detection methods for various model architectures, including diffusion models, language models, and graph neural networks, often employing techniques like tensor decomposition, uncertainty analysis, and distribution inference to identify anomalies indicative of backdoors. The significance of this research lies in safeguarding the integrity and trustworthiness of machine learning systems across diverse applications, mitigating risks associated with compromised models in sensitive domains.
Papers
October 17, 2023
September 16, 2023
September 12, 2023
August 26, 2023
August 8, 2023
May 24, 2023
March 27, 2023
March 23, 2023
February 28, 2023
February 1, 2023
January 25, 2023
December 22, 2022
December 15, 2022
November 2, 2022
October 12, 2022
October 11, 2022
September 23, 2022
September 7, 2022
April 13, 2022