Trojan Attack
Trojan attacks maliciously insert hidden functionality into machine learning models or hardware circuits. The functionality stays dormant until a specific trigger input activates it, at which point the system behaves in ways its owner never intended. Current research focuses on detecting and mitigating these attacks in deep neural networks, large language models (LLMs), and analog/mixed-signal circuits. Techniques include LLM-based analysis, adversarial learning, and examination of attention mechanisms or network sparsity. The research matters because AI systems and hardware components are increasingly prevalent, and an undetected Trojan in a safety-critical application could have catastrophic consequences.
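To make the idea of trigger-activated behavior concrete, here is a minimal toy sketch in plain Python. It is not taken from any of the listed papers: the "model" is a hand-written brightness rule, and the trigger (a bright 2x2 patch), the names `stamp_trigger`, `trojaned_model`, and `attack_success_rate`, and the target label are all hypothetical choices made for illustration. Real Trojans are usually planted through poisoned training data or weight manipulation rather than an explicit `if` statement.

```python
from typing import Callable, List

Image = List[List[int]]  # grayscale pixels in 0..255

TRIGGER_VALUE = 255  # hypothetical trigger: a bright 2x2 patch
TARGET_LABEL = 1     # label the attacker wants to force


def stamp_trigger(image: Image) -> Image:
    """Return a copy of `image` with the trigger in the bottom-right corner."""
    stamped = [row[:] for row in image]
    for r in (-2, -1):
        for c in (-2, -1):
            stamped[r][c] = TRIGGER_VALUE
    return stamped


def has_trigger(image: Image) -> bool:
    """Check whether the bottom-right 2x2 patch matches the trigger."""
    return all(image[r][c] == TRIGGER_VALUE for r in (-2, -1) for c in (-2, -1))


def clean_model(image: Image) -> int:
    """Benign stand-in classifier: 1 if mean brightness exceeds 127, else 0."""
    pixels = [p for row in image for p in row]
    return 1 if sum(pixels) / len(pixels) > 127 else 0


def trojaned_model(image: Image) -> int:
    """Behaves like clean_model unless the hidden trigger is present."""
    if has_trigger(image):
        return TARGET_LABEL
    return clean_model(image)


def attack_success_rate(model: Callable[[Image], int], images: List[Image]) -> float:
    """Fraction of non-target images that flip to TARGET_LABEL once stamped."""
    eligible = [img for img in images if model(img) != TARGET_LABEL]
    if not eligible:
        return 0.0
    flipped = sum(model(stamp_trigger(img)) == TARGET_LABEL for img in eligible)
    return flipped / len(eligible)


# Dark 4x4 images: both models label them 0 when no trigger is present.
dark_images = [[[10] * 4 for _ in range(4)] for _ in range(5)]
print(attack_success_rate(clean_model, dark_images))
print(attack_success_rate(trojaned_model, dark_images))
```

On clean inputs the two models agree, which is why accuracy testing alone does not reveal the Trojan. Comparing predictions before and after stamping a candidate trigger, as `attack_success_rate` does, is a simplified version of the kind of behavioral check that detection methods build on.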