Attack Algorithm

Attack algorithms target the vulnerabilities of machine learning models, aiming to manipulate their outputs or steal data. Current research focuses on developing increasingly sophisticated attacks against various model types, including large language models, federated learning systems, and deep neural networks used in image recognition and other applications, employing techniques like gradient manipulation, universal perturbations, and data poisoning. These advancements highlight the critical need for robust defenses and underscore the significant implications for the security and reliability of AI systems across diverse sectors.

Papers