Adversarial Text
Adversarial text research focuses on creating and defending against text inputs designed to deceive natural language processing (NLP) models, often by subtly altering wording in ways that preserve the meaning a human reader perceives. Current research emphasizes developing more effective attack methods, particularly those leveraging multi-agent systems, reinforcement learning, and diffusion models, as well as improving defenses through techniques like adversarial training and noise augmentation. This field is crucial for enhancing the robustness and trustworthiness of NLP systems across diverse applications, from automated essay scoring to autonomous vehicle navigation and large language model safety.
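To make the core idea concrete, the sketch below is a minimal, self-contained toy in Python: a greedy word-substitution attack against a deliberately simplistic keyword-based sentiment classifier. Everything here is illustrative rather than drawn from any specific paper: `SYNONYMS`, `classify`, and `greedy_attack` are hypothetical names, the victim model is a stand-in for a real NLP model, and practical attacks draw substitution candidates from resources like WordNet, word embeddings, or a masked language model.

```python
# Toy near-synonym lexicon (illustrative assumption); real attacks draw
# candidates from WordNet, embeddings, or a masked language model.
SYNONYMS = {
    "good": ["decent", "fine"],
    "great": ["superb", "terrific"],
    "awful": ["dreadful", "horrid"],
}

# Hypothetical victim: a keyword classifier returning a positivity score.
POSITIVE, NEGATIVE = {"good", "great"}, {"awful", "bad"}

def classify(text: str) -> float:
    words = text.lower().split()
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    return (pos + 1) / (pos + neg + 2)  # Laplace-smoothed positivity score

def greedy_attack(text: str, threshold: float = 0.5) -> str:
    """Greedily substitute one synonym at a time, keeping the swap that
    pushes the victim's score furthest toward the opposite label."""
    words = text.split()
    label_positive = classify(text) > threshold
    for i, w in enumerate(words):
        best, best_score = w, classify(" ".join(words))
        for cand in SYNONYMS.get(w.lower(), []):
            trial = words[:i] + [cand] + words[i + 1:]
            score = classify(" ".join(trial))
            # Keep the candidate only if it moves toward the opposite label.
            if (score < best_score) == label_positive:
                best, best_score = cand, score
        words[i] = best
        if (classify(" ".join(words)) > threshold) != label_positive:
            break  # label flipped; stop early to stay close to the original
    return " ".join(words)

if __name__ == "__main__":
    original = "the movie was good and the acting was great"
    adversarial = greedy_attack(original)
    print(original, classify(original))        # classified positive
    print(adversarial, classify(adversarial))  # prediction flipped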
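```

Running this turns "the movie was good and the acting was great" into "the movie was decent and the acting was superb": a human reads the same sentiment, but the keyword model's prediction flips because it keys on surface forms rather than meaning. Defenses such as adversarial training or noise augmentation target exactly this brittleness, by exposing the model to perturbed inputs during training.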