Adversarial Text
Adversarial text research focuses on crafting, and defending against, text inputs designed to deceive natural language processing (NLP) models, often by subtly altering wording so that a model's prediction changes while the meaning stays intact for a human reader. Current research emphasizes more effective attack methods, particularly those leveraging multi-agent systems, reinforcement learning, and diffusion models, alongside stronger defenses such as adversarial training and noise augmentation. This work is crucial for improving the robustness and trustworthiness of NLP systems across diverse applications, from automated essay scoring to autonomous vehicle navigation and large language model safety.
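As a concrete illustration of the attack paradigm described above, the sketch below implements a greedy word-substitution attack against a toy classifier. The synonym table, the `toy_classifier` stand-in, and the policy of keeping benign swaps are all simplifying assumptions for this example; real attacks such as TextFooler rank substitutions by their effect on model confidence and filter candidates with embedding-based semantic-similarity checks.

```python
# Minimal sketch of a greedy word-substitution attack.
# Assumptions (hypothetical, for illustration only):
#   - SYNONYMS: a tiny hand-written synonym table standing in for an
#     embedding-based candidate generator.
#   - toy_classifier: a keyword-spotting stand-in for a real NLP model.

SYNONYMS = {
    "love": ["like", "enjoy"],
    "great": ["fine", "decent"],
    "terrible": ["poor", "weak"],
}

def toy_classifier(text: str) -> int:
    """Hypothetical target model: 1 (positive) if a strong cue word appears."""
    words = set(text.lower().split())
    return 1 if words & {"love", "great"} else 0

def greedy_substitution_attack(text: str, classifier) -> str:
    """Swap words for near-synonyms, left to right, until the label flips.

    Benign swaps are kept so perturbations can accumulate; a real attack
    would instead keep only swaps that reduce the model's confidence.
    """
    original_label = classifier(text)
    words = text.split()
    for i in range(len(words)):
        for candidate in SYNONYMS.get(words[i].lower(), []):
            trial = words[:i] + [candidate] + words[i + 1:]
            if classifier(" ".join(trial)) != original_label:
                return " ".join(trial)  # meaning preserved, prediction flipped
            words = trial  # keep the benign swap and continue searching
    return " ".join(words)  # attack failed; return the perturbed text

if __name__ == "__main__":
    sentence = "I love this great movie"
    adversarial = greedy_substitution_attack(sentence, toy_classifier)
    print(f"{sentence!r} -> label {toy_classifier(sentence)}")
    print(f"{adversarial!r} -> label {toy_classifier(adversarial)}")
```

Defenses like adversarial training close the loop by folding such perturbed examples back into the training set, so the model learns to assign them their original labels.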