Counterfactual Generation Method
Counterfactual generation methods aim to minimally alter input data to change a model's prediction, providing insights into model behavior and decision-making processes. Current research focuses on developing robust and efficient algorithms for generating these counterfactuals across various data types, including text and tabular data, often leveraging large language models or data morphology techniques to improve quality and interpretability. This work is significant for enhancing the trustworthiness and explainability of AI systems, with applications ranging from improving educational tools to detecting and mitigating bias in natural language processing models.
Papers
CREST: A Joint Framework for Rationalization and Counterfactual Text Generation
Marcos Treviso, Alexis Ross, Nuno M. Guerreiro, André F. T. Martins
Counterfactuals of Counterfactuals: a back-translation-inspired approach to analyse counterfactual editors
Giorgos Filandrianos, Edmund Dervakos, Orfeas Menis-Mastromichalakis, Chrysoula Zerva, Giorgos Stamou