Rationale Extraction
Rationale extraction aims to identify the specific parts of an input text most influential in a model's prediction, enhancing model interpretability and trustworthiness. Current research focuses on developing more accurate and faithful rationale extraction methods, often employing attention mechanisms, multi-task learning, and adversarial training within various model architectures, including transformers and graph neural networks. This work is significant because it addresses the "black box" nature of many machine learning models, improving both our understanding of model behavior and the reliability of their predictions across diverse applications like legal judgment prediction and abusive language detection.
Papers
December 30, 2024
December 11, 2024
October 8, 2024
October 4, 2024
February 13, 2024
December 1, 2023
November 4, 2023
October 22, 2023
May 16, 2023
May 2, 2023
March 14, 2023
January 15, 2023
December 2, 2022
November 30, 2022
May 12, 2022
December 20, 2021