Attribution Score
Attribution scores quantify the influence of individual input features on a model's prediction, aiming to enhance model interpretability and trustworthiness. Current research focuses on improving the accuracy and efficiency of attribution methods across diverse model architectures, including deep neural networks, message-passing neural networks, and large language models, often employing techniques like Shapley values, integrated gradients, and influence functions. These advancements are crucial for building more reliable and understandable AI systems, with applications ranging from improving model fairness and debugging to facilitating better decision-making in various fields like finance and healthcare.
Papers
November 12, 2024
November 4, 2024
October 24, 2024
October 17, 2024
September 24, 2024
September 18, 2024
September 9, 2024
June 8, 2024
June 7, 2024
May 27, 2024
May 22, 2024
April 25, 2024
April 22, 2024
April 16, 2024
April 4, 2024
March 27, 2024
March 15, 2024
March 12, 2024
March 8, 2024