Attribution Based
Attribution-based methods aim to explain the decisions of machine learning models by assigning importance scores to input features, enhancing model transparency and trustworthiness. Current research focuses on improving the accuracy, stability, and interpretability of these attributions across diverse data types (images, text, time series) and model architectures (deep neural networks, large language models), often employing techniques like Shapley values and influence functions. This work is crucial for building more reliable and accountable AI systems, particularly in high-stakes applications where understanding model behavior is paramount, such as medical diagnosis and financial modeling.
Papers
September 26, 2024
September 24, 2024
September 9, 2024
July 28, 2024
July 27, 2024
July 21, 2024
June 28, 2024
May 14, 2024
April 22, 2024
February 18, 2024
January 8, 2024
August 23, 2023
August 6, 2023
July 25, 2023
June 8, 2023
February 13, 2023
November 23, 2022
November 22, 2022
September 6, 2022