Feature Attribution Method
Feature attribution methods aim to explain the predictions of complex machine learning models by assigning importance scores to input features. Current research focuses on improving the robustness and reliability of these methods, particularly for deep learning models (including transformers and convolutional neural networks), addressing issues like inconsistency across different techniques and sensitivity to perturbations. This work is crucial for building trust in AI systems across various applications, from industrial process optimization to healthcare diagnostics, by enhancing transparency and facilitating better understanding of model behavior.
Papers
April 22, 2022
April 11, 2022
February 22, 2022
December 29, 2021
November 14, 2021