Explanation Method
Explanation methods aim to make the decision-making processes of complex machine learning models more transparent and understandable. Current research focuses on improving the faithfulness, stability, and user-friendliness of explanations, drawing on approaches such as SHAP, LIME, gradient-based attribution, and large language models that generate more natural and engaging explanations. This work is crucial for building trust in AI systems, particularly in high-stakes applications such as healthcare and finance, and for facilitating better model debugging and design. A key challenge remains developing robust evaluation metrics that capture the multifaceted nature of explanation quality and its impact on human understanding.
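As a concrete illustration of the attribution-style methods mentioned above, the minimal sketch below uses the `shap` library to explain a single prediction of a toy tree model. The dataset, model, and feature count are illustrative assumptions for the example, not drawn from the papers listed here.

```python
# Minimal sketch: per-feature attributions for one prediction with SHAP.
# Assumes scikit-learn and shap are installed; data and model are placeholders.
import numpy as np
import shap
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor

# Toy tabular data standing in for a real dataset.
X, y = make_regression(n_samples=200, n_features=5, noise=0.1, random_state=0)
model = RandomForestRegressor(n_estimators=50, random_state=0).fit(X, y)

# TreeExplainer computes SHAP values: per-feature contributions that,
# together with the base (expected) value, reconstruct the model's output.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X[:1])  # shape (1, n_features) for regression

base_value = float(np.ravel(explainer.expected_value)[0])
prediction = float(model.predict(X[:1])[0])
print("prediction:", round(prediction, 3))
print("base value + attributions:", round(base_value + float(shap_values.sum()), 3))
```

The printed values match because SHAP's local accuracy property guarantees that the base value plus the per-feature attributions sums to the model's prediction for that instance; evaluating how faithful and stable such attributions are in practice is exactly the kind of question the work below addresses.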
Papers
SoK: Modeling Explainability in Security Analytics for Interpretability, Trustworthiness, and Usability
Dipkamal Bhusal, Rosalyn Shin, Ajay Ashok Shewale, Monish Kumar Manikya Veerabhadran, Michael Clifford, Sara Rampazzi, Nidhi Rastogi
Computing Rule-Based Explanations by Leveraging Counterfactuals
Zixuan Geng, Maximilian Schleich, Dan Suciu