Post-Hoc Explanation

Post-hoc explanation methods aim to make the decision-making processes of "black box" machine learning models more transparent, primarily by identifying which input features most influence a model's predictions. Current research focuses on improving the accuracy, efficiency, and interpretability of these explanations, often using attribution techniques such as Shapley values and LIME to explain models built on architectures like transformers and CNNs, across data modalities including audio, images, text, and graphs. This work is crucial for building trust in AI systems and for understanding model behavior, particularly in high-stakes domains such as healthcare and finance, where model transparency is paramount.
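To make the Shapley-value idea concrete, below is a minimal, self-contained sketch of exact Shapley attribution, the quantity that libraries such as SHAP approximate. All names here (`shapley_values`, `predict`, `baseline`, the toy model) are illustrative assumptions, not a specific library's API, and the brute-force enumeration over feature coalitions is exponential in the number of features, so it is only practical for very small inputs.

```python
import math
from itertools import combinations

def shapley_values(predict, x, baseline):
    """Exact Shapley attributions for a single input x.

    v(S) is the model's prediction when features outside the
    coalition S are replaced by `baseline` values. Each feature i
    gets the coalition-weighted average of its marginal
    contribution v(S ∪ {i}) - v(S).
    """
    n = len(x)
    features = list(range(n))

    def v(subset):
        masked = [x[i] if i in subset else baseline[i] for i in features]
        return predict(masked)

    phi = [0.0] * n
    for i in features:
        others = [j for j in features if j != i]
        for size in range(n):  # coalition sizes 0 .. n-1
            for S in combinations(others, size):
                # Shapley kernel weight: |S|! (n - |S| - 1)! / n!
                weight = (math.factorial(size)
                          * math.factorial(n - size - 1)
                          / math.factorial(n))
                phi[i] += weight * (v(set(S) | {i}) - v(set(S)))
    return phi

# Toy linear "black box": attributions recover each feature's
# contribution relative to the baseline, i.e. w_i * (x_i - b_i).
model = lambda z: 3.0 * z[0] + 2.0 * z[1] - 1.0 * z[2]
print(shapley_values(model, x=[1.0, 1.0, 1.0], baseline=[0.0, 0.0, 0.0]))
# -> [3.0, 2.0, -1.0]
```

LIME, by contrast, sidesteps this enumeration by fitting an interpretable surrogate model (typically a sparse linear model) on perturbed samples around the input, trading exactness for speed.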

Papers