Consensus-Based Explanation

Consensus-based explanation aims to improve the trustworthiness and understandability of machine learning models by generating explanations that are consistent across different explanation methods. Current research focuses on quantifying and mitigating disagreement between popular explanation techniques such as SHAP and LIME, and sometimes incorporates explanation consensus directly into model training. This work matters for building more reliable AI systems, particularly in high-stakes applications: when explanations are not only accurate but also mutually consistent, they foster user confidence and support effective human-AI collaboration.
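
To make the notion of explanation disagreement concrete, the sketch below compares SHAP and LIME attributions for a single prediction and scores their agreement with Spearman rank correlation and top-k feature overlap. The dataset, model, and choice of agreement metrics are illustrative assumptions, not a prescribed standard from the papers listed here; it assumes the `shap`, `lime`, `scikit-learn`, and `scipy` packages are installed.

```python
import numpy as np
from scipy.stats import spearmanr
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier
import shap
from lime.lime_tabular import LimeTabularExplainer

# Illustrative tabular binary-classification setup.
data = load_breast_cancer()
X, y, names = data.data, data.target, list(data.feature_names)
model = GradientBoostingClassifier(random_state=0).fit(X, y)
x = X[0]  # the single instance we explain

# SHAP attributions: one log-odds contribution per feature for this
# binary gradient-boosting model (multi-class models return per-class arrays).
shap_attr = shap.TreeExplainer(model).shap_values(x.reshape(1, -1))[0]

# LIME attributions: local linear weights for the positive class (label 1).
lime_explainer = LimeTabularExplainer(X, feature_names=names, mode="classification")
lime_exp = lime_explainer.explain_instance(
    x, model.predict_proba, num_features=X.shape[1], labels=(1,)
)
lime_attr = np.zeros(X.shape[1])
for feature_idx, weight in lime_exp.as_map()[1]:
    lime_attr[feature_idx] = weight

# Agreement diagnostics: rank correlation of attributions and overlap of
# the top-5 most important features under each method.
rho, _ = spearmanr(shap_attr, lime_attr)
top_shap = set(np.argsort(-np.abs(shap_attr))[:5])
top_lime = set(np.argsort(-np.abs(lime_attr))[:5])
print(f"Spearman rank agreement: {rho:.2f}")
print(f"Top-5 feature overlap:   {len(top_shap & top_lime)}/5")
```

Low rank correlation or small top-k overlap signals the kind of disagreement that consensus-based methods aim to quantify and reduce, whether post hoc or by penalizing it during training.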

Papers