Better Explainability
Work on the explainability of machine learning models aims to make their decision-making processes more transparent and understandable, fostering trust and enabling better model debugging and refinement. Current research focuses on developing novel explanation methods, including those based on feature attribution, counterfactual examples, and structured argumentation, often applied to deep neural networks, transformers, and reinforcement learning agents. These advances are crucial for deploying AI systems responsibly in high-stakes domains such as healthcare and autonomous systems, where understanding model behavior is paramount.
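To make the feature-attribution family of methods mentioned above concrete, the sketch below implements one simple, model-agnostic variant: permutation importance, which scores each input feature by how much shuffling it degrades the model's predictions. This is an illustrative example using only the standard library, not a method from any specific paper in this collection; the function and variable names are our own.

```python
import random

def permutation_importance(predict, X, y, n_repeats=5, seed=0):
    """Attribute importance to each feature by shuffling it across rows
    and measuring the resulting increase in mean squared error."""
    rng = random.Random(seed)

    def mse(preds):
        return sum((p - t) ** 2 for p, t in zip(preds, y)) / len(y)

    baseline = mse([predict(row) for row in X])
    importances = []
    for j in range(len(X[0])):
        losses = []
        for _ in range(n_repeats):
            # Shuffle column j while leaving all other features intact.
            col = [row[j] for row in X]
            rng.shuffle(col)
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, col)]
            losses.append(mse([predict(row) for row in X_perm]))
        # Importance = average loss increase caused by destroying feature j.
        importances.append(sum(losses) / n_repeats - baseline)
    return importances

# Toy setup: the model uses feature 0 and ignores feature 1 entirely,
# so permuting feature 1 should yield (near-)zero importance.
model = lambda row: 3.0 * row[0]
X = [[float(i), float(i % 7)] for i in range(50)]
y = [3.0 * row[0] for row in X]
imps = permutation_importance(model, X, y)
```

Because the toy model ignores feature 1, its importance comes out exactly zero, while feature 0 receives a large positive score; on real models the same contrast separates features the model actually relies on from those it does not.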