Model-Based Explanation

Model-based explanation aims to make the decisions of machine learning models understandable and trustworthy, addressing the "black box" problem that hinders their wider adoption. Current research focuses on developing faster, more general explanation methods, including model-agnostic approaches that treat the model as a black box rather than relying on a specific architecture, and on evaluating the robustness and reliability of these explanations, particularly in high-stakes domains such as medicine. By providing human-interpretable insights into model behavior, this work is crucial for building confidence in AI systems and ensuring their responsible deployment in safety-critical applications.
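As a concrete illustration, below is a minimal sketch of one classic model-agnostic technique, permutation feature importance. It queries the model only through a prediction function and measures how much a score drops when a single feature's values are shuffled. All names here (predict_fn, metric, the toy data) are illustrative assumptions, not drawn from the papers summarized above.

```python
import numpy as np

def permutation_importance(predict_fn, X, y, metric, n_repeats=5, seed=0):
    """Model-agnostic sketch: a feature is important if shuffling its
    column (breaking its link to the target) lowers the model's score.
    predict_fn can be any callable mapping X -> predictions."""
    rng = np.random.default_rng(seed)
    baseline = metric(y, predict_fn(X))               # score on intact data
    importances = np.zeros(X.shape[1])
    for j in range(X.shape[1]):
        drops = []
        for _ in range(n_repeats):                    # average over shuffles
            X_perm = X.copy()
            X_perm[:, j] = rng.permutation(X_perm[:, j])
            drops.append(baseline - metric(y, predict_fn(X_perm)))
        importances[j] = np.mean(drops)               # mean score drop
    return importances

# Toy usage (hypothetical): only feature 0 drives the target, so it
# should receive the only large importance score.
X = np.random.default_rng(1).normal(size=(200, 3))
y = 2.0 * X[:, 0]
neg_mse = lambda yt, yp: -np.mean((yt - yp) ** 2)     # higher is better
print(permutation_importance(lambda X: 2.0 * X[:, 0], X, y, neg_mse))
```

Because the procedure only ever calls predict_fn, the same code explains a linear model, a gradient-boosted ensemble, or a neural network without modification, which is exactly what makes such approaches model-agnostic.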

Papers