the latest in aiBeta

Inherent Interpretability

Inherent interpretability in machine learning focuses on designing models and methods that are inherently transparent and understandable, aiming to reduce the "black box" nature of many AI systems. Current research emphasizes developing intrinsically interpretable model architectures, such as those based on decision trees, rule-based systems, and specific neural network designs (e.g., Kolmogorov-Arnold Networks), alongside techniques like feature attribution and visualization methods to enhance understanding of model behavior. This pursuit is crucial for building trust in AI, particularly in high-stakes applications like healthcare and finance, where understanding model decisions is paramount for responsible deployment and effective human-AI collaboration.

591papers

Papers - Page 25

July 16, 2023

SHAMSUL: Systematic Holistic Analysis to investigate Medical Significance Utilizing Local interpretability methods in deep learning for chest radiography pathology prediction
Local Interpretable Deep Learning Local Interpretability Clinical Application Interpretability Method Inherent Interpretability

July 12, 2023

Trainability, Expressivity and Interpretability in Gated Neural ODEs
Complex Dynamic Neural Network Dynamic Behavior Expressivity Style Inherent Interpretability Neural Ordinary Differential Equation Neural ODE Continuous Attractor

July 11, 2023

July 5, 2023

Harmonizing Feature Attributions Across Deep Learning Architectures: Enhancing Interpretability and Consistency
Convolutional Neural Network Deep Learning Architecture Feature Attribution Inherent Interpretability Strong Consistency

July 4, 2023

Interpretable Computer Vision Models through Adversarial Training: Unveiling the Robustness-Interpretability Connection
Interpretable Computer Vision Native Robustness Adversarial Training Robust Model Adversarial Attack Inherent Interpretability Robust Interpretation

July 3, 2023

Interpretability and Transparency-Driven Detection and Transformation of Textual Adversarial Examples (IT-DT)
Character Transformation Generative Adversarial Inherent Interpretability Adversarial Example Textual Adversarial Example Adversarial Attack Adversarial Input

July 2, 2023

Minimum Levels of Interpretability for Artificial Moral Agents
Minimum Number Moral Agent Inherent Interpretability Interpretable AI Moral Decision Artificial Intelligence

June 28, 2023

Interpretable Anomaly Detection in Cellular Networks by Learning Concepts in Variational Autoencoders
Latent Dimension Variational Autoencoders Inherent Interpretability Cellular Network Concept Learning Interpretable Representation Anomaly Detection Representation Learning

June 26, 2023

PWSHAP: A Path-Wise Explanation Model for Targeted Variables
Causal Pathway Inherent Interpretability Target Population Black Box Local Effect

June 25, 2023

Interpretable Neural Embeddings with Sparse Self-Representation
Interpretable Embeddings Word Embeddings Inherent Interpretability Better Interpretability

June 21, 2023

Investigating Poor Performance Regions of Black Boxes: LIME-based Exploration in Sepsis Detection
Agnostic Exploration Sepsis Detection Black Box Performance Degradation Local Interpretable Model Agnostic Explanation Inherent Interpretability Sepsis Diagnosis

June 19, 2023

B-cos Alignment for Inherently Interpretable CNNs and Vision Transformers
Weight Re Mapping Vision Transformer B Co Inherent Interpretability Rule Based Explanation

June 15, 2023

Towards Interpretability in Audio and Visual Affective Machine Learning: A Review
Audio Driven Narrative Review Inherent Interpretability Affective Computing Interpretability Method

June 12, 2023

FIRE: An Optimization Approach for Fast Interpretable Rule Extraction
Inherent Interpretability Interpretable Model Fire Occurrence Like Rule Ensemble Algorithm Rule Extraction

June 6, 2023

Explainable AI using expressive Boolean formulas
Explainable AI Interpretable Machine Inherent Interpretability

June 2, 2023

June 1, 2023

SPINEX: Similarity-based Predictions and Explainable Neighbors Exploration for Regression and Classification Tasks in Machine Learning
Novel Regression Feature Interaction Classification Task Machine Learning Inherent Interpretability Ensemble Learning

May 31, 2023

Information Fusion via Symbolic Regression: A Tutorial in the Context of Human Health
Information Fusion Tutorial Review Interpretable Model Inherent Interpretability Symbolic Regression Context Information