Interpretable Deep Learning

Interpretable deep learning aims to make the decision-making processes of deep neural networks transparent and understandable, addressing the "black box" problem that hinders trust and adoption in high-stakes applications. Current research focuses on developing inherently interpretable architectures, such as concept bottleneck models, and on applying explanation techniques such as attention mechanisms, counterfactual explanations, and Shapley values to provide insights into model predictions. By enabling better understanding, validation, and debugging of complex models, this work is crucial for building reliable and trustworthy AI systems across domains ranging from healthcare and finance to neuroimaging and environmental monitoring.
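
To make the architectural idea concrete, below is a minimal sketch of a concept bottleneck model in PyTorch. It is an illustrative example, not a reference implementation: the class, layer sizes, and variable names (e.g. ConceptBottleneckModel, num_concepts) are hypothetical. The key property is that the label head sees only the predicted concept scores, so every prediction can be inspected and intervened on at the concept level.

```python
import torch
import torch.nn as nn

class ConceptBottleneckModel(nn.Module):
    """Predicts human-interpretable concepts first, then a label from those concepts only."""

    def __init__(self, num_features: int, num_concepts: int, num_classes: int):
        super().__init__()
        # Maps raw inputs to one logit per human-defined concept.
        self.concept_net = nn.Sequential(
            nn.Linear(num_features, 64),
            nn.ReLU(),
            nn.Linear(64, num_concepts),
        )
        # The label head receives only the concept predictions; this bottleneck is
        # what makes the model interpretable at the concept level.
        self.label_net = nn.Linear(num_concepts, num_classes)

    def forward(self, x: torch.Tensor):
        concept_logits = self.concept_net(x)
        concepts = torch.sigmoid(concept_logits)  # each value ~ "is this concept present?"
        label_logits = self.label_net(concepts)
        return concept_logits, label_logits


# Joint training sketch: supervise both the concept predictions and the final label.
model = ConceptBottleneckModel(num_features=32, num_concepts=8, num_classes=3)
x = torch.randn(16, 32)                              # toy batch
concept_targets = torch.randint(0, 2, (16, 8)).float()  # binary concept annotations
labels = torch.randint(0, 3, (16,))

concept_logits, label_logits = model(x)
loss = (
    nn.functional.binary_cross_entropy_with_logits(concept_logits, concept_targets)
    + nn.functional.cross_entropy(label_logits, labels)
)
loss.backward()
```

A design note: because the label depends only on the concept scores, a practitioner can manually correct a mispredicted concept at test time and observe how the final prediction changes, which is one of the main debugging benefits attributed to this family of models.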

Papers