DNN Interpretation

DNN interpretation aims to understand the internal workings of deep neural networks, moving beyond their "black box" nature to enhance trust and facilitate debugging. Current research focuses on methods that visualize and quantify the contribution of individual neurons, layers, or image regions to model predictions, using techniques such as class activation maps and analyses of activation patterns. These efforts are crucial for improving model reliability, identifying biases, and ultimately building more trustworthy and explainable AI systems across applications such as healthcare and image analysis. Research is also actively addressing the scalability challenges of formally verifying DNN behavior.
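
As a concrete illustration of the class-activation-map idea mentioned above, the sketch below implements Grad-CAM, one widely used variant, in PyTorch. The ResNet-18 backbone, the use of layer4 as the target layer, and the random input are illustrative assumptions rather than details taken from any specific paper listed below.

```python
# A minimal Grad-CAM sketch (one common class-activation-map technique).
# The ResNet-18 backbone, the choice of layer4 as the target layer, and the
# random input tensor are illustrative assumptions, not a specific paper's setup.
import torch
import torch.nn.functional as F
from torchvision.models import resnet18

model = resnet18(weights=None).eval()    # untrained weights; substitute a trained model
target_layer = model.layer4              # last conv block: spatially resolved activations

feats = {}

def hook(module, inputs, output):
    # Save the feature maps and register a tensor hook to capture their gradients.
    feats["act"] = output
    output.register_hook(lambda g: feats.__setitem__("grad", g))

handle = target_layer.register_forward_hook(hook)

x = torch.randn(1, 3, 224, 224)          # stand-in for a preprocessed input image
logits = model(x)
class_idx = logits.argmax(dim=1).item()  # explain the model's predicted class
logits[0, class_idx].backward()

# Weight each feature map by its spatially averaged gradient, combine, and ReLU.
weights = feats["grad"].mean(dim=(2, 3), keepdim=True)            # (1, C, 1, 1)
cam = F.relu((weights * feats["act"]).sum(dim=1, keepdim=True))   # (1, 1, H, W)
cam = F.interpolate(cam, size=x.shape[-2:], mode="bilinear",
                    align_corners=False)[0, 0]
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)          # normalize to [0, 1]

handle.remove()
print(cam.shape)  # heatmap aligned with the input, ready to overlay for visualization
```

The resulting heatmap highlights the image regions whose activations most increase the score of the predicted class; overlaying it on the input image is the usual way to inspect what the network attended to.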

Papers