Scalable Interpretability

Scalable interpretability aims to make the decision-making processes of complex machine learning models, such as deep neural networks, understandable and transparent even as model size and data volume grow. Current research focuses on developing novel architectures and algorithms that balance high predictive performance with readily accessible explanations; examples include sparse feature circuits, in-database interpretability frameworks, and scalable polynomial additive models. These advances are crucial for building trust in AI systems across diverse applications, from medical image analysis to database querying, and for facilitating responsible AI development.
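To make the additive-model direction concrete, the sketch below shows a minimal polynomial additive model: the prediction is a sum of per-feature polynomial shape functions, so each feature's contribution can be read off (or plotted) directly. This is an illustrative toy, not the implementation from any of the papers below; the names `PolynomialAdditiveModel`, `polynomial_basis`, and `shape_function` and the `degree`/`ridge` hyperparameters are assumptions chosen for clarity, and published scalable variants handle interactions and large feature counts far more carefully.

```python
import numpy as np

def polynomial_basis(X, degree):
    """Expand each column of X into powers 1..degree (no interaction terms)."""
    return np.concatenate([X ** p for p in range(1, degree + 1)], axis=1)

class PolynomialAdditiveModel:
    """Additive model y ~ b + sum_j f_j(x_j), with polynomial shape functions f_j."""

    def __init__(self, degree=3, ridge=1e-3):
        self.degree = degree
        self.ridge = ridge  # small L2 penalty keeps the normal equations stable

    def fit(self, X, y):
        self.y_mean_ = y.mean()  # intercept handled by centering the target
        Phi = polynomial_basis(X, self.degree)
        k = Phi.shape[1]
        # Closed-form ridge regression: (Phi^T Phi + lambda I)^{-1} Phi^T y_centered
        self.coef_ = np.linalg.solve(
            Phi.T @ Phi + self.ridge * np.eye(k), Phi.T @ (y - self.y_mean_)
        )
        return self

    def predict(self, X):
        return self.y_mean_ + polynomial_basis(X, self.degree) @ self.coef_

    def shape_function(self, X, j):
        """Per-feature contribution f_j(x_j): the model's explanation for feature j."""
        d = X.shape[1]
        powers = np.stack([X[:, j] ** p for p in range(1, self.degree + 1)], axis=1)
        # Coefficients for feature j sit at indices j, j + d, j + 2d, ...
        return powers @ self.coef_[[j + p * d for p in range(self.degree)]]

# Toy usage: recover an additive ground truth and inspect one shape function.
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(500, 2))
y = 2.0 * X[:, 0] - 1.5 * X[:, 1] ** 2 + 0.1 * rng.standard_normal(500)
model = PolynomialAdditiveModel(degree=3).fit(X, y)
print(model.shape_function(X[:5], 1))  # contribution of feature 1 alone
```

Because every shape function depends on a single input feature, the explanation is exact rather than post hoc, which is the trade-off additive models make against fully unconstrained networks.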

Papers