Concept Intervention

Concept intervention focuses on improving the interpretability and performance of machine learning models by allowing human users to correct erroneous intermediate representations, called concepts, that the model generates before making a final prediction. Current research emphasizes developing model architectures, such as Concept Bottleneck Models (CBMs) and their variants (e.g., stochastic, counterfactual, energy-based CBMs), that efficiently incorporate these interventions, often by explicitly modeling relationships between concepts. This research aims to enhance model accuracy, reduce the number of interventions needed, and improve the overall usability of these explainable AI systems, ultimately leading to more reliable and trustworthy AI applications.

Papers

June 27, 2024

Stochastic Concept Bottleneck Models
Moritz Vandenhirtz, Sonia Laguna, Ričards Marcinkevičs, Julia E. Vogt
Concept Bottleneck Model Semantic Annotation Concept Intervention

May 28, 2024

Understanding Inter-Concept Relationships in Concept-Based Models
Naveen Raman, Mateo Espinosa Zarlenga, Mateja Jamnik
Human Understanding Human Relationship Concept Based Concept Representation Concept Based Model Concept Intervention

May 2, 2024

Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models
Nishad Singhi, Jae Myung Kim, Karsten Roth, Zeynep Akata
Concept Bottleneck Model Concept Shift Concept Intervention

February 2, 2024

Counterfactual Concept Bottleneck Models
Gabriele Dominici, Pietro Barbiero, Francesco Giannini, Martin Gjoreski, Giuseppe Marra, Marc Langheinrich
Inherent Interpretability Counterfactual Explanation Concept Bottleneck Model Task Prediction Concept Intervention

January 25, 2024

Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations
Xinyue Xu, Yi Qin, Lu Mi, Hao Wang, Xiaomeng Li
Black Box Model Language Correction Concept Bottleneck Model Concept Based Probabilistic Interpretation Unified Prediction Concept Intervention

January 24, 2024

Beyond Concept Bottleneck Models: How to Make Black Boxes Intervenable?
Sonia Laguna, Ričards Marcinkevičs, Moritz Vandenhirtz, Julia E. Vogt
Black Box Interpretable Machine Learning Concept Bottleneck Model Concept Intervention

September 29, 2023

Learning to Receive Help: Intervention-Aware Concept Embedding Models
Mateo Espinosa Zarlenga, Katherine M. Collins, Krishnamurthy Dvijotham, Adrian Weller, Zohreh Shams, Mateja Jamnik
LeArning Abstract Full Model Concept Bottleneck Model Concept Based HELP Request Intervention Policy Concept Intervention