Concept Bottleneck Model
Concept Bottleneck Models (CBMs) aim to enhance the interpretability of deep learning models by first predicting human-understandable concepts and then making the final prediction from those concepts, bridging the gap between complex model outputs and human comprehension. Current research focuses on improving CBM accuracy, addressing security vulnerabilities such as backdoor attacks, and developing methods for automated concept discovery and selection, often leveraging vision-language models like CLIP. This work is significant because it aims to create more trustworthy and reliable AI systems, particularly in high-stakes domains like medicine, where understanding model decisions is crucial for both performance and user acceptance.
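To make the two-stage structure concrete, below is a minimal sketch of a CBM in PyTorch. It is an illustrative toy implementation under assumed dimensions and a simple joint training loss, not the architecture of any specific paper listed here; the backbone, the linear concept-to-label head, and the intervention interface are all assumptions chosen for clarity.

```python
import torch
import torch.nn as nn

class ConceptBottleneckModel(nn.Module):
    """Toy CBM: input -> concept predictions -> label prediction."""

    def __init__(self, input_dim: int, num_concepts: int, num_classes: int):
        super().__init__()
        # x -> concept logits (the human-interpretable bottleneck)
        self.concept_predictor = nn.Sequential(
            nn.Linear(input_dim, 128),
            nn.ReLU(),
            nn.Linear(128, num_concepts),
        )
        # concepts -> label; kept linear so label decisions remain attributable to concepts
        self.label_predictor = nn.Linear(num_concepts, num_classes)

    def forward(self, x, concept_intervention=None):
        concept_logits = self.concept_predictor(x)
        concepts = torch.sigmoid(concept_logits)
        # Test-time intervention: a human can overwrite selected concept predictions
        # (mask is a boolean tensor marking which concepts to replace).
        if concept_intervention is not None:
            mask, true_concepts = concept_intervention
            concepts = torch.where(mask, true_concepts, concepts)
        label_logits = self.label_predictor(concepts)
        return concept_logits, label_logits

# Joint training: supervise both the concept layer and the final label.
model = ConceptBottleneckModel(input_dim=64, num_concepts=10, num_classes=3)
x = torch.randn(8, 64)
c_true = torch.randint(0, 2, (8, 10)).float()   # binary concept annotations
y_true = torch.randint(0, 3, (8,))              # class labels
concept_logits, label_logits = model(x)
loss = (nn.BCEWithLogitsLoss()(concept_logits, c_true)
        + nn.CrossEntropyLoss()(label_logits, y_true))
loss.backward()
```

Because the label head reads only the concept activations, correcting a mispredicted concept at test time directly changes the downstream prediction, which is the intervention property studied in several of the papers below.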
Papers
Interpretable and intervenable ultrasonography-based machine learning models for pediatric appendicitis
Ričards Marcinkevičs, Patricia Reis Wolfertstetter, Ugne Klimiene, Kieran Chin-Cheong, Alyssia Paschke, Julia Zerres, Markus Denzinger, David Niederberger, Sven Wellmann, Ece Ozkan, Christian Knorr, Julia E. Vogt
A Closer Look at the Intervention Procedure of Concept Bottleneck Models
Sungbin Shin, Yohan Jo, Sungsoo Ahn, Namhoon Lee
Learn to explain yourself, when you can: Equipping Concept Bottleneck Models with the ability to abstain on their concept predictions
Joshua Lockhart, Daniele Magazzeni, Manuela Veloso
Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification
Yue Yang, Artemis Panagopoulou, Shenghao Zhou, Daniel Jin, Chris Callison-Burch, Mark Yatskar