Confidence Score
Confidence scores, which represent a model's certainty in its predictions, are crucial for building trustworthy AI systems, particularly in high-stakes applications such as healthcare and autonomous driving. Current research focuses on improving the calibration and reliability of these scores across diverse model architectures (including LLMs, transformers, and conformers) and tasks, often employing techniques such as self-consistency, multicalibration, and novel scoring functions tailored to specific data characteristics (e.g., ordinal data or long-form text). Accurate confidence estimation is vital for improving model performance, enabling selective classification (rejecting low-confidence predictions), and supporting human-in-the-loop systems where trust and transparency are paramount.
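To make two of the recurring ideas above concrete, the sketch below shows one common way to measure calibration (expected calibration error over binned confidence scores) and a simple selective-classification rule that abstains below a confidence threshold. This is a minimal illustration, not code from any of the listed papers; the function names, the 0.8 threshold, and the toy confidence values are assumptions chosen for the example.

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """Expected Calibration Error (ECE): the weighted average gap between
    mean confidence and accuracy within equal-width confidence bins."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            gap = abs(confidences[mask].mean() - correct[mask].mean())
            ece += mask.mean() * gap  # weight the gap by the bin's share of samples
    return ece

def selective_classify(confidences, predictions, threshold=0.8):
    """Selective classification: answer only when confidence meets the
    threshold; otherwise abstain (represented here by None)."""
    return [pred if conf >= threshold else None
            for conf, pred in zip(confidences, predictions)]

# Toy example with made-up confidence scores, predictions, and labels.
confs = [0.95, 0.60, 0.85, 0.40, 0.99]
preds = ["A", "B", "C", "D", "A"]
labels = ["A", "C", "C", "D", "A"]
correct = [p == y for p, y in zip(preds, labels)]

print("ECE:", round(expected_calibration_error(confs, correct, n_bins=5), 3))
print("Selective predictions:", selective_classify(confs, preds, threshold=0.8))
```

Raising the abstention threshold generally trades coverage (how many inputs get an answer) for accuracy on the answered subset, which is why calibrated scores matter: the threshold is only meaningful if the reported confidence tracks the true probability of being correct.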
Papers
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback
Katherine Tian, Eric Mitchell, Allan Zhou, Archit Sharma, Rafael Rafailov, Huaxiu Yao, Chelsea Finn, Christopher D. Manning
Selectively Answering Ambiguous Questions
Jeremy R. Cole, Michael J. Q. Zhang, Daniel Gillick, Julian Martin Eisenschlos, Bhuwan Dhingra, Jacob Eisenstein