Threshold Based Auto Labeling

Threshold-based auto-labeling (TBAL) aims to efficiently create large labeled datasets by automatically labeling data points based on a model's confidence scores exceeding a predefined threshold. Current research focuses on optimizing the threshold selection process, developing improved confidence functions to mitigate model overconfidence, and analyzing the sample complexity required for reliable auto-labeling. This approach significantly reduces the need for manual labeling, impacting various machine learning applications by accelerating model training and potentially improving data efficiency, but careful consideration of potential pitfalls, such as high validation data requirements, is crucial for successful implementation.

Papers

May 21, 2024

Parallel Algorithm for Optimal Threshold Labeling of Ordinal Regression Methods
Ryoya Yamasaki, Toshiyuki Tanaka
Ordinal Pattern Ordinal Regression Parallel Algorithm Threshold Based Threshold Based Auto Labeling

April 24, 2024

Pearls from Pebbles: Improved Confidence Functions for Auto-labeling
Harit Vishwakarma, Reid, Chen, Sui Jiet Tay, Satya Sai Srinath Namburi, Frederic Sala, Ramya Korlakai Vinayak
Labeling Technique Threshold Based Auto Labeling

November 22, 2022

Promises and Pitfalls of Threshold-based Auto-labeling
Harit Vishwakarma, Heguang Lin, Frederic Sala, Ramya Korlakai Vinayak
Common Pitfall Promise Human Labeled Threshold Based Auto Labeling

December 4, 2021

Adaptive label thresholding methods for online multi-label classification
Tingting Zhai, Hongcheng Tang, Hao Wang
Multi Label Multi Label Recognition Adaptive Thresholding Threshold Based Auto Labeling

Threshold Based Auto Labeling

Papers

Parallel Algorithm for Optimal Threshold Labeling of Ordinal Regression Methods

Pearls from Pebbles: Improved Confidence Functions for Auto-labeling

Promises and Pitfalls of Threshold-based Auto-labeling

Adaptive label thresholding methods for online multi-label classification