Pseudo Labeling
Pseudo-labeling is a semi-supervised learning technique that leverages unlabeled data by using a model's predictions as pseudo-labels to augment training datasets. Current research focuses on improving the reliability of these pseudo-labels through techniques like confidence thresholding, multi-view approaches, and incorporating additional information such as contextual metadata or neighbor relations. This approach is particularly valuable in domains with limited labeled data, such as medical image analysis, speech processing, and object detection, leading to improved model performance and reduced annotation costs. The resulting advancements have significant implications for various applications where obtaining labeled data is expensive or difficult.
Papers
Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation
Xixi Jiang, Dong Zhang, Xiang Li, Kangyi Liu, Kwang-Ting Cheng, Xin Yang
PEPL: Precision-Enhanced Pseudo-Labeling for Fine-Grained Image Classification in Semi-Supervised Learning
Bowen Tian, Songning Lai, Lujundong Li, Zhihao Shuai, Runwei Guan, Tian Wu, Yutao Yue
SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery
Sarah Rastegar, Mohammadreza Salehi, Yuki M. Asano, Hazel Doughty, Cees G. M. Snoek
Empowering Low-Resource Language ASR via Large-Scale Pseudo Labeling
Kaushal Santosh Bhogale, Deovrat Mehendale, Niharika Parasa, Sathish Kumar Reddy G, Tahir Javed, Pratyush Kumar, Mitesh M. Khapra
Memory-Efficient Pseudo-Labeling for Online Source-Free Universal Domain Adaptation using a Gaussian Mixture Model
Pascal Schlachter, Simon Wagner, Bin Yang
CRMSP: A Semi-supervised Approach for Key Information Extraction with Class-Rebalancing and Merged Semantic Pseudo-Labeling
Qi Zhang, Yonghong Song, Pengcheng Guo, Yangyang Hui