Self-Training
Self-training is a semi-supervised machine learning technique that leverages unlabeled data to improve model performance by iteratively training on pseudo-labels generated by the model itself. Current research focuses on enhancing self-training's robustness and efficiency through techniques like contrastive learning, preference optimization, and uncertainty estimation, often integrated with various model architectures including deep neural networks, transformers, and generative models. This approach is proving valuable across diverse applications, from improving fairness in machine learning to enabling more sample-efficient training in areas like 3D object detection, natural language processing, and biosignal-based robotics control. The ultimate goal is to reduce reliance on expensive and time-consuming data annotation while improving model accuracy and generalization.
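The core loop described above — train on labeled data, pseudo-label the unlabeled pool, keep only confident predictions, and retrain — can be sketched in a few lines. The example below is a minimal illustration, not any paper's method: it uses hypothetical 1-D data and a simple nearest-centroid classifier, with a margin-based confidence score standing in for the uncertainty estimates discussed above. All function and parameter names are illustrative.

```python
def fit_centroids(xs, ys):
    # "Train" the toy model: one centroid (mean) per class.
    cents = {}
    for label in set(ys):
        pts = [x for x, y in zip(xs, ys) if y == label]
        cents[label] = sum(pts) / len(pts)
    return cents

def predict_with_confidence(cents, x):
    # Predict the nearest centroid's class; confidence is the relative
    # margin between the two nearest centroids, in (0, 1).
    dists = sorted((abs(x - c), label) for label, c in cents.items())
    (d1, label), (d2, _) = dists[0], dists[1]
    conf = (d2 - d1) / (d2 + d1 + 1e-12)
    return label, conf

def self_train(lab_x, lab_y, unlab_x, threshold=0.5, max_rounds=10):
    # Iteratively move confidently pseudo-labeled points into the
    # labeled set and retrain; stop when nothing new is added.
    lab_x, lab_y, unlab_x = list(lab_x), list(lab_y), list(unlab_x)
    for _ in range(max_rounds):
        cents = fit_centroids(lab_x, lab_y)
        remaining, added = [], False
        for x in unlab_x:
            label, conf = predict_with_confidence(cents, x)
            if conf >= threshold:
                lab_x.append(x)
                lab_y.append(label)
                added = True
            else:
                remaining.append(x)
        unlab_x = remaining
        if not added:
            break
    return fit_centroids(lab_x, lab_y)
```

For example, starting from labeled points `[0.0, 1.0]` (class 0) and `[9.0, 10.0]` (class 1), the unlabeled points `2.0` and `8.0` are confidently pseudo-labeled in the first round and fold into the final centroids. The confidence threshold is the key knob: set it too low and noisy pseudo-labels contaminate the labeled set, which is exactly the failure mode the filtering and calibration work below targets.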
Papers
On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration
Nauman Dawalatabad, Sameer Khurana, Antoine Laurent, James Glass
Self-training of Machine Learning Models for Liver Histopathology: Generalization under Clinical Shifts
Jin Li, Deepta Rajan, Chintan Shah, Dinkar Juyal, Shreya Chakraborty, Chandan Akiti, Filip Kos, Janani Iyer, Anand Sampat, Ali Behrooz