Self-Training
Self-training is a semi-supervised machine learning technique that leverages unlabeled data to improve model performance by iteratively training on pseudo-labels generated by the model itself. Current research focuses on enhancing self-training's robustness and efficiency through techniques like contrastive learning, preference optimization, and uncertainty estimation, often integrated with various model architectures including deep neural networks, transformers, and generative models. This approach is proving valuable across diverse applications, from improving fairness in machine learning to enabling more sample-efficient training in areas like 3D object detection, natural language processing, and biosignal-based robotics control. The ultimate goal is to reduce reliance on expensive and time-consuming data annotation while improving model accuracy and generalization.
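The core loop described above — train on labeled data, predict pseudo-labels for unlabeled data, keep only high-confidence predictions, and repeat — can be sketched in a few lines. This is a minimal illustration, not any particular paper's method: the `self_train` function, the nearest-centroid classifier standing in for a real model, and the `threshold` confidence cutoff are all illustrative choices.

```python
import numpy as np

def self_train(X_l, y_l, X_u, n_rounds=5, threshold=0.9):
    """Iterative pseudo-labeling with a toy nearest-centroid classifier.

    Each round: fit centroids on the current labeled set, score each
    unlabeled point by a softmax over negative centroid distances, and
    absorb points whose top-class confidence clears `threshold`.
    """
    X_l, y_l, X_u = X_l.copy(), y_l.copy(), X_u.copy()
    for _ in range(n_rounds):
        if len(X_u) == 0:
            break
        classes = np.unique(y_l)
        # "Train": one centroid per class from the labeled pool.
        centroids = np.stack([X_l[y_l == c].mean(axis=0) for c in classes])
        # "Predict": distance to each centroid -> softmax confidence.
        d = np.linalg.norm(X_u[:, None, :] - centroids[None, :, :], axis=2)
        w = np.exp(-d)
        probs = w / w.sum(axis=1, keepdims=True)
        conf = probs.max(axis=1)
        pseudo = classes[probs.argmax(axis=1)]
        # Keep only confident pseudo-labels; stop if none qualify.
        keep = conf >= threshold
        if not keep.any():
            break
        X_l = np.vstack([X_l, X_u[keep]])
        y_l = np.concatenate([y_l, pseudo[keep]])
        X_u = X_u[~keep]
    return X_l, y_l
```

In practice the nearest-centroid step is replaced by retraining the actual model (a deep network, transformer, etc.), and the fixed confidence threshold is often replaced by the uncertainty-estimation or curriculum schemes the papers below study.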
Papers
HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection
Liujuan Cao, Jianghang Lin, Zebo Hong, Yunhang Shen, Shaohui Lin, Chao Chen, Rongrong Ji
STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning
Yanan Zhang, Chao Zhou, Di Huang