Self Training Pipeline

Self-training pipelines leverage unlabeled data to enhance the performance of machine learning models, primarily addressing the limitations of supervised learning where labeled data is scarce, expensive, or biased. Current research focuses on improving the accuracy and robustness of self-training through techniques like bilevel optimization, contrastive learning, and consistency regularization, often implemented within architectures such as ResNet, and applied to diverse tasks including image classification, segmentation, and optical flow estimation. These advancements are significant because they enable the development of more accurate and generalizable models, particularly in domains with limited labeled data, such as medical image analysis and remote sensing.

Papers