Self-Supervised Pre-Training

Self-supervised pre-training (SSP) aims to learn robust feature representations from unlabeled data by training models on pretext tasks, improving their performance on downstream tasks with limited labeled data. Current research focuses on better aligning the pre-training and fine-tuning stages, developing efficient methods such as dataset distillation, and exploring architectures including Vision Transformers, masked autoencoders, and diffusion models, often within contrastive or self-distillation frameworks. SSP's significance lies in its ability to exploit vast amounts of unlabeled data, boosting model performance across diverse domains such as computer vision, natural language processing, and medical imaging, particularly when labeled data is scarce or expensive to obtain.
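
To make the idea of a contrastive pretext task concrete, below is a minimal PyTorch sketch of SimCLR-style training on unlabeled data: two augmented views of each image are embedded and a temperature-scaled NT-Xent loss pulls matching views together while pushing other samples apart. The encoder, loss function, augmentation scheme, and hyperparameters here are illustrative assumptions, not the method of any particular paper listed below.

```python
# Minimal contrastive pre-training sketch (SimCLR-style), assuming PyTorch.
# SmallEncoder, nt_xent_loss, and the noise-based "augmentations" are toy stand-ins.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SmallEncoder(nn.Module):
    """Toy backbone + projection head; a real setup would use a ResNet or ViT."""
    def __init__(self, dim_in=3 * 32 * 32, dim_out=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Flatten(),
            nn.Linear(dim_in, 512), nn.ReLU(),
            nn.Linear(512, dim_out),
        )

    def forward(self, x):
        return F.normalize(self.net(x), dim=1)  # unit-norm embeddings

def nt_xent_loss(z1, z2, temperature=0.5):
    """Pull two views of the same image together, push apart other images in the batch."""
    z = torch.cat([z1, z2], dim=0)                      # (2N, D)
    sim = z @ z.t() / temperature                       # pairwise cosine similarities
    n = z1.shape[0]
    self_mask = torch.eye(2 * n, dtype=torch.bool, device=z.device)
    sim.masked_fill_(self_mask, float("-inf"))          # exclude self-similarity
    # the positive for sample i is its other view, located n positions away
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)]).to(z.device)
    return F.cross_entropy(sim, targets)

# Unlabeled batch: additive noise stands in for real augmentations (crops, color jitter).
encoder = SmallEncoder()
x = torch.randn(8, 3, 32, 32)
view1, view2 = x + 0.1 * torch.randn_like(x), x + 0.1 * torch.randn_like(x)
loss = nt_xent_loss(encoder(view1), encoder(view2))
loss.backward()  # optimize this pretext loss, then fine-tune the encoder on labeled data
```

After pre-training, the projection head is typically discarded and the backbone is fine-tuned (or probed linearly) on the small labeled downstream dataset.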

Papers