Learning Phase

Research into learning phases in artificial neural networks (ANNs) focuses on characterizing the distinct stages of training, revealing how models acquire and utilize information. Current investigations utilize various architectures, including ResNets, VGGs, and transformers, to analyze learning dynamics through metrics like reconstruction loss and prediction accuracy, often identifying multiple phases such as initial fitting, compression, and sometimes a later "grokking" phase where generalization improves unexpectedly. These studies aim to improve understanding of model behavior, leading to better training strategies, such as optimized transfer learning techniques, and potentially shedding light on the fundamental processes of learning itself.

Papers

December 11, 2023

Understanding and Leveraging the Learning Phases of Neural Networks
Johannes Schneider, Mohit Prabhushankar
Neural Network Deep Neural Network Transfer Learning Human Understanding Information Bottleneck Learning Dynamic Learning Phase

June 6, 2023

Language acquisition: do children and language models follow similar learning stages?
Linnea Evanson, Yair Lakretz, Jean-Rémi King
Language Model Natural Language Nine Year Old Child Language Acquisition Linguistic Competence Learning Phase

April 30, 2023

Multi-Task Structural Learning using Local Task Similarity induced Neuron Creation and Removal
Naresh Kumar Gurulingan, Bahram Zonooz, Elahe Arani
Neural Network Multi Task Learning Neuron Model Active Removal Task Similarity Learning Phase

September 2, 2022

Three Learning Stages and Accuracy-Efficiency Tradeoff of Restricted Boltzmann Machines
Lennart Dabelow, Masahito Ueda
Restricted Boltzmann Machine Unsupervised Machine Learning Accuracy Efficiency Trade Correlation Learning Learning Phase

May 20, 2022

Towards Understanding Grokking: An Effective Theory of Representation Learning
Ziming Liu, Ouail Kitouni, Niklas Nolte, Eric J. Michaud, Max Tegmark, Mike Williams
Strong Generalization Representation Learning Structured Representation Grokking Phenomenon Effective Theory Learning Phase

February 16, 2022

The learning phases in NN: From Fitting the Majority to Fitting a Few
Johannes Schneider
Deep Neural Network Information Bottleneck Learning Dynamic Classification Error Layer Neural Network Majority Rule Learning Phase