Two Layer Neural Network
Two-layer neural networks (a single hidden layer followed by an output layer) serve as a tractable model for understanding the behavior of deeper networks. Research focuses on their optimization dynamics, generalization, and feature learning. Current work analyzes training with stochastic gradient descent and related algorithms, often in the neural tangent kernel regime, to characterize convergence rates and the effect of hyperparameters such as learning rate and network width. These studies inform the theoretical foundations of deep learning, guide the design of more efficient and robust algorithms, and help explain phenomena such as spectral bias and the emergence of skills during training.
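To make the setting concrete, here is a minimal sketch of a two-layer network trained with stochastic gradient descent. It is an illustration only and is not taken from any of the listed papers. Everything specific in it is an assumption: the ReLU activation, scalar inputs, the 1/sqrt(m) output scaling often used in neural tangent kernel analyses, the toy sine target, and all function names (`init_network`, `forward`, `sgd_step`, `train`) and default hyperparameters.

```python
import math
import random


def init_network(width, rng):
    """Sample a width-m two-layer ReLU network for scalar inputs (illustrative setup)."""
    return {
        "w": [rng.gauss(0.0, 1.0) for _ in range(width)],      # hidden-layer weights
        "b": [rng.gauss(0.0, 1.0) for _ in range(width)],      # hidden-layer biases
        "a": [rng.choice((-1.0, 1.0)) for _ in range(width)],  # output-layer weights
    }


def forward(net, x):
    """Compute f(x) = (1 / sqrt(m)) * sum_j a_j * relu(w_j * x + b_j)."""
    m = len(net["w"])
    total = 0.0
    for w, b, a in zip(net["w"], net["b"], net["a"]):
        total += a * max(0.0, w * x + b)
    return total / math.sqrt(m)


def sgd_step(net, x, y, lr):
    """One SGD step on the single-sample loss 0.5 * (f(x) - y)^2, updating all parameters."""
    m = len(net["w"])
    scale = 1.0 / math.sqrt(m)
    err = forward(net, x) - y  # residual, computed before any parameter changes
    for j in range(m):
        pre = net["w"][j] * x + net["b"][j]
        act = max(0.0, pre)
        gate = 1.0 if pre > 0.0 else 0.0  # ReLU derivative (taken as 0 at pre == 0)
        # Compute all three gradients from the old parameters, then update.
        grad_a = err * scale * act
        grad_w = err * scale * net["a"][j] * gate * x
        grad_b = err * scale * net["a"][j] * gate
        net["a"][j] -= lr * grad_a
        net["w"][j] -= lr * grad_w
        net["b"][j] -= lr * grad_b


def mean_squared_error(net, data):
    return sum((forward(net, x) - y) ** 2 for x, y in data) / len(data)


def train(width=64, lr=0.05, epochs=200, seed=0):
    """Fit sin(pi * x) on 21 points in [-1, 1]; return (initial_mse, final_mse)."""
    rng = random.Random(seed)
    data = [(x / 10.0, math.sin(math.pi * x / 10.0)) for x in range(-10, 11)]
    net = init_network(width, rng)
    initial = mean_squared_error(net, data)
    for _ in range(epochs):
        rng.shuffle(data)  # visit samples in a fresh random order each epoch
        for x, y in data:
            sgd_step(net, x, y, lr)
    return initial, mean_squared_error(net, data)


if __name__ == "__main__":
    initial_mse, final_mse = train()
    print(f"MSE before training: {initial_mse:.4f}, after training: {final_mse:.4f}")
```

Varying `width` and `lr` in `train` is a simple way to probe the hyperparameter effects mentioned above. Under this scaling, wider networks tend to move their hidden weights less during training, which is the intuition behind the neural tangent kernel approximation, though this toy script does not measure that directly.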
Papers
October 11, 2023
October 3, 2023
September 26, 2023
September 14, 2023
September 1, 2023
July 13, 2023
July 11, 2023
July 3, 2023
June 29, 2023
June 28, 2023
May 29, 2023
May 26, 2023
May 22, 2023
May 10, 2023
May 9, 2023
April 6, 2023
March 31, 2023
March 29, 2023