Neural Network Generalization
Research on neural network generalization asks why and how deep learning models trained on a limited dataset can predict accurately on unseen data. Current work investigates the factors that influence generalization: model architecture (including modular networks and transformers), training and optimization techniques (such as sharpness-aware minimization and data augmentation), and the role of biases and noise in both the data and the model parameters. These questions matter for the reliability and robustness of AI systems in applications such as medical image analysis, robotics, and cybersecurity, where a model must perform well on real-world inputs that differ from its training set.
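As a rough illustration of what "generalization" measures, the sketch below computes a generalization gap: accuracy on the training set minus accuracy on held-out data. It is not taken from any of the listed papers. To stay dependency-free it uses a 1-nearest-neighbor classifier as a stand-in for a model that memorizes its training set, not a neural network, and the synthetic task, the noise rate, and all function names (`make_data`, `predict_1nn`, `accuracy`) are assumptions made for the example.

```python
import random

def make_data(n, noise, rng):
    """Sample n points on [0, 1) labeled by x > 0.5, flipping a fraction of labels."""
    data = []
    for _ in range(n):
        x = rng.random()
        y = int(x > 0.5)
        if rng.random() < noise:
            y = 1 - y  # inject label noise
        data.append((x, y))
    return data

def predict_1nn(train, x):
    """Return the label of the training point closest to x."""
    return min(train, key=lambda p: abs(p[0] - x))[1]

def accuracy(train, data):
    """Fraction of points in data that the 1-NN rule built on train labels correctly."""
    correct = sum(predict_1nn(train, x) == y for x, y in data)
    return correct / len(data)

rng = random.Random(0)  # fixed seed so the run is repeatable
train = make_data(200, noise=0.2, rng=rng)
test = make_data(1000, noise=0.2, rng=rng)

train_acc = accuracy(train, train)  # the model memorizes its training set
test_acc = accuracy(train, test)    # held-out data exposes the memorized noise
gap = train_acc - test_acc          # the generalization gap

print(f"train={train_acc:.3f} test={test_acc:.3f} gap={gap:.3f}")
```

Because the classifier reproduces every training label, including the flipped ones, its training accuracy says nothing about how it behaves on new points. The gap between the two numbers is the quantity that the architecture, optimization, and noise factors above are studied for their effect on.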