Neural Network Generalization
Research on neural network generalization seeks to explain why and how deep learning models trained on limited data can make accurate predictions on unseen data. Current work investigates the factors that influence generalization: model architecture (including modular networks and transformers), training and optimization techniques (such as sharpness-aware minimization and data augmentation), and the role of biases and noise in both data and model parameters. These investigations matter for the reliability and robustness of AI systems in applications ranging from medical image analysis to robotics and cybersecurity, where models must generalize to real-world conditions.
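To make two of the ideas above concrete, the following is a minimal sketch of a sharpness-aware minimization (SAM) update, together with a measurement of the gap between training loss and held-out loss. It is illustrative only and not taken from any of the listed papers. It assumes a small logistic-regression model on synthetic data, and the names `loss_and_grad`, `sam_step`, and `make_data`, along with the values of `lr` and `rho`, are hypothetical choices made for this example.

```python
import numpy as np

def loss_and_grad(w, X, y):
    """Mean logistic loss and its gradient for labels y in {0, 1}."""
    p = 1.0 / (1.0 + np.exp(-(X @ w)))           # predicted probabilities
    tiny = 1e-12                                  # guards against log(0)
    loss = -np.mean(y * np.log(p + tiny) + (1 - y) * np.log(1 - p + tiny))
    grad = X.T @ (p - y) / len(y)
    return loss, grad

def sam_step(w, X, y, lr=0.1, rho=0.05):
    """One SAM update: take the gradient at a nearby 'worst-case' point.

    lr and rho are illustrative values, not recommended settings.
    """
    _, g = loss_and_grad(w, X, y)
    # Move a distance rho along the normalized gradient (ascent direction).
    perturbation = rho * g / (np.linalg.norm(g) + 1e-12)
    # Gradient evaluated at the perturbed weights drives the actual update.
    _, g_perturbed = loss_and_grad(w + perturbation, X, y)
    return w - lr * g_perturbed

def make_data(n, rng):
    """Synthetic binary classification data (an assumption for this sketch)."""
    X = rng.normal(size=(n, 5))
    true_w = np.array([1.5, -2.0, 0.5, 0.0, 0.0])
    y = (X @ true_w + 0.5 * rng.normal(size=n) > 0).astype(float)
    return X, y

rng = np.random.default_rng(0)
X_train, y_train = make_data(40, rng)      # deliberately small training set
X_test, y_test = make_data(2000, rng)      # stands in for "unseen data"

w = np.zeros(5)
for _ in range(200):
    w = sam_step(w, X_train, y_train)

train_loss, _ = loss_and_grad(w, X_train, y_train)
test_loss, _ = loss_and_grad(w, X_test, y_test)
generalization_gap = test_loss - train_loss
print(f"train={train_loss:.3f} test={test_loss:.3f} gap={generalization_gap:.3f}")
```

Setting `rho=0` reduces `sam_step` to an ordinary gradient-descent step, which gives a simple baseline for comparing the resulting gap under the same assumptions.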