Deep ReLU Network
Deep ReLU networks, characterized by their use of rectified linear unit activation functions, are a central focus in deep learning research, with current efforts concentrating on understanding their approximation capabilities, generalization properties, and optimization dynamics. Researchers are exploring various architectures, including deep operator networks and "nested" networks, and developing novel algorithms like component-based sketching to improve training efficiency and generalization performance. These investigations aim to provide a stronger theoretical foundation for the remarkable empirical success of deep ReLU networks, ultimately leading to more robust and efficient deep learning models for diverse applications.
Papers
December 27, 2024
December 24, 2024
November 11, 2024
October 8, 2024
October 1, 2024
September 21, 2024
July 4, 2024
June 8, 2024
May 10, 2024
February 19, 2024
January 22, 2024
December 27, 2023
December 7, 2023
November 7, 2023
October 11, 2023
August 15, 2023
June 20, 2023
June 13, 2023
May 16, 2023