Activation Function
Activation functions are crucial components of neural networks, introducing nonlinearity to enable the learning of complex patterns from data. Current research focuses on developing novel activation functions, including those with learnable parameters, and exploring their impact within various architectures like Kolmogorov-Arnold Networks and transformers. These efforts aim to improve model performance, efficiency, and interpretability across diverse applications, from image classification and generation to solving partial differential equations and formal verification tasks. The ongoing search for optimal activation functions is driving significant advancements in the field of deep learning.
Papers
On the Sample Complexity of Two-Layer Networks: Lipschitz vs. Element-Wise Lipschitz Activation
Amit Daniely, Elad Granot
SFPDML: Securer and Faster Privacy-Preserving Distributed Machine Learning based on MKTFHE
Hongxiao Wang, Zoe L. Jiang, Yanmin Zhao, Siu-Ming Yiu, Peng Yang, Man Chen, Zejiu Tan, Bohan Jin