Deep Learning Architecture
Deep learning architectures are complex computational models designed to learn intricate patterns from data, primarily aiming to improve the accuracy and efficiency of machine learning tasks. Current research focuses on optimizing existing architectures like convolutional neural networks (CNNs), recurrent neural networks (RNNs), transformers, and autoencoders, as well as developing novel activation functions and exploring efficient search algorithms for optimal network structures. These advancements are significantly impacting various fields, from medical image analysis and anomaly detection in complex systems to natural language processing and 3D data processing, driving improvements in accuracy, efficiency, and interpretability.
Papers
From NeurODEs to AutoencODEs: a mean-field control framework for width-varying Neural Networks
Cristina Cipriani, Massimo Fornasier, Alessandro Scagliotti
Harmonizing Feature Attributions Across Deep Learning Architectures: Enhancing Interpretability and Consistency
Md Abdul Kadir, Gowtham Krishna Addluri, Daniel Sonntag