Deep Learning Model
Deep learning models are complex computational systems designed to learn patterns from data, achieving high accuracy in various tasks like image classification, natural language processing, and time series forecasting. Current research emphasizes improving model efficiency (e.g., through parameter reduction and optimized training algorithms), robustness (e.g., against adversarial attacks and noisy data), and interpretability (e.g., via feature attribution and visualization techniques), often employing architectures such as convolutional neural networks (CNNs), recurrent neural networks (RNNs, including LSTMs and GRUs), and transformers. These advancements are driving significant impact across diverse fields, from medical diagnosis and environmental monitoring to industrial automation and personalized medicine.
Papers
What could go wrong? Discovering and describing failure modes in computer vision
Gabriela Csurka, Tyler L. Hayes, Diane Larlus, Riccardo Volpi
An experimental comparative study of backpropagation and alternatives for training binary neural networks for image classification
Ben Crulis, Barthelemy Serres, Cyril de Runz, Gilles Venturini
Accelerating Domain-Aware Electron Microscopy Analysis Using Deep Learning Models with Synthetic Data and Image-Wide Confidence Scoring
Matthew J. Lynch, Ryan Jacobs, Gabriella Bruno, Priyam Patki, Dane Morgan, Kevin G. Field
A Survey of Mamba
Haohao Qu, Liangbo Ning, Rui An, Wenqi Fan, Tyler Derr, Hui Liu, Xin Xu, Qing Li