Deep Learning Model
Deep learning models are complex computational systems designed to learn patterns from data, achieving high accuracy in various tasks like image classification, natural language processing, and time series forecasting. Current research emphasizes improving model efficiency (e.g., through parameter reduction and optimized training algorithms), robustness (e.g., against adversarial attacks and noisy data), and interpretability (e.g., via feature attribution and visualization techniques), often employing architectures such as convolutional neural networks (CNNs), recurrent neural networks (RNNs, including LSTMs and GRUs), and transformers. These advancements are driving significant impact across diverse fields, from medical diagnosis and environmental monitoring to industrial automation and personalized medicine.
Papers
ELUDE: Generating interpretable explanations via a decomposition into labelled and unlabelled features
Vikram V. Ramaswamy, Sunnie S. Y. Kim, Nicole Meister, Ruth Fong, Olga Russakovsky
Robust SAR ATR on MSTAR with Deep Learning Models trained on Full Synthetic MOCEM data
Benjamin Camus, Corentin Le Barbu, Eric Monteux
Learning the Space of Deep Models
Gianluca Berardi, Luca De Luigi, Samuele Salti, Luigi Di Stefano
Weakly-supervised segmentation using inherently-explainable classification models and their application to brain tumour classification
Soumick Chatterjee, Hadya Yassin, Florian Dubost, Andreas Nürnberger, Oliver Speck
8-bit Numerical Formats for Deep Neural Networks
Badreddine Noune, Philip Jones, Daniel Justus, Dominic Masters, Carlo Luschi
Deep Learning Models of the Discrete Component of the Galactic Interstellar Gamma-Ray Emission
Alexander Shmakov, Mohammadamin Tavakoli, Pierre Baldi, Christopher M. Karwin, Alex Broughton, Simona Murgia