Deep Learning Model
Deep learning models are complex computational systems designed to learn patterns from data, achieving high accuracy in various tasks like image classification, natural language processing, and time series forecasting. Current research emphasizes improving model efficiency (e.g., through parameter reduction and optimized training algorithms), robustness (e.g., against adversarial attacks and noisy data), and interpretability (e.g., via feature attribution and visualization techniques), often employing architectures such as convolutional neural networks (CNNs), recurrent neural networks (RNNs, including LSTMs and GRUs), and transformers. These advancements are driving significant impact across diverse fields, from medical diagnosis and environmental monitoring to industrial automation and personalized medicine.
Papers
A topological description of loss surfaces based on Betti Numbers
Maria Sofia Bucarelli, Giuseppe Alessio D'Inverno, Monica Bianchini, Franco Scarselli, Fabrizio Silvestri
A Large-Scale Empirical Study on Improving the Fairness of Image Classification Models
Junjie Yang, Jiajun Jiang, Zeyu Sun, Junjie Chen
TSPP: A Unified Benchmarking Tool for Time-series Forecasting
Jan Bączek, Dmytro Zhylko, Gilberto Titericz, Sajad Darabi, Jean-Francois Puget, Izzy Putterman, Dawid Majchrowski, Anmol Gupta, Kyle Kranen, Pawel Morkisz
Understanding Distributed Representations of Concepts in Deep Neural Networks without Supervision
Wonjoon Chang, Dahee Kwon, Jaesik Choi