Deep Learning Model
Deep learning models are complex computational systems designed to learn patterns from data, achieving high accuracy in various tasks like image classification, natural language processing, and time series forecasting. Current research emphasizes improving model efficiency (e.g., through parameter reduction and optimized training algorithms), robustness (e.g., against adversarial attacks and noisy data), and interpretability (e.g., via feature attribution and visualization techniques), often employing architectures such as convolutional neural networks (CNNs), recurrent neural networks (RNNs, including LSTMs and GRUs), and transformers. These advancements are driving significant impact across diverse fields, from medical diagnosis and environmental monitoring to industrial automation and personalized medicine.
Papers
AI-driven Structure Detection and Information Extraction from Historical Cadastral Maps (Early 19th Century Franciscean Cadastre in the Province of Styria) and Current High-resolution Satellite and Aerial Imagery for Remote Sensing
Wolfgang Göderle, Christian Macher, Katrin Mauthner, Oliver Pimas, Fabian Rampetsreiter
TaskMet: Task-Driven Metric Learning for Model Learning
Dishank Bansal, Ricky T. Q. Chen, Mustafa Mukadam, Brandon Amos
Simplifying Neural Network Training Under Class Imbalance
Ravid Shwartz-Ziv, Micah Goldblum, Yucen Lily Li, C. Bayan Bruss, Andrew Gordon Wilson
AI-driven emergence of frequency information non-uniform distribution via THz metasurface spectrum prediction
Xiaohua Xing, Yuqi Ren, Die Zou, Qiankun Zhang, Bingxuan Mao, Jianquan Yao, Deyi Xiong, Shuang Zhang, Liang Wu