Design Choice
Design choice in machine learning and related fields focuses on optimizing model performance and robustness by systematically investigating the impact of various design parameters. Current research emphasizes the effects of model architecture (e.g., transformers, joint-embedding predictive architectures), data preprocessing techniques, training strategies (including pre-training and fine-tuning), and loss functions on outcomes such as accuracy, efficiency, and fairness. These investigations are crucial for improving the reliability and generalizability of models across diverse applications, ranging from time series forecasting and image classification to natural language processing and anomaly detection. Understanding these design choices is essential for building more effective and responsible AI systems.
Papers
Exploring Design Choices for Building Language-Specific LLMs
Atula Tejaswi, Nilesh Gupta, Eunsol Choi
Understanding Different Design Choices in Training Large Time Series Models
Yu-Neng Chuang, Songchen Li, Jiayi Yuan, Guanchu Wang, Kwei-Herng Lai, Leisheng Yu, Sirui Ding, Chia-Yuan Chang, Qiaoyu Tan, Daochen Zha, Xia Hu