Data Centric
Data-centric AI prioritizes high-quality data as the primary driver of successful machine learning, shifting focus from solely model optimization. Current research emphasizes improving data quality through techniques like data augmentation, feature engineering, and careful dataset curation, often employing transformer-based models and other deep learning architectures for analysis. This approach is crucial for addressing issues like algorithmic bias, improving model robustness and generalization, and ultimately leading to more reliable and trustworthy AI systems across diverse applications, from healthcare and finance to earth observation and natural language processing.
Papers
Solving Data-centric Tasks using Large Language Models
Shraddha Barke, Christian Poelitz, Carina Suzana Negreanu, Benjamin Zorn, José Cambronero, Andrew D. Gordon, Vu Le, Elnaz Nouri, Nadia Polikarpova, Advait Sarkar, Brian Slininger, Neil Toronto, Jack Williams
Efficient Multimodal Learning from Data-centric Perspective
Muyang He, Yexin Liu, Boya Wu, Jianhao Yuan, Yueze Wang, Tiejun Huang, Bo Zhao