Data Perspective
Data perspective in machine learning research emphasizes the crucial role of datasets in shaping model performance and generalization. Current research focuses on analyzing dataset characteristics to improve model robustness, understand learning mechanisms (like in-context learning), and optimize training strategies (such as curriculum learning and data augmentation using LLMs). This data-centric approach is transforming various fields, from improving the accuracy and reliability of autonomous driving systems to enhancing the personalization of text-to-image models and advancing the understanding of complex algorithms like graph neural networks.
Papers
A Data Generation Perspective to the Mechanism of In-Context Learning
Haitao Mao, Guangliang Liu, Yao Ma, Rongrong Wang, Kristen Johnson, Jiliang Tang
Evaluating the Robustness of Off-Road Autonomous Driving Segmentation against Adversarial Attacks: A Dataset-Centric analysis
Pankaj Deoli, Rohit Kumar, Axel Vierling, Karsten Berns