Real World Dataset
Real-world datasets are crucial for training and evaluating machine learning models, as idealized datasets often fail to capture the complexity and variability of real-world scenarios. Current research focuses on creating diverse datasets for various applications, including anomaly detection, robotic manipulation, and medical image analysis, often incorporating techniques like data augmentation and active learning to improve model robustness and generalization. The availability of high-quality, realistic datasets is essential for advancing the field and ensuring the reliable deployment of machine learning models in practical settings across numerous domains.
Papers
OCTScenes: A Versatile Real-World Dataset of Tabletop Scenes for Object-Centric Learning
Yinxuan Huang, Tonglin Chen, Zhimeng Shen, Jinghao Huang, Bin Li, Xiangyang Xue
MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification
Dequan Wang, Xiaosong Wang, Lilong Wang, Mengzhang Li, Qian Da, Xiaoqiang Liu, Xiangyu Gao, Jun Shen, Junjun He, Tian Shen, Qi Duan, Jie Zhao, Kang Li, Yu Qiao, Shaoting Zhang