Annotated Dataset
Annotated datasets are collections of data points labeled with specific information, crucial for training and evaluating machine learning models, particularly in complex domains like medicine and robotics. Current research emphasizes creating high-quality annotations, often incorporating AI-assisted methods to reduce manual effort, and addressing challenges like noisy or partially annotated data through techniques such as active learning, multi-task learning, and self-supervised learning. These datasets are vital for advancing various fields, enabling the development of more accurate and robust models for applications ranging from medical image analysis and natural language processing to robotics and e-commerce.
Papers
PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds
Kerem Mertoğlu, Yusuf Şalk, Server Karahan Sarıkaya, Kaya Turgut, Yasemin Evrenesoğlu, Hakan Çevikalp, Ömer Nezih Gerek, Helin Dutağacı, David Rousseau
dopanim: A Dataset of Doppelganger Animals with Noisy Annotations from Multiple Humans
Marek Herde, Denis Huseljic, Lukas Rauch, Bernhard Sick