Annotated Dataset
Annotated datasets are collections of data points labeled with specific information, crucial for training and evaluating machine learning models, particularly in complex domains like medicine and robotics. Current research emphasizes creating high-quality annotations, often incorporating AI-assisted methods to reduce manual effort, and addressing challenges like noisy or partially annotated data through techniques such as active learning, multi-task learning, and self-supervised learning. These datasets are vital for advancing various fields, enabling the development of more accurate and robust models for applications ranging from medical image analysis and natural language processing to robotics and e-commerce.
Papers
Simple but Effective Unsupervised Classification for Specified Domain Images: A Case Study on Fungi Images
Zhaocong liu, Fa Zhang, Lin Cheng, Huanxi Deng, Xiaoyan Yang, Zhenyu Zhang, Chichun Zhou
Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models
Tingyu Xie, Qi Li, Yan Zhang, Zuozhu Liu, Hongwei Wang