Annotated Dataset
Annotated datasets are collections of data points labeled with specific information, crucial for training and evaluating machine learning models, particularly in complex domains like medicine and robotics. Current research emphasizes creating high-quality annotations, often incorporating AI-assisted methods to reduce manual effort, and addressing challenges like noisy or partially annotated data through techniques such as active learning, multi-task learning, and self-supervised learning. These datasets are vital for advancing various fields, enabling the development of more accurate and robust models for applications ranging from medical image analysis and natural language processing to robotics and e-commerce.
109papers
Papers
May 13, 2025
Advancing Food Nutrition Estimation via Visual-Ingredient Feature Fusion
Huiyan Qi, Bin Zhu, Chong-Wah Ngo, Jingjing Chen, Ee-Peng LimSingapore Management University●Fudan UniversityA Mamba-based Network for Semi-supervised Singing Melody Extraction Using Confidence Binary Regularization
Xiaoliang He, Kangjie Dong, Jingkai Cao, Shuai Yu, Wei Li, Yi YuDonghua University●Fudan University●Hiroshima University
March 3, 2025
Automated Annotation of Evolving Corpora for Augmenting Longitudinal Network Data: A Framework Integrating Large Language Models and Expert Knowledge
Xiao Liu, Zirui Wu, Jiayi Li, Zhicheng Shao, Xun Pang, Yansong FengPeking University●School of International Studies●Peking UniversityKoWit-24: A Richly Annotated Dataset of Wordplay in News Headlines
Alexander Baranov, Anna Palatkina, Yulia Makovka, Pavel BraslavskiHSE University
February 6, 2025