MAESTRO Dataset
The MAESTRO dataset, while not explicitly defined in the provided abstracts, appears to be a collection of diverse datasets used to benchmark and evaluate various machine learning models, primarily focusing on multimodal tasks and addressing challenges in data quality, label accuracy, and model generalization. Current research leverages large language models (LLMs), transformer architectures, and deep learning techniques like nnUNet and diffusion models to improve performance across diverse applications, including medical image analysis, content moderation, and natural language processing. The availability of these datasets and the associated research significantly advances the field by providing standardized benchmarks for evaluating model performance and facilitating the development of more robust and reliable AI systems.
Papers
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
Hugo Laurençon, Lucile Saulnier, Léo Tronchon, Stas Bekman, Amanpreet Singh, Anton Lozhkov, Thomas Wang, Siddharth Karamcheti, Alexander M. Rush, Douwe Kiela, Matthieu Cord, Victor Sanh
DIAS: A Dataset and Benchmark for Intracranial Artery Segmentation in DSA sequences
Wentao Liu, Tong Tian, Lemeng Wang, Weijin Xu, Lei Li, Haoyuan Li, Wenyi Zhao, Siyu Tian, Xipeng Pan, Huihua Yang, Feng Gao, Yiming Deng, Xin Yang, Ruisheng Su
PromptNER: Prompt Locating and Typing for Named Entity Recognition
Yongliang Shen, Zeqi Tan, Shuhui Wu, Wenqi Zhang, Rongsheng Zhang, Yadong Xi, Weiming Lu, Yueting Zhuang
ReConPatch : Contrastive Patch Representation Learning for Industrial Anomaly Detection
Jeeho Hyun, Sangyun Kim, Giyoung Jeon, Seung Hwan Kim, Kyunghoon Bae, Byung Jun Kang