MAESTRO Dataset
The MAESTRO dataset, while not explicitly defined in the provided abstracts, appears to be a collection of diverse datasets used to benchmark and evaluate various machine learning models, primarily focusing on multimodal tasks and addressing challenges in data quality, label accuracy, and model generalization. Current research leverages large language models (LLMs), transformer architectures, and deep learning techniques like nnUNet and diffusion models to improve performance across diverse applications, including medical image analysis, content moderation, and natural language processing. The availability of these datasets and the associated research significantly advances the field by providing standardized benchmarks for evaluating model performance and facilitating the development of more robust and reliable AI systems.
Papers
L3Cube-MahaSum: A Comprehensive Dataset and BART Models for Abstractive Text Summarization in Marathi
Pranita Deshmukh, Nikita Kulkarni, Sanhita Kulkarni, Kareena Manghani, Raviraj Joshi
Bukva: Russian Sign Language Alphabet
Karina Kvanchiani, Petr Surovtsev, Alexander Nagaev, Elizaveta Petrova, Alexander Kapitanov
Autopet III challenge: Incorporating anatomical knowledge into nnUNet for lesion segmentation in PET/CT
Hamza Kalisch, Fabian Hörst, Ken Herrmann, Jens Kleesiek, Constantin Seibold
BanStereoSet: A Dataset to Measure Stereotypical Social Biases in LLMs for Bangla
Mahammed Kamruzzaman, Abdullah Al Monsur, Shrabon Das, Enamul Hassan, Gene Louis Kim