Novel Dataset
Recent research focuses on creating novel datasets for diverse applications, addressing limitations in existing resources and enabling advancements in various fields. These datasets span various modalities, including text, images, video, and sensor data, and are designed for tasks such as code generation, object detection, natural language processing, and multi-agent reinforcement learning. Common model architectures employed include transformers, convolutional neural networks, and ensemble methods, often benchmarked against established baselines. The development of these high-quality datasets is crucial for improving the accuracy and reliability of machine learning models across a wide range of scientific and practical applications.
Papers
TaskComplexity: A Dataset for Task Complexity Classification with In-Context Learning, FLAN-T5 and GPT-4o Benchmarks
Areeg Fahad Rasheed, M. Zarkoosh, Safa F. Abbas, Sana Sabah Al-Azzawi
CycleCrash: A Dataset of Bicycle Collision Videos for Collision Prediction and Analysis
Nishq Poorav Desai, Ali Etemad, Michael Greenspan
SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section
Leandro Carísio Fernandes, Gustavo Bartz Guedes, Thiago Soares Laitz, Thales Sales Almeida, Rodrigo Nogueira, Roberto Lotufo, Jayr Pereira
Real-Time Energy Pricing in New Zealand: An Evolving Stream Analysis
Yibin Sun, Heitor Murilo Gomes, Bernhard Pfahringer, Albert Bifet