Data Set
Datasets are crucial for training and evaluating machine learning models, particularly in areas like natural language processing, computer vision, and audio analysis. Current research emphasizes creating diverse and high-quality datasets addressing specific challenges, such as data imbalance, cross-lingual inconsistencies, and the need for realistic representations of real-world scenarios. This involves developing novel annotation techniques, incorporating multiple data modalities (e.g., text, images, audio), and employing various model architectures (e.g., transformers, convolutional neural networks) for analysis and benchmark creation. The availability of well-designed datasets directly impacts the development of robust and reliable machine learning models, ultimately advancing scientific understanding and improving practical applications across numerous fields.
Papers
PolyIE: A Dataset of Information Extraction from Polymer Material Scientific Literature
Jerry Junyang Cheung, Yuchen Zhuang, Yinghao Li, Pranav Shetty, Wantian Zhao, Sanjeev Grampurohit, Rampi Ramprasad, Chao Zhang
Transpose Attack: Stealing Datasets with Bidirectional Training
Guy Amit, Mosh Levy, Yisroel Mirsky
JRDB-Traj: A Dataset and Benchmark for Trajectory Forecasting in Crowds
Saeed Saadatnejad, Yang Gao, Hamid Rezatofighi, Alexandre Alahi
Extraction of Atypical Aspects from Customer Reviews: Datasets and Experiments with Language Models
Smita Nannaware, Erfan Al-Hossami, Razvan Bunescu
BanMANI: A Dataset to Identify Manipulated Social Media News in Bangla
Mahammed Kamruzzaman, Md. Minul Islam Shovon, Gene Louis Kim
Dense Video Captioning: A Survey of Techniques, Datasets and Evaluation Protocols
Iqra Qasim, Alexander Horsch, Dilip K. Prasad
ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos
Te-Lin Wu, Zi-Yi Dou, Qingyuan Hu, Yu Hou, Nischal Reddy Chandra, Marjorie Freedman, Ralph M. Weischedel, Nanyun Peng
InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV Images
André Luiz Buarque Vieira e Silva, Heitor de Castro Felix, Franscisco Paulo Magalhães Simões, Veronica Teichrieb, Michel Mozinho dos Santos, Hemir Santiago, Virginia Sgotti, Henrique Lott Neto