Video Dataset
Video datasets are crucial for training and evaluating computer vision models capable of understanding video content, encompassing diverse tasks like action recognition, object tracking, and quality assessment. Current research emphasizes creating benchmarks with varied video sources (e.g., natural scenes, AI-generated content), incorporating multimodal information (text, audio), and focusing on challenging scenarios such as unusual activity localization and camouflaged object segmentation. These advancements are driving progress in video understanding, with applications ranging from improved surveillance systems and e-commerce experiences to more sophisticated content moderation and conservation efforts.
Papers
OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
Hui Li, Mingwang Xu, Yun Zhan, Shan Mu, Jiaye Li, Kaihui Cheng, Yuxuan Chen, Tan Chen, Mao Ye, Jingdong Wang, Siyu Zhu
Video Set Distillation: Information Diversification and Temporal Densification
Yinjie Zhao, Heng Zhao, Bihan Wen, Yew-Soon Ong, Joey Tianyi Zhou