Video Dataset
Video datasets are crucial for training and evaluating computer vision models capable of understanding video content, encompassing diverse tasks like action recognition, object tracking, and quality assessment. Current research emphasizes creating benchmarks with varied video sources (e.g., natural scenes, AI-generated content), incorporating multimodal information (text, audio), and focusing on challenging scenarios such as unusual activity localization and camouflaged object segmentation. These advancements are driving progress in video understanding, with applications ranging from improved surveillance systems and e-commerce experiences to more sophisticated content moderation and conservation efforts.
Papers
Learning from One Continuous Video Stream
João Carreira, Michael King, Viorica Pătrăucean, Dilara Gokay, Cătălin Ionescu, Yi Yang, Daniel Zoran, Joseph Heyward, Carl Doersch, Yusuf Aytar, Dima Damen, Andrew Zisserman
Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement
Ziyu Wang, Yue Xu, Cewu Lu, Yong-Lu Li