Source Video
Source video analysis encompasses a broad range of research aiming to extract meaningful information and perform various tasks directly from video data. Current efforts focus on developing robust and efficient methods for tasks such as 3D motion estimation, object detection and tracking, and multimodal analysis integrating audio and other sensor data, often employing deep learning architectures like transformers and diffusion models. These advancements have significant implications for diverse fields, including autonomous driving, medical diagnosis, and multimedia content creation, by enabling more sophisticated and automated processing of visual information.
Papers
Generative Regression Based Watch Time Prediction for Video Recommendation: Model and Performance
Hongxu Ma, Kai Tian, Tao Zhang, Xuefeng Zhang, Chunjie Chen, Han Li, Jihong Guan, Shuigeng Zhou
STNMamba: Mamba-based Spatial-Temporal Normality Learning for Video Anomaly Detection
Zhangxun Li, Mengyang Zhao, Xuan Yang, Yang Liu, Jiamu Sheng, Xinhua Zeng, Tian Wang, Kewei Wu, Yu-Gang Jiang
Continuous Patient Monitoring with AI: Real-Time Analysis of Video in Hospital Care Settings
Paolo Gabriel, Peter Rehani, Tyler Troy, Tiffany Wyatt, Michael Choma, Narinder Singh
Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Yan Zhang, Gangyan Zeng, Huawen Shen, Daiqing Wu, Yu Zhou, Can Ma
Can video generation replace cinematographers? Research on the cinematic language of generated video
Xiaozhe Li, Kai WU, Siyi Yang, YiZhan Qu, Guohua.Zhang, Zhiyu Chen, Jiayao Li, Jiangchuan Mu, Xiaobin Hu, Wen Fang, Mingliang Xiong, Hao Deng, Qingwen Liu, Gang Li, Bin He
Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
Junkai Fan, Kun Wang, Zhiqiang Yan, Xiang Chen, Shangbing Gao, Jun Li, Jian Yang
BiM-VFI: directional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions
Wonyong Seo, Jihyong Oh, Munchurl Kim