Video Sequence
Video sequence analysis focuses on understanding and interpreting the information contained within temporally ordered image frames, aiming to extract meaningful information and enable various applications. Current research emphasizes developing robust models for tasks like anomaly detection, action recognition, and video generation, often employing deep learning architectures such as convolutional neural networks, recurrent neural networks (including GRUs and LSTMs), transformers (including Swin Transformers), and diffusion models. These advancements are driving progress in diverse fields, including video editing, autonomous driving, healthcare (e.g., medical image analysis), and human-computer interaction, by enabling more efficient and accurate processing of video data.
Papers
NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing
Ting-Hsuan Chen, Jiewen Chan, Hau-Shiang Shiu, Shih-Han Yen, Chang-Han Yeh, Yu-Lun Liu
ProcessPainter: Learn Painting Process from Sequence Data
Yiren Song, Shijie Huang, Chen Yao, Xiaojun Ye, Hai Ci, Jiaming Liu, Yuxuan Zhang, Mike Zheng Shou