Video Chapter Generation

Video chapter generation focuses on automatically segmenting long videos into meaningful chapters, each with a descriptive title, to improve user navigation and understanding. Recent research emphasizes developing robust models that integrate visual and textual information, often employing multi-modal architectures and leveraging large-scale datasets of user-generated chapters for training and evaluation. This work is significant because it addresses the growing need for efficient video organization and summarization, impacting areas like video search, accessibility, and educational content delivery.

Papers

March 12, 2024

AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production
Jiuniu Wang, Zehua Du, Yuyuan Zhao, Bo Yuan, Kexiang Wang, Jian Liang, Yaxi Zhao, Yihen Lu, Gengliang Li, Junlong Gao, Xin Tu, Zhenyu Guo
Generated Content Agent System Sophisticated Agent Agent Based Evolution Video Chapter Generation

September 25, 2023

VidChapters-7M: Video Chapters at Scale
Antoine Yang, Arsha Nagrani, Ivan Laptev, Josef Sivic, Cordelia Schmid
Visual Analogue Scale Video Annotation Ground a Video Video Chapter Generation

September 26, 2022

Multi-modal Video Chapter Generation
Xiao Cao, Zitan Chen, Canyu Le, Lei Meng
Global Feature Annotated Chapter Information User Generated Video Chapter to Chapter Video Chapter Generation

Video Chapter Generation

Papers

AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production

VidChapters-7M: Video Chapters at Scale

Multi-modal Video Chapter Generation