Highlight Detection
Highlight detection in videos aims to automatically identify key moments, either based on user queries (moment retrieval) or inherent visual/audio cues (highlight detection). Current research heavily utilizes transformer-based architectures, often incorporating multimodal features (visual and audio) and leveraging techniques like attention mechanisms and denoising diffusion models to improve accuracy and efficiency. This field is crucial for enhancing video accessibility and usability, with applications ranging from automated video summarization and trailer generation to personalized content recommendation and improved video editing workflows.
Papers
XAI for time-series classification leveraging image highlight methods
Georgios Makridis, Georgios Fatouros, Vasileios Koukos, Dimitrios Kotios, Dimosthenis Kyriazis, Ioannis Soldatos
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
Yicheng Xiao, Zhuoyan Luo, Yong Liu, Yue Ma, Hengwei Bian, Yatai Ji, Yujiu Yang, Xiu Li