Scene Detection

Video scene detection automatically segments a video into semantically meaningful units, such as shots or scenes, to support downstream tasks like video summarization and content analysis. Current research aims to improve both accuracy and efficiency using deep learning models, including transformers and state-space models, often combined with semi-supervised learning and multi-modal data fusion (e.g., combining visual and textual information). These advances matter for applications ranging from video indexing and retrieval to automated content generation and the analysis of historical visual archives, and they feed into work across computer vision, multimedia processing, and information retrieval.
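To make the segmentation task concrete, here is a minimal sketch of the simplest classical baseline: declare a boundary wherever consecutive frame features differ by more than a threshold. It is not the method of any paper listed below, which rely on learned models. The function names, the synthetic three-bin features (standing in for something like normalized color histograms), and the threshold value are all assumptions chosen for illustration.

```python
from typing import List, Sequence


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of absolute differences between two equal-length feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def detect_boundaries(
    features: Sequence[Sequence[float]], threshold: float
) -> List[int]:
    """Return each index i at which frame i starts a new segment.

    A boundary is declared when the distance between consecutive frame
    features exceeds `threshold` (a hypothetical tuning parameter).
    """
    return [
        i
        for i in range(1, len(features))
        if l1_distance(features[i - 1], features[i]) > threshold
    ]


def split_segments(num_frames: int, boundaries: Sequence[int]) -> List[range]:
    """Turn boundary indices into contiguous, non-overlapping frame ranges."""
    starts = [0, *boundaries]
    ends = [*boundaries, num_frames]
    return [range(start, end) for start, end in zip(starts, ends)]


# Synthetic per-frame features for three visually distinct segments
# of 4, 3, and 5 frames (illustrative values only).
frames = (
    [[0.9, 0.1, 0.0]] * 4
    + [[0.1, 0.8, 0.1]] * 3
    + [[0.0, 0.2, 0.8]] * 5
)

boundaries = detect_boundaries(frames, threshold=0.5)
segments = split_segments(len(frames), boundaries)

for segment in segments:
    print(f"frames {segment.start}..{segment.stop - 1}")
```

A fixed threshold on low-level features finds abrupt cuts at best. Grouping shots into scenes requires semantic context, which is the gap the learned and multi-modal approaches in this area aim to close.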

Papers

December 23, 2024

Modality-Aware Shot Relating and Comparing for Video Scene Detection
Jiawei Tan, Hongxing Wang, Kang Dang, Jiaxin Li, Zhilong Ou
Shot Classification Multi Modal Cue Scene Detection

September 15, 2024

Efficient Video to Audio Mapper with Visual Scene Detection
Mingjing Yi, Ming Li
Audio Visual Cross Modality Video to Video Video Dynamic Scene Detection Multi Scene

July 13, 2024

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix
Xiaopei Wu, Liang Peng, Liang Xie, Yuenan Hou, Binbin Lin, Xiaoshui Huang, Haifeng Liu, Deng Cai, Wanli Ouyang
3D Object Detection Semi Supervised Pseudo Label Semi Supervised 3D Object Detection Scene Detection

January 9, 2024

Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition
Xuzheng Yu, Chen Jiang, Wei Zhang, Tian Gan, Linlin Chao, Jianan Zhao, Yuan Cheng, Qingpei Guo, Wei Chu
Video Representation Video Representation Learning Scene Recognition Temporal Perspective Scene Detection Video Level Representation

October 10, 2023

Blind Dates: Examining the Expression of Temporality in Historical Photographs
Alexandra Barancová, Melvin Wevers, Nanne van Noord
Temporal Information Computer Vision Model Zero Shot Classification Human Expression Scene Detection Integrating Temporality

September 21, 2023

Video Scene Location Recognition with Neural Networks
Lukáš Korel, Petr Pulc, Jiří Tumpach, Martin Holeňa
Neural Network LSTM Network Pre Trained Convolutional Neural Network Video Sequence Scene Recognition Bidirectional LSTM Scene Detection

June 11, 2023

Stable Remaster: Bridging the Gap Between Old Content and New Displays
Nathan Paull, Shuvam Keshari, Yian Wong
Vision Task High Frequency Display Aspect Ratio Scene Detection

May 21, 2023

A Dual-level Detection Method for Video Copy Detection
Tianyi Wang, Feipeng Ma, Zhenhua Liu, Fengyun Rao
Scene Detection Video Forgery Video Copy

December 29, 2022

Efficient Movie Scene Detection using State-Space Transformers
Md Mohaiminul Islam, Mahmudul Hasan, Kishan Shamsundar Athrey, Tony Braskich, Gedas Bertasius
Video Recognition Scene Detection Augmented Transformer

May 17, 2022

Learnable Optimal Sequential Grouping for Video Scene Detection
Daniel Rotman, Yevgeny Yaroker, Elad Amrani, Udi Barzelay, Rami Ben-Ari
Sequential Learning Optimal Grouping Scene Detection