Source Video
Source video analysis encompasses a broad range of research aiming to extract meaningful information and perform various tasks directly from video data. Current efforts focus on developing robust and efficient methods for tasks such as 3D motion estimation, object detection and tracking, and multimodal analysis integrating audio and other sensor data, often employing deep learning architectures like transformers and diffusion models. These advancements have significant implications for diverse fields, including autonomous driving, medical diagnosis, and multimedia content creation, by enabling more sophisticated and automated processing of visual information.
Papers
RoboMNIST: A Multimodal Dataset for Multi-Robot Activity Recognition Using WiFi Sensing, Video, and Audio
Kian Behzad, Rojin Zandi, Elaheh Motamedi, Hojjat Salehinejad, Milad Siami
Turbulence Strength $C_n^2$ Estimation from Video using Physics-based Deep Learning
Ripon Kumar Saha, Esen Salcin, Jihoo Kim, Joseph Smith, Suren Jayasuriya