Video Prediction
Video prediction aims to generate future frames of a video sequence, based on preceding frames, addressing challenges in modeling complex dynamics and uncertainty. Current research emphasizes incorporating procedural knowledge and physical constraints into data-driven models, often employing architectures like transformers, diffusion models, and state-space models with various techniques for handling long-term dependencies and multi-modality (e.g., integrating text or tactile data). This field is significant for its potential applications in robotics, autonomous driving, and other areas requiring predictive modeling of dynamic visual scenes, driving advancements in both computer vision and artificial intelligence.
Papers
October 30, 2024
October 29, 2024
October 24, 2024
October 21, 2024
October 20, 2024
June 26, 2024
June 25, 2024
June 18, 2024
June 10, 2024
June 9, 2024
May 25, 2024
May 24, 2024
May 23, 2024
May 7, 2024
April 17, 2024
March 27, 2024
March 14, 2024