Video Prediction
Video prediction aims to generate future frames of a video sequence, based on preceding frames, addressing challenges in modeling complex dynamics and uncertainty. Current research emphasizes incorporating procedural knowledge and physical constraints into data-driven models, often employing architectures like transformers, diffusion models, and state-space models with various techniques for handling long-term dependencies and multi-modality (e.g., integrating text or tactile data). This field is significant for its potential applications in robotics, autonomous driving, and other areas requiring predictive modeling of dynamic visual scenes, driving advancements in both computer vision and artificial intelligence.
Papers
August 24, 2022
August 19, 2022
June 27, 2022
June 23, 2022
June 15, 2022
June 9, 2022
June 8, 2022
May 19, 2022
May 4, 2022
April 20, 2022
April 12, 2022
March 30, 2022
March 29, 2022
March 17, 2022
February 1, 2022
December 22, 2021