World Model
World models are computational representations of environments, aiming to predict future states based on actions, enabling more efficient and robust decision-making in artificial intelligence. Current research focuses on improving the accuracy and generalization of these models, particularly through the use of transformer-based architectures, generative models (like diffusion models and VAEs), and techniques like model-based reinforcement learning. This work is significant because accurate world models are crucial for developing autonomous agents capable of complex reasoning and planning in diverse, real-world scenarios, impacting fields like robotics, autonomous driving, and healthcare.
Papers
Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model Learning
Xinyang Gu, Yen-Jen Wang, Xiang Zhu, Chengming Shi, Yanjiang Guo, Yichen Liu, Jianyu Chen
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
Yu Yang, Jianbiao Mei, Yukai Ma, Siliang Du, Wenqing Chen, Yijie Qian, Yuxiang Feng, Yong Liu
UnO: Unsupervised Occupancy Fields for Perception and Forecasting
Ben Agro, Quinlan Sykora, Sergio Casas, Thomas Gilles, Raquel Urtasun
Pandora: Towards General World Model with Natural Language Actions and Video States
Jiannan Xiang, Guangyi Liu, Yi Gu, Qiyue Gao, Yuting Ning, Yuheng Zha, Zeyu Feng, Tianhua Tao, Shibo Hao, Yemin Shi, Zhengzhong Liu, Eric P. Xing, Zhiting Hu