Real World
Research on "real-world" applications focuses on bridging the gap between simulated and real-world environments, particularly for complex tasks like robotics, autonomous driving, and natural language processing. Current efforts utilize various model architectures, including large language models (LLMs), diffusion models, reinforcement learning (RL), and graph neural networks, to improve robustness, generalization, and efficiency in diverse real-world scenarios. This research is crucial for advancing AI capabilities beyond controlled settings and enabling practical applications in areas such as healthcare, manufacturing, and transportation, while also addressing challenges like data scarcity, safety, and bias.
Papers
3D$^2$-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling
Zichen Tang, Hongyu Yang, Hanchen Zhang, Jiaxin Chen, Di Huang
Learning Implicit Features with Flow Infused Attention for Realistic Virtual Try-On
Delong Zhang, Qiwei Huang, Yuanliu Liu, Yang Sun, Wei-Shi Zheng, Pengfei Xiong, Wei Zhang
VisionArena: 230K Real World User-VLM Conversations with Preference Labels
Christopher Chou, Lisa Dunlap, Koki Mashita, Krishna Mandal, Trevor Darrell, Ion Stoica, Joseph E. Gonzalez, Wei-Lin Chiang
POINTS1.5: Building a Vision-Language Model towards Real World Applications
Yuan Liu, Le Tian, Xiao Zhou, Xinyu Gao, Kavio Yu, Yang Yu, Jie Zhou
Can transformative AI shape a new age for our civilization?: Navigating between speculation and reality
Jesus L. Lobo, Javier Del Ser
RealOSR: Latent Unfolding Boosting Diffusion-based Real-world Omnidirectional Image Super-Resolution
Xuhan Sheng, Runyi Li, Bin Chen, Weiqi Li, Xu Jiang, Jian Zhang
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Xi Chen, Zhifei Zhang, He Zhang, Yuqian Zhou, Soo Ye Kim, Qing Liu, Yijun Li, Jianming Zhang, Nanxuan Zhao, Yilin Wang, Hui Ding, Zhe Lin, Hengshuang Zhao
Scaling Sequential Recommendation Models with Transformers
Pablo Zivic, Hernan Vazquez, Jorge Sanchez
RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Resolution
Jiangang Wang, Qingnan Fan, Jinwei Chen, Hong Gu, Feng Huang, Wenqi Ren