Vision-and-Language Navigation
Vision-and-Language Navigation (VLN) studies how agents can navigate 3D environments by following natural language instructions, with the aim of bridging visual perception and linguistic understanding. Current research emphasizes improving model efficiency (e.g., through knowledge distillation), exploring zero-shot navigation with large language models (LLMs), incorporating safety mechanisms, and addressing challenges such as instruction errors and robustness to environmental changes. The field is significant for advancing embodied AI and has potential applications in robotics, autonomous systems, and human-computer interaction.