Dense Vision Task
Dense vision tasks, encompassing pixel-level predictions like semantic segmentation and depth estimation, aim to extract rich spatial information from images. Current research focuses on developing efficient and generalizable models, employing architectures such as vision transformers and diffusion models, often incorporating multi-task learning and parameter-efficient fine-tuning techniques to handle diverse tasks with limited data. These advancements are crucial for improving the accuracy and efficiency of various applications, including robotics, medical imaging, and autonomous driving, by enabling more robust and versatile visual perception systems.
Papers
November 7, 2024
October 30, 2024
June 29, 2024
March 26, 2024
March 15, 2024
February 4, 2024
December 19, 2023
December 2, 2023
October 3, 2023
October 2, 2023
October 1, 2023
August 23, 2023
March 30, 2023
March 26, 2023
November 15, 2022