Vision Task
Vision tasks span image and video analysis across diverse applications and are a central focus of computer vision research. Current efforts concentrate on improving model efficiency and robustness, particularly through multi-task learning, novel architectures such as Vision Transformers and state-space models, and human feedback that aligns model behavior more closely with user preferences. These advances are driving progress in areas such as image compression for machine learning pipelines, multi-image understanding, and the creation of more robust and fairer models for real-world deployment.