Vision Task
Vision tasks, encompassing image and video analysis for diverse applications, are a central focus in computer vision research. Current efforts concentrate on improving model efficiency and robustness, particularly through multi-task learning, the development of novel architectures like Vision Transformers and state-space models, and the incorporation of human feedback for improved alignment with user preferences. These advancements are driving progress in areas such as image compression for machine learning pipelines, multi-image understanding, and the creation of more robust and fair models for real-world deployment.
Papers
March 4, 2024
March 3, 2024
March 2, 2024
March 1, 2024
February 26, 2024
February 25, 2024
February 21, 2024
February 16, 2024
February 7, 2024
February 1, 2024
January 25, 2024
January 23, 2024
January 16, 2024
January 11, 2024
January 9, 2024
December 27, 2023
December 23, 2023
December 22, 2023
December 14, 2023
December 5, 2023