VQA Benchmark
Visual Question Answering (VQA) benchmarks evaluate the ability of artificial intelligence models to understand and respond to questions about images. Current research focuses on improving model robustness to variations in question phrasing and answer formats, enhancing reasoning capabilities through retrieval-augmented architectures and modular designs, and addressing biases stemming from language priors. These advancements aim to create more reliable and explainable VQA systems, impacting fields like healthcare (through analysis of medical images) and remote sensing (by enabling efficient image interpretation), while also furthering our understanding of multimodal learning.
Papers
November 4, 2024
October 16, 2024
September 26, 2024
August 30, 2024
April 22, 2024
December 21, 2023
November 10, 2023
October 12, 2023
September 5, 2023
July 26, 2023
June 13, 2023
June 1, 2023
May 31, 2023
May 17, 2023
August 3, 2022