Surgical Visual Question Answering
Surgical Visual Question Answering (Surgical-VQA) aims to develop AI systems that can answer questions about surgical procedures based on visual input from videos or images, assisting surgeons and trainees. Current research focuses on improving the accuracy and robustness of these systems, particularly by incorporating advanced architectures like transformers and large vision-language models, and by addressing challenges such as multimodal information fusion and visual grounding of answers. This field is significant because it has the potential to improve surgical education, enhance intraoperative decision-making, and ultimately lead to better patient outcomes by providing readily available expert-level information.
Papers
August 9, 2024
May 22, 2024
March 22, 2024
December 15, 2023
July 22, 2023
July 11, 2023
May 19, 2023