Surgical Action Triplet

Surgical action triplets describe a surgical activity as a combination of an instrument, a verb (the action), and a target (the anatomy acted on), giving a fine-grained account of surgical workflow that AI assistance systems can build on. Current research develops robust, accurate methods for two related tasks on surgical video: recognizing which triplets occur and detecting where they occur. Techniques in use include diffusion models, transformer networks, and attention-based temporal fusion, applied to the difficulties of associating the three components with one another and localizing them. The aim is context-aware decision support in surgery, which could improve safety and efficiency in computer-assisted intervention.
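As a rough illustration of the triplet representation described above, the following Python sketch models a triplet as an (instrument, verb, target) record and separates the two tasks: detection attaches a location and a confidence to each triplet, while recognition only reports which triplets are present. All names here (`Triplet`, `TripletDetection`, `recognize`), the box convention, the 0.5 threshold, and the example labels are assumptions made for illustration; they are not taken from any particular paper or dataset.

```python
from dataclasses import dataclass
from typing import Iterable, Set, Tuple


@dataclass(frozen=True)
class Triplet:
    """One surgical action triplet: instrument, verb (action), target (anatomy)."""
    instrument: str
    verb: str
    target: str


@dataclass(frozen=True)
class TripletDetection:
    """A triplet localized in a video frame (hypothetical output format)."""
    triplet: Triplet
    box: Tuple[float, float, float, float]  # assumed (x, y, width, height), normalized to [0, 1]
    score: float                            # assumed model confidence in [0, 1]


def recognize(detections: Iterable[TripletDetection], threshold: float = 0.5) -> Set[Triplet]:
    """Reduce detections to the set of triplets present in the frame.

    Recognition discards location: it keeps each triplet whose score meets
    the (assumed) threshold and drops the bounding boxes.
    """
    return {d.triplet for d in detections if d.score >= threshold}


# Illustrative labels only; real label vocabularies depend on the dataset.
frame_detections = [
    TripletDetection(Triplet("grasper", "retract", "gallbladder"), (0.10, 0.20, 0.30, 0.30), 0.90),
    TripletDetection(Triplet("hook", "dissect", "gallbladder"), (0.50, 0.50, 0.20, 0.20), 0.30),
]
present = recognize(frame_detections)
```

In this sketch only the first detection survives the threshold, so `present` contains a single triplet. Keeping the triplet immutable and hashable (`frozen=True`) is a design choice that lets recognition results be compared as plain sets.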

Papers