Shot Localization
Shot localization, the task of identifying the location of objects or events within an image or video based on textual or other cues, is a rapidly evolving field driven by the need for more robust and efficient methods. Current research emphasizes zero-shot and few-shot learning approaches, often employing transformer-based architectures and leveraging pre-trained vision-language models like CLIP, to minimize reliance on large labeled datasets. This work is significant for its potential applications in diverse areas such as image manipulation detection, embodied AI, and accessibility technologies for visually impaired individuals, improving the accuracy and efficiency of object localization across various scenarios.
Papers
November 18, 2024
October 5, 2024
March 19, 2024
March 11, 2024
December 6, 2023
December 1, 2023
October 22, 2023
April 26, 2023
July 24, 2022
April 7, 2022