Temporal Localization
Temporal localization identifies the precise time intervals in which events or actions occur within a video, often in response to a natural language query. Current research aims to improve both accuracy and efficiency, using transformer-based architectures, multimodal large language models (MLLMs), and methods that combine visual and textual information for more robust localization. The field is central to video understanding and enables applications such as automated video summarization, content moderation, and assistive technologies for visually impaired people.
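To make "identifying precise time intervals" concrete, predicted intervals are commonly scored against annotated ones with temporal intersection-over-union (IoU). The following is a minimal sketch, not tied to any particular paper or benchmark; the function names `temporal_iou` and `recall_at_iou`, the `(start, end)` tuple layout in seconds, and the 0.5 threshold are illustrative assumptions.

```python
def temporal_iou(pred, gt):
    """IoU of two time intervals, each given as (start, end) in seconds."""
    (pred_start, pred_end), (gt_start, gt_end) = pred, gt
    # Overlap is zero when the intervals do not intersect.
    intersection = max(0.0, min(pred_end, gt_end) - max(pred_start, gt_start))
    union = (pred_end - pred_start) + (gt_end - gt_start) - intersection
    return intersection / union if union > 0 else 0.0


def recall_at_iou(preds, gts, threshold=0.5):
    """Fraction of queries whose predicted interval reaches the IoU threshold.

    Assumes one prediction per ground-truth interval, paired by position.
    """
    if not gts:
        return 0.0
    hits = sum(temporal_iou(p, g) >= threshold for p, g in zip(preds, gts))
    return hits / len(gts)


# Hypothetical example: two queries, one localized exactly, one missed.
predictions = [(0.0, 10.0), (0.0, 5.0)]
ground_truth = [(0.0, 10.0), (10.0, 15.0)]
score = recall_at_iou(predictions, ground_truth, threshold=0.5)
```

Here the first prediction matches its annotation exactly and the second does not overlap at all, so half of the queries count as correctly localized at this threshold.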