Frame Wise
Frame-wise analysis focuses on extracting meaningful information from individual frames within sequences, such as videos or audio recordings, to improve various downstream tasks. Current research emphasizes leveraging large language models and transformer architectures to enhance feature extraction and contextual understanding, often incorporating techniques like contrastive learning and temporal modeling to capture both local and global relationships within the data. This approach is proving valuable across diverse applications, including improving the accuracy of action recognition, sound event detection, and video retrieval, while also streamlining tasks like clinical trial analysis and sign language recognition.
Papers
September 15, 2024
July 30, 2024
April 12, 2024
March 4, 2024
September 15, 2023
September 12, 2023
August 7, 2023
July 10, 2023
May 31, 2023
March 16, 2023
March 15, 2023
January 18, 2023
January 5, 2023
April 7, 2022
March 6, 2022