Audio Visual
Audio-visual research focuses on understanding and leveraging the interplay between audio and visual information, primarily aiming to improve multimodal understanding and generation. Current research emphasizes developing sophisticated models, often employing transformer architectures and diffusion models, to achieve tasks like video-to-audio generation, audio-visual speech recognition, and emotion analysis from combined audio-visual data. This field is significant for its potential applications in various domains, including media production, accessibility technologies, and even mental health diagnostics, by enabling more robust and nuanced analysis of multimedia content.
Papers
November 2, 2022
October 29, 2022
October 28, 2022
October 27, 2022
October 17, 2022
October 13, 2022
October 11, 2022
October 4, 2022
October 2, 2022
September 27, 2022
September 11, 2022
September 9, 2022
August 3, 2022
July 29, 2022
July 27, 2022
July 16, 2022
July 9, 2022
July 7, 2022
June 30, 2022