Audio Visual
Audio-visual research focuses on understanding and leveraging the interplay between audio and visual information, primarily aiming to improve multimodal understanding and generation. Current research emphasizes developing sophisticated models, often employing transformer architectures and diffusion models, to achieve tasks like video-to-audio generation, audio-visual speech recognition, and emotion analysis from combined audio-visual data. This field is significant for its potential applications in various domains, including media production, accessibility technologies, and even mental health diagnostics, by enabling more robust and nuanced analysis of multimedia content.
Papers
August 10, 2023
August 9, 2023
August 4, 2023
August 1, 2023
July 31, 2023
July 25, 2023
July 18, 2023
July 4, 2023
June 29, 2023
June 27, 2023
June 15, 2023
June 6, 2023
June 4, 2023
June 2, 2023
May 30, 2023
May 25, 2023
May 24, 2023
May 22, 2023