Audio Object
Audio object research focuses on representing and manipulating individual sound sources within complex audio scenes, aiming to improve audio processing and analysis. Current research emphasizes the development of models that can separate and identify these objects, leveraging techniques like transformers and non-negative matrix factorization (NMF) for both audio-only and audio-visual processing. This work has significant implications for applications ranging from improved audio broadcasting and personalized listening experiences to medical diagnostics, such as early dementia detection using voice biomarkers. The ability to accurately isolate and interpret individual audio objects promises advancements across diverse fields.
Papers
January 31, 2024
October 25, 2023
May 12, 2023
May 11, 2023
March 20, 2023
July 12, 2022