Audio Separation

Audio separation aims to isolate individual sound sources from a mixture, a crucial task with applications ranging from music production to environmental monitoring. Current research heavily focuses on developing robust models capable of handling open-world scenarios with varying numbers of unseen sources, employing techniques like large language models for source identification and U-Net architectures for spectrographic segmentation. These advancements leverage both supervised and weakly-supervised learning approaches, including innovative methods using multimodal data (audio-visual or audio-text) to improve separation accuracy and generalization. The resulting improvements in separation quality have significant implications for various fields, including music processing, speech enhancement, and anomaly detection.

Papers