Time Domain Audio Separation Network
Time-domain audio separation networks (TasNets) are deep learning models designed to separate individual sound sources from a mixture, focusing on processing audio waveforms directly rather than spectral representations. Current research emphasizes improving efficiency and real-time capabilities, exploring architectures like conformers and modified U-Nets, and incorporating multi-channel and even brain-activity data for enhanced performance. These advancements have significant implications for applications such as hearing aids, music production, speech recognition, and marine biodiversity monitoring, offering improved accuracy and reduced computational demands compared to previous methods.
Papers
February 27, 2024
October 9, 2023
September 29, 2023
May 17, 2023
December 14, 2022
October 28, 2022
October 27, 2022
January 13, 2022