Acoustic Context

Acoustic context, the influence of surrounding sounds on the perception and processing of a target sound, is a crucial area of research impacting diverse fields like speech recognition, audio captioning, and sound synthesis. Current research focuses on developing models, often employing neural networks (including transformers and attention-based architectures), to effectively incorporate contextual information across various timescales and modalities (audio-visual, audio-textual). This work aims to improve the accuracy and efficiency of audio processing tasks, leading to advancements in applications such as more natural-sounding speech synthesis, improved speech recognition in noisy environments, and more realistic audio rendering in virtual and augmented reality.

Papers