Sound Event
Sound event detection (SED) focuses on identifying the types and temporal locations of sounds within audio recordings, a crucial task with applications in various fields. Current research emphasizes improving SED robustness to overlapping sounds and noisy environments, often employing deep learning models like convolutional recurrent neural networks (CRNNs) and transformers, along with techniques such as audio source separation and multi-modal data fusion (audio-visual). These advancements are driving progress in areas like smart home monitoring, environmental monitoring, and assistive technologies for older adults, highlighting the growing importance of accurate and efficient sound event analysis.
Papers
Few-shot bioacoustic event detection at the DCASE 2023 challenge
Ines Nolasco, Burooj Ghani, Shubhr Singh, Ester Vidaña-Vila, Helen Whitehead, Emily Grout, Michael Emmerson, Frants Jensen, Ivan Kiskin, Joe Morford, Ariana Strandburg-Peshkin, Lisa Gill, Hanna Pamuła, Vincent Lostanlen, Dan Stowell
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji