Environmental Sound
Environmental sound analysis focuses on automatically classifying and interpreting sounds from various environments, aiming to improve applications ranging from biodiversity monitoring to urban noise management and assistive technologies for the hearing impaired. Current research emphasizes developing robust and efficient models, often employing convolutional neural networks (CNNs), recurrent neural networks (RNNs), and increasingly, transformer-based architectures, with a focus on handling noisy data and limited labeled datasets through techniques like data augmentation, transfer learning, and semi-supervised learning. This field is significant for its potential to automate time-consuming tasks, provide valuable insights into ecological systems and urban environments, and create innovative assistive technologies.
Papers
Synthetic training set generation using text-to-audio models for environmental sound classification
Francesca Ronchini, Luca Comanducci, Fabio Antonacci
Correlation of Fr\'echet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Modan Tailleur, Junwon Lee, Mathieu Lagrange, Keunwoo Choi, Laurie M. Heller, Keisuke Imoto, Yuki Okamoto
Infrastructure-less Localization from Indoor Environmental Sounds Based on Spectral Decomposition and Spatial Likelihood Model
Satoki Ogiso, Yoshiaki Bando, Takeshi Kurata, Takashi Okuma