Acoustic Scene Classification
Acoustic scene classification (ASC) aims to automatically identify the environment of an audio recording based on its acoustic characteristics. Current research heavily focuses on improving data efficiency and model efficiency, often employing convolutional neural networks (CNNs), spectrogram transformers (ASTs), and knowledge distillation techniques to achieve high accuracy with limited training data and computational resources. ASC advancements have significant implications for various applications, including environmental monitoring, assistive technologies, and content verification, by enabling robust and efficient audio analysis in diverse real-world settings.
Papers
Instance-level loss based multiple-instance learning framework for acoustic scene classification
Won-Gook Choi, Joon-Hyuk Chang, Jae-Mo Yang, Han-Gil Moon
A Squeeze-and-Excitation and Transformer based Cross-task System for Environmental Sound Recognition
Jisheng Bai, Jianfeng Chen, Mou Wang, Muhammad Saad Ayub