Acoustic Model
Acoustic modeling focuses on representing and processing speech sounds for applications like speech recognition, synthesis, and emotion analysis. Current research emphasizes improving model robustness to noise and diverse acoustic conditions, exploring architectures like Transformers and convolutional neural networks, and developing techniques for efficient training and adaptation, including unsupervised and transfer learning methods. These advancements are crucial for enhancing the accuracy and reliability of speech-based technologies across various languages and applications, particularly in low-resource settings and healthcare.
Papers
Acoustic Model Fusion for End-to-end Speech Recognition
Zhihong Lei, Mingbin Xu, Shiyi Han, Leo Liu, Zhen Huang, Tim Ng, Yuanyuan Zhang, Ernest Pusateri, Mirko Hannemann, Yaqiao Deng, Man-Hung Siu
Sound-skwatter (Did You Mean: Sound-squatter?) AI-powered Generator for Phishing Prevention
Rodolfo Valentim, Idilio Drago, Marco Mellia, Federico Cerutti