Speech Intelligibility Prediction Model
Speech intelligibility prediction models aim to automatically assess how well a person understands spoken words, often in challenging acoustic conditions. Current research focuses on developing robust, non-intrusive models—meaning they don't require modifications to the speech signal—using deep learning architectures like multi-branched and multi-task networks, often incorporating features from pre-trained speech recognition models and metadata. These advancements are crucial for improving hearing aid technology and streamlining the evaluation of speech enhancement algorithms, ultimately leading to more effective and personalized assistive listening devices.
Papers
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
Ryandhimas E. Zezario, Szu-wei Fu, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao