Speech Intelligibility Prediction Model

Speech intelligibility prediction models aim to automatically assess how well a person understands spoken words, often in challenging acoustic conditions. Current research focuses on developing robust, non-intrusive models—meaning they don't require modifications to the speech signal—using deep learning architectures like multi-branched and multi-task networks, often incorporating features from pre-trained speech recognition models and metadata. These advancements are crucial for improving hearing aid technology and streamlining the evaluation of speech enhancement algorithms, ultimately leading to more effective and personalized assistive listening devices.

Papers

October 20, 2023

Intelligibility prediction with a pretrained noise-robust automatic speech recognition model
Zehai Tu, Ning Ma, Jon Barker
Speech Recognition Noisy Speech Intelligibility Prediction Speech Intelligibility Prediction Model

September 18, 2023

Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata
Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao
Cross Domain Speech Intelligibility State of the Art Whisper Metadata Information Speech Intelligibility Prediction Model

April 7, 2022

Speech Intelligibility Prediction Model

Papers

Intelligibility prediction with a pretrained noise-robust automatic speech recognition model

Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata

MTI-Net: A Multi-Target Speech Intelligibility Prediction Model

MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids