Paper ID: 2202.12957

Deep Neural Network for Automatic Assessment of Dysphonia

Mario Alejandro García, Ana Lorena Rosset

The purpose of this work is to contribute to the understanding and improvement of deep neural networks in the field of vocal quality. A neural network is obtained that predicts the perceptual assessment of the overall severity of dysphonia on the GRBAS scale. The design focuses on amplitude perturbations, frequency perturbations, and noise. Results are compared with the performance of human raters on the same data. Both the precision and the mean absolute error of the neural network are close to human intra-rater performance and exceed human inter-rater performance.
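
As a rough illustration of the comparison described above, the following sketch computes a mean absolute error and an exact-agreement rate between two sets of ordinal grades on a 0 to 3 scale, as used for the overall grade in GRBAS. It is not the authors' evaluation code: the function names and the example ratings are hypothetical, and treating "precision" as the fraction of exactly matching grades is an assumption.

```python
from typing import Sequence


def mean_absolute_error(a: Sequence[int], b: Sequence[int]) -> float:
    """Average absolute difference between two equally long rating lists."""
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("rating lists must be non-empty and of equal length")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def exact_agreement(a: Sequence[int], b: Sequence[int]) -> float:
    """Fraction of items on which the two rating lists assign the same grade."""
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("rating lists must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


# Hypothetical grades (0 = normal ... 3 = severe); NOT data from the paper.
reference = [0, 1, 2, 3, 1, 2]   # e.g. one rating session by a human rater
predicted = [0, 1, 1, 3, 2, 2]   # e.g. network output or a second session

mae = mean_absolute_error(reference, predicted)
agreement = exact_agreement(reference, predicted)
print(f"MAE = {mae:.3f}, exact agreement = {agreement:.3f}")
```

Under these assumptions, the same two functions could be applied to network-versus-rater, rater-versus-self (intra-rater), and rater-versus-rater (inter-rater) pairs, so that all three comparisons are expressed in the same units.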

Submitted: Feb 25, 2022