Abstract

Auditory-perceptual assessment is the gold standard for evaluating voice quality. This project aims to develop a machine-learning model for measuring the perceptual dysphonia severity of audio samples consistent with assessments by expert raters. Samples from the Perceptual Voice Qualities Database were used, including sustained vowels and Consensus Auditory-Perceptual Evaluation of Voice sentences, which had previously been expertly rated on a 0-100 scale. The OpenSMILE (audEERING GmbH, Gilching, Germany) toolkit was used to extract acoustic (Mel-Frequency Cepstral Coefficient-based, n=1428) and prosodic (n=152) features, pitch onsets, and recording duration. We utilized a support vector machine and these features (n=1582) for automated assessment of dysphonia severity. Recordings were separated into vowels (V) and sentences (S), and features were extracted separately from each. Final voice quality predictions were made by combining the features extracted from the individual components with those from the whole audio (WA) sample (three file sets: S, V, WA). This algorithm has a high correlation (r=0.847) with the estimates of expert raters. The root mean square error was 13.36. Increasing signal complexity resulted in better estimation of dysphonia, whereby combining the feature sets outperformed the WA, S, and V sets individually. A novel machine-learning algorithm was able to produce perceptual estimates of dysphonia severity from standardized audio samples on a 100-point scale, and these estimates were highly correlated with expert ratings. This suggests that ML algorithms could offer an objective method for evaluating voice samples for dysphonia severity.
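The pipeline described above (OpenSMILE functional features from the V, S, and WA file sets, concatenated and fed to a support vector machine regressor targeting the 0-100 expert ratings) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the feature set (ComParE_2016 functionals stands in for the MFCC-based and prosodic configurations reported in the abstract), the file lists, the SVR hyperparameters, and the cross-validation scheme are all assumptions.

```python
# Hypothetical sketch of the feature-extraction + SVM-regression pipeline.
# Placeholder paths and hyperparameters are assumptions, not the study's setup.
import numpy as np
import opensmile
from scipy.stats import pearsonr
from sklearn.model_selection import cross_val_predict
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

# Placeholder, speaker-aligned file lists and expert severity ratings (0-100).
vowel_paths = ["v_001.wav", "v_002.wav"]       # sustained-vowel (V) recordings
sentence_paths = ["s_001.wav", "s_002.wav"]    # CAPE-V sentence (S) recordings
whole_paths = ["wa_001.wav", "wa_002.wav"]     # whole-audio (WA) recordings
severity = [12.0, 64.0]                        # expert ratings per speaker

# OpenSMILE functional extractor; ComParE_2016 is a stand-in feature set.
smile = opensmile.Smile(
    feature_set=opensmile.FeatureSet.ComParE_2016,
    feature_level=opensmile.FeatureLevel.Functionals,
)

def extract(paths):
    """Return one functional feature vector per recording."""
    return np.vstack([smile.process_file(p).to_numpy().ravel() for p in paths])

# Combine features from the three file sets into a single design matrix.
X = np.hstack([extract(vowel_paths), extract(sentence_paths), extract(whole_paths)])
y = np.asarray(severity, dtype=float)

# SVM regressor predicting perceptual dysphonia severity on the 0-100 scale.
model = make_pipeline(StandardScaler(), SVR(kernel="rbf", C=10.0, epsilon=1.0))
pred = cross_val_predict(model, X, y, cv=2)

# Agreement with expert ratings: Pearson correlation and RMSE.
r, _ = pearsonr(y, pred)
rmse = float(np.sqrt(np.mean((y - pred) ** 2)))
print(f"Pearson r = {r:.3f}, RMSE = {rmse:.2f}")
```

In practice the three per-speaker feature vectors would be concatenated exactly as above, so that the model sees the V, S, and WA representations jointly; the abstract reports that this combined set outperformed any single file set.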
