Abstract

Doctors nowadays primarily rely on auditory-perceptual evaluation, such as the grade, roughness, breathiness, asthenia, and strain (GRBAS) scale, to assess voice quality and determine treatment. However, ratings often differ between individual physicians because of subjective perception and the interval between diagnoses, especially when a patient's symptoms are difficult to judge. An accurate computerized pathological voice quality assessment system would therefore improve the quality of assessment. This study proposes a deep-learning-based system named self-attention-based bidirectional long short-term memory (SA BiLSTM). Recordings at different pitches (low, normal, high) and of different vowels (/a/, /i/, /u/) were fed into the proposed model so that it could learn, from a high-dimensional view, how professional doctors rate the grade, roughness, breathiness, asthenia, and strain scales. The experimental results showed that the proposed system outperformed the baseline systems. More specifically, the macro-averaged F1 score, expressed as a decimal, was used to compare classification accuracy. For the G, R, and B scales, the proposed system achieved (0.768±0.011, 0.820±0.009, and 0.815±0.009), higher than the baseline systems: a deep neural network (0.395±0.010, 0.312±0.019, 0.321±0.014) and a convolutional neural network (0.421±0.052, 0.306±0.043, 0.325±0.032), respectively. The proposed SA BiLSTM system, combined with multiple pitches and vowels, provides a more accurate way to evaluate the voice. This will be helpful for clinical voice evaluation and will improve the benefit patients obtain from voice therapy.
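For orientation only, the following is a minimal sketch of what a self-attention BiLSTM classifier of this kind might look like in PyTorch. It assumes frame-level acoustic features (e.g. 39-dim MFCCs) as input and a 4-class output head for one perceptual scale; the feature set, layer sizes, attention configuration, and training details are assumptions, not the authors' published implementation.

```python
import torch
import torch.nn as nn

class SABiLSTM(nn.Module):
    """Hypothetical self-attention BiLSTM for one GRBAS-style scale."""
    def __init__(self, feat_dim=39, hidden=128, n_heads=4, n_classes=4):
        super().__init__()
        # Bidirectional LSTM encodes the frame sequence in both directions.
        self.bilstm = nn.LSTM(feat_dim, hidden, batch_first=True,
                              bidirectional=True)
        # Self-attention lets every frame attend to every other frame,
        # giving an utterance-level, high-dimensional view of the voice.
        self.attn = nn.MultiheadAttention(embed_dim=2 * hidden,
                                          num_heads=n_heads,
                                          batch_first=True)
        # Linear head maps the pooled representation to class logits.
        self.head = nn.Linear(2 * hidden, n_classes)

    def forward(self, x):            # x: (batch, frames, feat_dim)
        h, _ = self.bilstm(x)        # (batch, frames, 2*hidden)
        a, _ = self.attn(h, h, h)    # self-attention over frames
        pooled = a.mean(dim=1)       # average pooling over time
        return self.head(pooled)     # (batch, n_classes) logits

# Example: a batch of 8 utterances, 300 frames of 39-dim features each.
model = SABiLSTM()
logits = model(torch.randn(8, 300, 39))
print(logits.shape)  # torch.Size([8, 4])
```

The macro-averaged F1 reported in the abstract can be computed from predictions with standard tooling, e.g. scikit-learn's `f1_score(y_true, y_pred, average="macro")`, which averages the per-class F1 scores with equal weight per class.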
