Multi-Scale Recurrence Quantification Measurements for Voice Disorder Detection

Xin-Cheng Zhu, Deng-Huang Zhao, Yi-Hua Zhang, Xiao-Jun Zhang, Zhi Tao

doi:10.3390/app12189196

Open Access

https://doi.org/10.3390/app12189196

Journal: Applied Sciences
Publication Date: Sep 14, 2022
Citations: 2
License type: CC BY 4.0

Affiliation: Soochow University

Abstract

Because the voice generation system is complex and non-stationary, the nonlinearity of speech signals cannot be accurately quantified. Recurrence quantification analysis has recently been applied to voice disorder detection. This paper proposes multiscale recurrence quantification measures (MRQMs). The signals are reconstructed in a high-dimensional phase space at the equivalent rectangular bandwidth scale. Recurrence plots (RPs) that incorporate characteristics of human auditory perception are then drawn with an appropriate recurrence threshold. From the recurrence plot of each frequency channel, the nonlinear dynamic recurrence features of the speech signal are quantified. The paper also explores which recurrence quantification thresholds are most suitable for pathological voices. The results show that the proposed MRQMs, combined with support vector machine (SVM), random forest (RF), Bayesian network (BN), and locally weighted learning (LWL) classifiers, achieve an average accuracy of 99.45%, outperforming traditional features and other complexity measurements. MRQMs also show potential for multi-class classification of voice disorders, achieving an accuracy of 89.05%. This study demonstrates that MRQMs can characterize the recurrence properties of pathological voices and effectively detect voice disorders.
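
The single-channel building blocks named in the abstract (phase-space reconstruction, a thresholded recurrence plot, and recurrence quantification) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the embedding dimension `dim`, the delay `tau`, the threshold `eps`, and the two measures shown (recurrence rate and determinism) are assumptions chosen for illustration, and the equivalent rectangular bandwidth filterbank that would produce one signal per frequency channel is omitted.

```python
import numpy as np

def delay_embed(x, dim, tau):
    """Time-delay embedding: each row is one point in dim-dimensional phase space."""
    x = np.asarray(x, dtype=float)
    n = len(x) - (dim - 1) * tau
    if n <= 0:
        raise ValueError("signal too short for the chosen dim and tau")
    return np.column_stack([x[i * tau : i * tau + n] for i in range(dim)])

def recurrence_matrix(points, eps):
    """Binary recurrence plot: 1 where two states are within distance eps."""
    diff = points[:, None, :] - points[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    return (dist <= eps).astype(int)

def recurrence_rate(rp):
    """Fraction of recurrent points, excluding the main diagonal."""
    n = rp.shape[0]
    return (rp.sum() - np.trace(rp)) / (n * (n - 1))

def determinism(rp, lmin=2):
    """Share of recurrent points that lie on diagonal lines of length >= lmin."""
    n = rp.shape[0]
    on_lines = 0
    total = 0
    for k in range(1, n):  # upper-triangle diagonals only; the plot is symmetric
        diag = np.diagonal(rp, offset=k)
        total += int(diag.sum())
        run = 0
        for v in list(diag) + [0]:  # trailing 0 closes the final run
            if v:
                run += 1
            else:
                if run >= lmin:
                    on_lines += run
                run = 0
    return on_lines / total if total else 0.0

# Illustrative input: a pure sine wave standing in for one channel's output.
t = np.arange(400)
x = np.sin(2 * np.pi * t / 50)
pts = delay_embed(x, dim=3, tau=12)    # assumed embedding parameters
rp = recurrence_matrix(pts, eps=0.2)   # assumed recurrence threshold
rr = recurrence_rate(rp)
det = determinism(rp)
```

In a multiscale setting, the same three steps would be repeated on the output of each frequency channel and the resulting measures concatenated into a feature vector for a classifier.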

Full Text