Abstract

Detection of dysphonia is useful for monitoring the progression of phonatory impairment for patients with Parkinson’s disease (PD), and also helps assess the disease severity. This paper describes the statistical pattern analysis methods to study different vocal measurements of sustained phonations. The feature dimension reduction procedure was implemented by using the sequential forward selection (SFS) and kernel principal component analysis (KPCA) methods. Four selected vocal measures were projected by the KPCA onto the bivariate feature space, in which the class-conditional feature densities can be approximated with the nonparametric kernel density estimation technique. In the vocal pattern classification experiments, Fisher’s linear discriminant analysis (FLDA) was applied to perform the linear classification of voice records for healthy control subjects and PD patients, and the maximum a posteriori (MAP) decision rule and support vector machine (SVM) with radial basis function kernels were employed for the nonlinear classification tasks. Based on the KPCA-mapped feature densities, the MAP classifier successfully distinguished 91.8% voice records, with a sensitivity rate of 0.986, a specificity rate of 0.708, and an area value of 0.94 under the receiver operating characteristic (ROC) curve. The diagnostic performance provided by the MAP classifier was superior to those of the FLDA and SVM classifiers. In addition, the classification results indicated that gender is insensitive to dysphonia detection, and the sustained phonations of PD patients with minimal functional disability are more difficult to be correctly identified.

Highlights

  • Dysphonia is a type of phonation disorder with an impairment in the ability to produce normal voice sounds [1]

  • It can be observed that majority (82.6%) of the Parkinson’s disease (PD) patients underwent the intermediate course of the disease (1,Hoehn and Yahr (H&Y),3.5)

  • The difference of mean values between CO and PD records for multidimensional voice program (MDVP): F0 is with a much larger order of magnitudes than that for MDVP: Jitter (%), both of these two measurements present the vocal perturbations in fundamental frequency

Read more

Summary

Introduction

Dysphonia is a type of phonation disorder with an impairment in the ability to produce normal voice sounds [1]. Dysphonia is detrimental to quality of life, because the speech impaired patient often encounters difficulty in personal communication that leads to depression and further social handicap [4]. Ho et al [5] utilized the clinical-perceptual method to study the speech difficulties in PD. They sampled the two-minute conversational speech of 200 PD patients, and examined the speech deficit profiles (i.e., voice, articulation, and fluency) [5]. Their study showed that voice was the leading deficit in the initial stage of PD, and articulatory and fluency deficits manifested in the severe stage of PD [5]

Objectives
Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.