Abstract

In voice research and clinical assessment, many objective parameters are in use. However, there is no commonly used set of parameters that reflects certain voice disorders, such as functional dysphonia (FD), i.e. disorders with no visible anatomical changes. Hence, 358 high-speed videoendoscopy (HSV) recordings (159 normal females (NF), 101 FD females (FDF), 66 normal males (NM), 32 FD males (FDM)) were analyzed. We investigated the significance of 91 quantitative HSV parameters. First, 25 highly correlated parameters were discarded. Second, a further 54 parameters were discarded using a LogitBoost decision stumps approach. This yielded a subset of 12 parameters sufficient to reflect functional dysphonia. These parameters separated the groups NF vs. FDF and NM vs. FDM with fair accuracies of 0.745 and 0.768, respectively. Parameters computed solely from the changing glottal area waveform (GAW, a 1D function) between the vocal folds were less important than parameters describing the oscillation characteristics along the vocal folds (Phonovibrogram, a 2D function). Regularity of GAW phases and peak shape, harmonic structure, and Phonovibrogram-based vocal fold opening and closing angles were of primary importance. This study showed the high degree of redundancy among HSV voice parameters and also affirms the need for multidimensional assessment of clinical data.
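The first reduction step, discarding highly correlated parameters, can be illustrated with a minimal sketch. This is not the authors' implementation: the greedy keep-or-drop strategy, the threshold of 0.9, the helper names `pearson` and `drop_correlated`, and the toy values are all assumptions made for illustration, since the abstract does not state how the correlation criterion was applied.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        # Constant column: correlation is undefined; treated as 0 here (assumption).
        return 0.0
    return cov / sqrt(vx * vy)


def drop_correlated(columns, threshold=0.9):
    """Greedily keep a parameter only if its absolute correlation with
    every already-kept parameter does not exceed the threshold.

    `columns` maps a parameter name to its list of values per recording.
    The threshold value is hypothetical, not taken from the study.
    """
    kept = []
    for name, values in columns.items():
        if all(abs(pearson(values, columns[k])) <= threshold for k in kept):
            kept.append(name)
    return kept


# Toy values for three hypothetical parameters (not data from the study).
toy = {
    "param_a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "param_b": [2.1, 4.0, 6.2, 7.9, 10.1],  # nearly 2 * param_a, so redundant
    "param_c": [3.0, 1.0, 4.0, 1.0, 5.0],
}
selected = drop_correlated(toy)
print(selected)
```

With a greedy filter like this, the outcome depends on the order in which parameters are visited, so a real analysis would need a documented rule for which member of a correlated pair to retain.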

Highlights

  • Better performance in separating healthy and FD subjects might have been achieved if more parameters from more signal sources, e.g. simultaneously recorded audio, had been investigated in this work.

  • Because the healthy and the disordered groups cover different age ranges, results could have been influenced by subject age.

  • An influence of subject age on different signal types and voice parameters is well documented in the literature [52, 53, 54].


Summary

Introduction

Average values and standard deviations of parameters are both relevant. Regularity of GAW phases (SQ) and peak shape (PQ), harmonic structure (SNRK), and the regularity and average values of different contour angles are of primary importance. Better performance in separating healthy and FD subjects might have been achieved if more parameters from more signal sources, e.g. simultaneously recorded audio, had been investigated in this work. Because the healthy and the disordered groups cover different age ranges, results could have been influenced by subject age.


