Abstract

The intelligent data-driven screening of pathological voice signals is a non-invasive and real-time tool for computer-aided diagnosis that has attracted increasing attention from researchers and clinicians. In this paper, the authors propose multi-domain features and the hierarchical extreme learning machine (H-ELM) for the automatic identification of voice disorders. A sufficient number of sensitive features are first extracted from the original voice signal through multi-domain feature extraction (i.e., features of the time domain and the sample entropy based on ensemble empirical mode decomposition and gammatone frequency cepstral coefficients). To eliminate redundancy in high-dimensional features, neighborhood component analysis is then applied to filter out sensitive features from the high-dimensional feature vectors to improve the efficiency of network training and reduce overfitting. The sensitive features thus obtained are then used to train the H-ELM for pathological voice classification. The results of the experiments showed that the sensitivity, specificity, F1 score, and accuracy of the H-ELM were 99.37%, 98.61%, 99.37%, and 98.99%, respectively. Therefore, the proposed method is feasible for the initial classification of pathological voice signals.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.