Abstract

Speech signals often include paralinguistic features such as pathologies that impair a speaker’s capability to communicate. Those cognitive symptoms have various causes depending on the disease. For example, morphological diseases like cleft lip and palate create hypernasality, while neurodegenerative conditions like Parkinson’s disease cause hypokinetic dysarthria. Automatic assessment of abnormal speech supports early diagnosis or disease severity evaluation. Conventional methods rely on manually assessing single aspects like shimmer, jitter, or formant frequencies, which may not fully reflect the disease’s manifestations. In this paper, we use deep convolutional neural networks (DCNNs) to recognize disordered speech. Despite DCNNs’ many approved benefits, selecting the best structure is challenging. In order to overcome this issue, this research looks into using the chimp optimization algorithm (ChOA) to automatically select the optimal DCNN structure. In order to achieve the goal, three ChOA-based advancements are proposed. First, an internet protocol address-based (IPA-based) encoding method for DCNN layers employing chimp vectors is created. Then an Enfeebled layer with specified chimp vector dimensions is presented for variable-length DCNNs. Eventually, large datasets are partitioned into smaller ones and evaluated at random to recognize abnormal speech signals from patients with Parkinson’s disease and cleft lip and palate. In addition to receiver operating characteristic (ROC) and precision-recall curves, five well-known metrics were used: sensitivity, specificity, accuracy, precision, F1-Score. The proposed model accurately diagnoses disordered and normal speech signals, with an accuracy of up to 96.37%, which is 1.62 more accurate than the second-best approach, i.e., VLNSGA-II.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call