Abstract

More than 90% of patients with Parkinson’s disease suffer from hypokinetic dysarthria. This paper proposes a novel end-to-end deep learning model for Parkinson’s disease detection from speech signals. The proposed model extracts time series dynamic features using time-distributed two-dimensional convolutional neural networks (2D-CNNs), and then captures the dependencies between these time series using a one-dimensional CNN (1D-CNN). The performance of the proposed model was verified on two databases. On Database-1, the proposed model outperformed expert features-based machine learning models and achieved promising results, showing accuracies of 81.6% on the speech task of sustained vowel /a/ and 75.3% on the speech task of reading a short sentence (/si shi si zhi shi shi zi/) in Chinese. On Database-2, the proposed model was assessed on multiple sound types, including vowels, words, and sentences. An accuracy of up to 92% was obtained on the speech tasks, which included reading simple (/loslibros/) and complex (/viste/) sentences in Spanish. By visualizing the features generated by the model, it was found that the learned time series dynamic features are able to capture the characteristics of the reduced overall frequency range and reduced variability of Parkinson’s disease sounds, which are important clinical evidence for detecting Parkinson’s disease patients. The results also suggest that the low-frequency region of the Mel-spectrogram is more influential and important than the high-frequency region for Parkinson’s disease detection from speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.