Dysarthria, a motor speech disorder, poses challenges in accurate severity assessment. Recent research has excelled in classifying dysarthria based on severity levels, primarily utilizing annotated datasets and achieving high accuracies. However, these classification-based approaches may not readily translate to real-world scenarios without predefined labels. This study follows a different path by proposing a two-stage approach leveraging binary classification and clustering to comprehensively analyze and classify dysarthria severity levels. We begin by employing binary classification to differentiate control from dysarthric cases by experiencing eight different feature extraction techniques and two classifiers in order to support the largest amount of dysarthric cases to the second stage, where k-means clustering uncovers hidden patterns and boundaries within dysarthria severity levels, enabling a more nuanced understanding of the disorder. We applied our methodology to the TORGO dataset, a benchmark in dysarthria research, and evaluated it on the UA Speech dataset. After optimizing the number of clusters, our approach achieved an accuracy of 91% with sentence-based features and 85% with word-based features in clustering. This research extends previous studies by exploring unsupervised clustering to differentiate severity levels in unannotated cases, bridging the gap between controlled datasets and practical applications. Our findings highlight the effectiveness of clustering-driven two-stage analysis in improving dysarthria severity-level classification, with implications for real-world clinical settings.
Read full abstract