Diagnosing Parkinson's disease (PD) via speech is crucial for its non-invasive and convenient data collection. However, the small sample size of PD speech data impedes accurate recognition of PD speech. Therefore, we propose a novel multi-source sparse broad transfer learning (SBTL) method, inspired by incremental broad learning, which balances model learning capability and the overfitting associated with limited sample size of PD speech data. Specifically, SBTL initially leverages a sparse network to preprocess highly overlapping PD speech data, facilitating the identification of intrinsic invariant features between the multi-source auxiliary domain and the target data, which contributes to reducing model complexity. Subsequently, SBTL evaluate transfer effectiveness by virtue of the incremental learning mechanism, adaptively adjusting model structure to ensure the positive transfer of knowledge from the multi-source auxiliary domains to the target domain. Numerous experimental results show that, compared to transfer learning methods for PD diagnosis via speech, SBTL consistently demonstrates significant advantages with a smaller standard deviation, particularly leading by at least 2.58%, 5.71%, 12%, and 14.81% in accuracy, precision, sensitivity, and F1-score, respectively. Even when compared to some well-known transfer learning methods, SBTL still exhibits significant advantages in most cases while maintaining comparable sensitivity. These demonstrate that SBTL is an effective, efficient, and stable multi-source transfer learning method for PD speech recognition, giving more accurate assistance information for clinicians on decision-making for PD in practice.
Read full abstract