Abstract

Recent advances in the state-of-the-art open-source kernel functions for support vector machines (SVMs) have widened the choices of benchmark kernels for Machine Learning (ML)-based classification. However, it is still challenging to achieve margin maximisation in SVM, and further evidence is required to ensure such novel kernel functions can have translational applications with tangible impact. Noteworthily, m-arcsinh, freely available in scikit-learn, was preliminarily proven as a benchmark kernel function on 15 datasets in its seminal paper. Quantifying the benefit from leveraging this kernel in a specific application is essential to provide further evidence of its accuracy and reliability on real-life supervised ML-aided tasks. Thus, the predictive capability of SVM, including that with Lagrange multipliers for the first time coupled with m-arcsinh (m-ark-SVM with soft margin; m-arK-SVM with hard margin), is hereby assessed in aiding early detection of Parkinson’s Disease (PD) from speech data. This is important to leverage the m-arcsinh kernel ‘trick’ to maximise the margin width and, therefore, the linear separability of input speech features via automated pattern recognition. In this study, we demonstrate the accuracy and reliability of m-ark-SVM to aid early diagnosis of PD, evaluated against other gold standard kernel functions. Two benchmark datasets from the University of California-Irvine (UCI) database, pre-processed solely via min-max normalisation, were used to discriminate between speech patterns of 72 healthy subjects and 211 patients with PD. Overtraining was avoided via cross validation and the models were developed and tested in Python 3.7. The supervised model (m-ark-SVM) could detect early Parkinson’s Disease with 87.18% and 86.9% classification accuracy from the two datasets respectively (F1- scores: 85 and 86.2% correspondingly). Furthermore, the model achieved high precision (89.2% and 86.8%) and specificity (87% and 86.8%). Thus, this study validates the application of m-arcsinh to aid real-life supervised ML-based classification, in particular early diagnosis of Parkinson’s Disease from speech data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.