Abstract
AbstractMachine learning classifiers play vital role in biomedical signals analysis and disease diagnosis. The selection of proper machine learning model for disease detection is based on the data characteristics. Bias and variance are the important errors which affects the machine learning model performance. Bias and variance are often taken into consideration for error analysis of any model. Unbiasedness is often considered as a positive property of a classifier selection condition but here we present a low variance is at least as significant, as a non-negligible variance introduces the possible solution for over-fitting problem in classifier selection and model training. In machine learning (ML), the performance degradation caused by over-fitting the ML classifiers selection criterion is a common problem but attained a minimum attention in machine learning literature. This paper is aimed to address the problems faced due to over-fitting in machine learning. The effects of over-fitting are often of comparable degree to differences in performance between various learning algorithms and hence cannot be avoided in experimental evaluation. Common performance measuring matrices are dependent on selection of bias/variance and hence results in over-fitting which are unreliable in practice. We discuss various methods to evade the over-fitting in the selection of classifiers and also discuss subsequent bias/variance selection in performance parameter evaluation. While this study focuses on statistical parameter-based ML classifiers selection, the findings are quite general and can be applied for any model selection in practice involving ML classifiers selection in biomedical signal and data applications. The novelty of the suggested method highlights on effect of bias and variance in choosing the ML classifiers, especially for biomedical signals and data classification. There is very limited work carried out to the best of our knowledge toward the ML classifier selection based on bias and variance, and hence, our suggested method ensures better performance in abnormality detection using biomedical signal and data.KeywordsEstimatorBiasVarianceOver-fittingClassifier selectionPerformance evaluationROCAUC
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.