Abstract
With the recent Coronavirus Disease 2019 (COVID-19) outbreak, there has been a growing interest in the utilization of machine learning models to aid in medical diagnostics. However, during the initial phases of the pandemic, the available datasets were severely imbalanced due to a scarcity of positive cases. These datasets predominantly comprised normal cases, with only a limited number of disease instances. This inherent data imbalance restricted the effectiveness of binary classification, prompting an exploration of one-class classification. This study aims to compare the performance of one-class and binary classification models during the early stages of COVID-19 development. The objective is to provide healthcare professionals with the most effective machine learning-assisted solutions for handling future outbreaks of a similar nature. The experiment incorporates three distinct models: One-Class Support Vector Machine (OCSVM) for one-class classification, alongside Categorical Boosting (CatBoost) and Extreme Gradient Boosting (XGBoost) for binary classification. These models undergo a rigorous tuning process employing grid search, using training data composed of features extracted from compressed chest X-ray images. In summary, the research findings unequivocally demonstrate the superior performance of one-class classification over binary classification in all early stages, achieving an impressive F1 score of 0.46, compared to binary classification's modest score of 0.063. Consequently, it is strongly recommended to employ a one-class classification model to enhance real-life diagnostic processes. However, it is anticipated that as more positive cases are incorporated into the dataset, achieving a more balanced distribution, binary classification may exhibit greater potential. Thus, medical practitioners should reconsider the applicability of binary classification in later stages when the dataset attains improved balance.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.