Abstract

Researchers dealing with real-world data - such as in the healthcare domain - tend to face class imbalance issues. More specifically, publicly available datasets containing Chest X-Ray (CXR) of Pneumonia diseases (including COVID-19) usually have an imbalanced class distribution. This dataset imbalance causes automatic diagnosis systems to classify majority classes with much more accuracy than the minority ones. Several resampling algorithms were proposed in the past to deal with the class imbalance issue. Hierarchical classifiers have also been proposed to increase the predictive performance of classifiers, but there is little research in the literature verifying if using existing resampling algorithms with hierarchical classifiers are a good alternative to improve classification performance. This work proposes an experimental classification schema to investigate the effectiveness of using resampling algorithms in the identification of COVID-19 and other types of Pneumonia through CXR images. The proposed schema uses resampling algorithms to rebalance the class distribution, in a Local Hierarchical Classification scenario. The experimental evaluation, which is supported by inferential statistical analysis, showed that using specific resampling algorithms with Local Hierarchical Classifiers brings a statistically significant increase to the macro-averaged Fl-Score, and improves the predictive performance for the minority classes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.