Abstract
PurposeIn many fields, including bioinformatics, ensemble methods have been often employed to perform prediction tasks such as classification and regression due to their ability to improve the accuracy and robustness of machine learning models. Given that the most prevalent cancer and the primary cause of death for women is breast cancer, researchers have started using ensemble approaches to advance their studies in this area. Effectively, ensemble methods can be used to develop clinical decision support systems that can help healthcare providers make more accurate judgements. MethodsIn the present study, we construct and evaluate bagging and boosting ensemble methods for the binary classification of breast cancer screening images from the CBIS-DDSM dataset. To build the bagging ensembles, a hybrid architecture consisting of three feature extractors (Inception V3, MobileNet V2, DenseNet 201) and four classifiers (K-nearest neighbors, Multilayer perceptron, Support vector machine, Decision trees) was chosen. As for the boosting ensembles, the method consists of the same three feature extractors with a decision trees-based classifier and the four boosting methods (AdaBoost, GBM, XGboost, LightGBM). ResultsThe evaluation of both ensemble methods over the mammography dataset was made using four performance metrics (accuracy, sensitivity, F1-score, and recall), as well as the Skott–Knott statistical test and the Borda count voting system to rank all the possible model combinations. Results proved that bagging and boosting ensembles generally outperformed both their base learners and single models, and that the best performing ensemble overall was Gradient boosting with DenseNet 201 as feature extractor and 200 trees which scored an accuracy of 85.50 %. ConclusionsThe purpose of this research was to use hybrid learning based ensembles to maximize the potent performance of numerous models. It effectively illustrated the prospective potential of using ML approaches for the binary classification of mammography data used for breast cancer screening.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have