Abstract

Breast cancers emerged as one of the fatal diseases among women. Particular efforts are being taken continuously around the globe to help this cause. Taking advantage of Machine Learning (ML) technology is a measure to lessen the death toll of this disease. This paper has emphasized breast cancer classification to expedite the diagnostic process by identifying whether the tumor is benign or malignant. Moreover, in this process, we have combined the ensemble learning-based Hard Voting (HV) technique with random under-sampling using the Wisconsin Diagnostic Breast Cancer (WDBC) dataset. This dataset provides several features upon performing Fine Needle Aspirate (FNA) test to each affected individual, and five features have been chosen to accomplish the classification task. These features have been further scaled using RobustScaler, and the classes (benign or malignant) were balanced using random under-sampling to achieve an accurate outcome. Four distinguished ML classifiers, e.g., Decision Tree Classifier (DTC), k-Nearest Neighbor (KNN), Random Forest (RF), and Support Vector Machine (SVM), have been applied to form an HV meta-classifier. The HV meta-classifier has obtained 99.42% test accuracy in classifying breast cancer incidents. This result has been further verified by considering other distinct parameters.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.