Breast cancer is a significant contributor to female mortality, emphasizing the importance of early detection. Predicting breast cancer accurately remains a complex challenge within medical data analysis. Machine learning (ML) algorithms offer valuable assistance in decision-making and diagnosis using medical data. Numerous research studies highlight the effectiveness of ML techniques in improving breast cancer prediction. Feature selection plays a pivotal role in data preprocessing, eliminating irrelevant and redundant features to minimize feature count and improve classification accuracy. This study focuses on optimizing breast cancer diagnostics through feature selection methods, specifically genetic algorithms (GA) and particle swarm optimization (PSO). The research involves a comparative analysis of these methods and the application of a diverse set of ML classification techniques, including logistic regression (LR), support vector machine (SVM), decision tree (DT), and ensemble methods like random forest (RF), AdaBoost, and gradient boosting (GB), using a breast cancer dataset. The models' performance is subsequently evaluated using various performance metrics. The experimental findings illustrate that PSO achieved the highest average accuracy, reaching 99.6% when applied to AdaBoost, while GA attained an accuracy rate of 99.5% when employed with both AdaBoost and RF.
Read full abstract