Abstract

Background: Many studies employed machine learning (ML) to forecast the prognosis of breast cancer (BC) patients and discovered that the ML model showed high individualized forecasting ability. Breast cancer is the most frequent kind of carcinoma in women globally and ranks as the leading cause of death in women. Objectives: This study intends to use the Surveillance, Epidemiology, and End Results dataset to categorize breast carcinoma cases’ alive and dead conditions. Deep learning and machine learning have been extensively utilized in clinical studies to address various categorization problems due to their ability to manage massive data sets in an organized manner. Pre-processing the data allows it to be visualized and analyzed for making critical choices. This study describes a realistic machine learning-based strategy for categorizing the SEER breast cancer dataset. Materials and methods: We employed classification and machine learning algorithms to classify breast cancer mortality. Four well-known classification ML algorithms were employed in this study. To identify risk factors, we employed multivariate analysis using the data set. Results: The decision tree performed the best accuracy (0.914) among all the models. T4 stage (β=1.4, p<0.001, OR=4.22, 95% CI (2.06-8.64), N2 stage (β=0.39, p=0.008, OR= 1.49, 95% CI (1.111-1.997) found to be major risk factors for breast cancer mortality using multivariate analysis. Conclusion: The significant prognostic variables affecting the breast carcinoma survival rates reported in the current research are relevant and might be turned into decision support systems in the medical realm.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call