Classification Prediction of Breast Cancer Based on Machine Learning.

Hua Chen,Kehui Mei,Guangxing Cai,Nan Wang,Xueping Du,Yuan Zhou

doi:10.1155/2023/6530719

Abstract

Breast cancer is the most common and deadly type of cancer in the world. Based on machine learning algorithms such as XGBoost, random forest, logistic regression, and K-nearest neighbor, this paper establishes different models to classify and predict breast cancer, so as to provide a reference for the early diagnosis of breast cancer. Recall indicates the probability of detecting malignant cancer cells in medical diagnosis, which is of great significance for the classification of breast cancer, so this article takes recall as the primary evaluation index and considers the precision, accuracy, and F1-score evaluation indicators to evaluate and compare the prediction effect of each model. In order to eliminate the influence of different dimensional concepts on the effect of the model, the data are standardized. In order to find the optimal subset and improve the accuracy of the model, 15 features were screened out as input to the model through the Pearson correlation test. The K-nearest neighbor model uses the cross-validation method to select the optimal k value by using recall as an evaluation index. For the problem of positive and negative sample imbalance, the hierarchical sampling method is used to extract the training set and test set proportionally according to different categories. The experimental results show that under different dataset division (8 : 2 and 7 : 3), the prediction effect of the same model will have different changes. Comparative analysis shows that the XGBoost model established in this paper (which divides the training set and test set by 8 : 2) has better effects, and its recall, precision, accuracy, and F1-score are 1.00, 0.960, 0.974, and 0.980, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computational Intelligence and Neuroscience	Publication Date: Jan 1, 2023
Citations: 13	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Classification Prediction of Breast Cancer Based on Machine Learning.

Abstract

Talk to us

Similar Papers

More From: Computational Intelligence and Neuroscience

Lead the way for us

Similar Papers

Optimization and normalization strategies for long term untargeted HILIC-LC-qTOF-MS based metabolomics analysis: Early diagnosis of breast cancer
Tuba Reçber ... Sedef Kır
Microchemical Journal | VOL. 179
Tuba Reçber, et. al.Tuba Reçber ... Sedef Kır
03 Jun 2022
Microchemical Journal | VOL. 179

Diagnosis of Breast Cancer Using Improved Machine Learning Algorithms Based on Bayesian Optimization
Zeynep Ceylan
International Journal of Intelligent Systems and Applications in Engineering | VOL. 8
Zeynep CeylanZeynep Ceylan
28 Sep 2020
International Journal of Intelligent Systems and Applications in Engineering | VOL. 8

Rapid Diagnosis of Ductal Carcinoma In Situ and Breast Cancer Based on Raman Spectroscopy of Serum Combined with Convolutional Neural Network.
Xianglei Wang ... Shu Wang
Bioengineering (Basel, Switzerland) | VOL. 10
Xianglei Wang, et. al.Xianglei Wang ... Shu Wang
04 Jan 2023
Bioengineering (Basel, Switzerland) | VOL. 10

Fuzzy Neural Network Expert System with an Improved Gini Index Random Forest-Based Feature Importance Measure Algorithm for Early Diagnosis of Breast Cancer in Saudi Arabia
Ebrahem A Algehyne ... Osama Abdulaziz Alamri
Big Data and Cognitive Computing | VOL. 6
Ebrahem A Algehyne, et. al.Ebrahem A Algehyne ... Osama Abdulaziz Alamri
27 Jan 2022
Big Data and Cognitive Computing | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Classification Prediction of Breast Cancer Based on Machine Learning.

Abstract

Talk to us

Similar Papers

More From: Computational Intelligence and Neuroscience