Importance of Feature Selection and Data Visualization Towards Prediction of Breast Cancer

Rajalakshmi Krishnamurthi,Diva Srivastava,Niyati Aggrawal,Lokendra Sharma,Shivangi Sharma

doi:10.2174/2213275912666190101121058

Abstract

Background: Breast cancer is one of the most common forms of cancers among women and the leading cause of death among them. Countries like United States, England and Canada have reported a high number of breast cancer patients every year and this number is continuously increasing due to detection at later stages. Hence, it is very important to create awareness among women and develop such algorithms which help to detect malignant cancer. Several research studies have been conducted to analyze the breast cancer data. Objective: This paper presents an effective method in predicting breast cancer and its stage and will also analyze the performance of different supervised learning algorithms such as Random Classifier, Chi2 Square test used in order to predict. The paper focuses on the three important aspects such as the feature selection, the corresponding data visualisation and finally making a prediction call on different machine learning models. Methods: The dataset used for this work is breast cancer Wisconsin data taken from UCI library. The dataset has been used to show the different 32 features which are all important and how it can be achieved using data visualisation. Secondly, after the feature selection, different machine learning models have been applied. Conclusion: The machine learning models involved are namely Support Vector Machine (SVM), KNearest Neighbour (KNN), Random Forest, Principal Component Analysis (PCA), Neural Network using Perceptron (NNP). This has been done to check which type of model is better under what conditions. At different stages several charts have been plotted and eliminated based on relative comparison. Results have shown that Random Tree classifier along with Chi2 Square proves to be an efficient one.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Importance of Feature Selection and Data Visualization Towards Prediction of Breast Cancer

Abstract

Talk to us

Similar Papers

More From: Recent Patents on Computer Science

Lead the way for us

Journal: Recent Patents on Computer Science	Publication Date: Aug 19, 2019
Citations: 5

Similar Papers

Predicting Breast Cancer from Risk Factors Using SVM and Extra-Trees-Based Feature Selection Method
Ganjar Alfian ... Nurul Bahiyah
Computers | VOL. 11
Ganjar Alfian, et. al.Ganjar Alfian ... Nurul Bahiyah
12 Sep 2022
Computers | VOL. 11

Prediction of Female Breast Cancer Incidence among the Aging Society in Kanagawa, Japan
Kayoko Katayama ... Hiroto Narimatsu
PLOS ONE | VOL. 11
Kayoko Katayama, et. al.Kayoko Katayama ... Hiroto Narimatsu
17 Aug 2016
PLOS ONE | VOL. 11

Research on Prediction of Breast Cancer Type using Machine Learning
Dehui Kong
Highlights in Science, Engineering and Technology | VOL. 54
Dehui KongDehui Kong
04 Jul 2023
Highlights in Science, Engineering and Technology | VOL. 54

Comparative Analysis of Breast and Prostate Cancer Prediction Using Machine Learning Techniques
Samta Rani ... Tanvir Ahmad
-
Samta Rani, et. al.Samta Rani ... Tanvir Ahmad
27 Sep 2022
27 Sep 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Importance of Feature Selection and Data Visualization Towards Prediction of Breast Cancer

Abstract

Talk to us

Similar Papers

More From: Recent Patents on Computer Science