Abstract

Cancer is the second leading cause of mortality across the globe. Approximately 9.6 million people are estimated to have died due to cancer disease in 2019. Accurate and early prediction of cancer can assist healthcare professionals to devise timely therapeutic innervations to control sufferings and the risk of mortality. Generally, a machine learning (ML) based predictive system in healthcare uses data (genetic profile or clinical parameters) and learning algorithms to predict target values for cancer detection. However, optimization of predictive accuracy is an important endeavor for accurate decision making. Reject Option (RO) classifiers have been used to improve the predictive accuracy of classifiers for cancer like complex problems. In a gene profile all of the features are not important and should be shaved off. ML offers different techniques with their own methodology for feature selection (FS) and the classification results are dependent on the datasets each having its own distribution and features. Therefore, both FS methods and ML algorithms with RO need to be considered for robust classification. The main objective of this study is to optimize three parameters (learning algorithm, FS method and rejection rate) for robust cancer prediction rather than considering two traditional parameters (learning algorithm and rejection rate). The analysis of different FS methods (including t-test, Las Vegas Filter (LVF), Relief, and Information Gain (IG)) and RO classifiers on different rejection thresholds is performed to investigate the robust predictability of cancer. The three cancer datasets (Colon cancer, Leukemia and Breast cancer) were reduced using different FS methods and each of them were used to analyze the predictability of cancer using different RO classifiers. The results reveal that for each dataset predictive accuracies of RO classifiers were different for different FS methods. The findings based on proposed scheme indicate that, the ML algorithms along with their dependence on suitable FS methods need to be taken into consideration for accurate prediction.

Highlights

  • Cancer is becoming one of the main causes of death across the globe [1]

  • WORK DIRECTIONS In this paper, we analyzed the dependence of Reject Option (RO) based machine learning (ML) algorithms on feature selection (FS) methods for robust prediction of cancer while using Gene Expression (GE) microarray data

  • The results reveal that the RO classifiers showed improvements in predictive accuracy of RO classifiers differently at different Rejection rates and with the reduction of feature space with different FS methods

Read more

Summary

Introduction

Cancer is becoming one of the main causes of death across the globe [1]. Thousands of people die and agonize across the world every year due to inaccuracies in the healthcare systems. Growth and abnormalities occurring on the development of some specific cancer [2]. The extraction of this valuable information can assist in the reliable prediction of disease onset and in devising managerial solutions for the selection of therapy and personalized care [2]. The quality of prediction for effective decision making is of primary importance and the main emphasis of science is assisting the insufficiencies of human findings and judgments [3]. In the fields of artificial intelligence, information science etc. numerous tools and techniques called decisions support systems (DSS)

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call