Abstract

Cancer has been characterized as a heterogeneous disease consisting of many different subtypes. The early diagnosis and prognosis of a cancer type have become a necessity in cancer research, as it can facilitate the subsequent clinical management of patients. The importance of classifying cancer patients into high or low risk groups has led many research teams, from the biomedical and the bioinformatics field, to study the application of machine learning (ML) methods. Therefore, these techniques have been utilized as an aim to model the progression and treatment of cancerous conditions. In addition, the ability of ML tools to detect key features from complex datasets reveals their importance. A variety of these techniques, including Artificial Neural Networks (ANNs), Bayesian Networks (BNs), Support Vector Machines (SVMs) and Decision Trees (DTs) have been widely applied in cancer research for the development of predictive models, resulting in effective and accurate decision making. Even though it is evident that the use of ML methods can improve our understanding of cancer progression, an appropriate level of validation is needed in order for these methods to be considered in the everyday clinical practice. In this work, we present a review of recent ML approaches employed in the modeling of cancer progression. The predictive models discussed here are based on various supervised ML techniques as well as on different input features and data samples. Given the growing trend on the application of ML methods in cancer research, we present here the most recent publications that employ these techniques as an aim to model cancer risk or patient outcomes.

Highlights

  • Over the past decades, a continuous evolution related to cancer research has been performed [1]

  • We identify the trends regarding the types of machine learning (ML) methods that are used, the types of data that are integrated as well as the evaluation methods employed for assessing the overall performance of the methods used for cancer prediction or disease outcomes

  • Among the most recent publications that resulted after our limited literature search regarding the cancer risk assessment prediction [19,56,57,58], we selected a recent and very interesting study to present relevant to the breast cancer risk estimation by means of Artificial Neural Networks (ANNs) [19]

Read more

Summary

Introduction

A continuous evolution related to cancer research has been performed [1]. The last two decades a variety of different ML techniques and feature selection algorithms have been widely applied to disease prognosis and prediction [3,22,23,24,25,26,27] Most of these works employ ML methods for modeling the progression of cancer and identify informative factors that are utilized afterwards in a classification scheme. A growing trend is noted the last decade in the use of other supervised learning techniques, namely SVMs and BNs, towards cancer prediction and prognosis [24,31,32,33,34,35,36] All of these classification algorithms have been widely used in a wide range of problems posed in cancer research. TCGA provides with the ability to better understand the molecular basis of cancer through the application of high-throughput genome technologies

Survey of ML applications in cancer
Prediction of cancer susceptibility
Prediction of cancer recurrence
Method
Prediction of cancer survival
Findings
Discussion
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call