Abstract

Predicting lung adenocarcinoma (LUAD) and Lung Squamous Cell Carcinoma (LUSC) risk status is a crucial step in precision oncology. In current clinical practice, clinicians, and patients are informed about the patient's risk group only with cancer staging. Several machine learning approaches for stratifying LUAD and LUSC patients have recently been described, however, there has yet to be a study that compares the integrated modeling of clinical and genetic data from these two lung cancer types. In our work, we used a prognostic prediction model based on clinical and somatically altered gene features from 1026 patients to assess the relevance of features based on their impact on risk classification. By integrating the clinical features and somatically mutated genes of patients, we achieved the highest accuracy; 93% for LUAD and 89% for LUSC, respectively. Our second finding is that new prognostic genes such as KEAP1 for LUAD and CSMD3 for LUSC and new clinical factors such as the site of resection are significantly associated with the risk stratification and can be integrated into clinical decision making. We validated the most important features found on an independent RNAseq dataset from NCBI GEO with survival information (GSE81089) and integrated our model into a user-friendly mobile application. Using this machine learning model and mobile application, clinicians and patients can assess the survival risk of their patients using each patient’s own clinical and molecular feature set.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call