Abstract

Lung cancer is the second most common cancer in the United States and the leading cause of mortality in cancer patients. Biomarkers predicting survival of patients with lung cancer have a profound effect on patient prognosis and treatment. However, predictive biomarkers for survival and their relevance for lung cancer are not been well known yet. The objective of this study was to perform machine learning with data from The Cancer Genome Atlas of patients with lung adenocarcinoma (LUAD) to find survival-specific gene mutations that could be used as survival-predicting biomarkers. To identify survival-specific mutations according to various clinical factors, four feature selection methods (information gain, chi-squared test, minimum redundancy maximum relevance, and correlation) were used. Extracted survival-specific mutations of LUAD were applied individually or as a group for Kaplan-Meier survival analysis. Mutations in MMRN2 and GMPPA were significantly associated with patient mortality while those in ZNF560 and SETX were associated with patient survival. Mutations in DNAJC2 and MMRN2 showed significant negative association with overall survival while mutations in ZNF560 showed significant positive association with overall survival. Mutations in MMRN2 showed significant negative association with disease-free survival while mutations in DRD3 and ZNF560 showed positive associated with disease-free survival. Mutations in DRD3, SETX, and ZNF560 showed significant positive association with survival in patients with LUAD while the opposite was true for mutations in DNAJC2, GMPPA, and MMRN2. These gene mutations were also found in other cohorts of LUAD, lung squamous cell carcinoma, and small cell lung cancer. In LUAD of Pan-Lung Cancer cohort, mutations in GMPPA, DNAJC2, and MMRN2 showed significant negative associations with survival of patients while mutations in DRD3 and SETX showed significant positive association with survival. In this study, machine learning was conducted to obtain information necessary to discover specific gene mutations associated with the survival of patients with LUAD. Mutations in the above six genes could predict survival rate and disease-free survival rate in patients with LUAD. Thus, they are important biomarker candidates for prognosis.

Highlights

  • Lung cancer is the leading cause of death in patients with cancer

  • Among mutations in 19 genes, mutations in GMPPA and MMRN2 were significantly associated with patient mortality while mutations in ZNF560 and SETX were significantly associated with patient survival (Fig 1 and Table 1)

  • Mutations in DNAJC2, GMPPA, MMRN2, DRD3, SETX, and ZNF560 were associated with survival in patients with lung adenocarcinoma (LUAD)

Read more

Summary

Introduction

Lung cancer is the leading cause of death in patients with cancer. It is the second most common cancer in men and women to date in the United States, following prostate cancer in men and breast cancer in women, respectively [1,2,3]. In 2016, 17,963 people died from lung cancer in Korea, accounting for 23% of all cancer-related deaths. Lung cancer has the highest mortality rate, few biomarkers for predicting overall survival or disease-free survival have been reported. Predicting survival rate in patients with cancer has a significant impact on their prognosis and treatment [4,5,6]

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call