A novel prognostic model based on multi-omics features predicts the prognosis of colon cancer patients.

Haojie Yang, Can Cui, Yilin Han, Jiong Wu, Dan Gan, Hua Liu, Wei Jin, Xiaoxue Wang, Zhenyi Wang, Changpeng Han

doi:10.1002/mgg3.1255

Abstract

Background: Colon cancer (CC) is a common malignant tumor with high incidence and recurrence rates. This study was designed to build a prognostic model for CC.

Methods: The gene expression, microRNA‐seq, copy number variation (CNV), DNA methylation, and transcription factor (TF) datasets of CC were downloaded from the UCSC Xena database. Using the limma package, differentially methylated genes (DMGs), differentially expressed genes (DEGs), and differentially expressed miRNAs (DEMs) were identified. A prognostic model was constructed for each omics dataset using the random forest method. After the omics features related to prognosis were selected using the logrank test, a prognostic model based on multi‐omics features was built. Finally, the clinical phenotypes correlated with prognosis were screened using Kaplan–Meier survival analysis, and a nomogram model was established.

Results: There were 1625 DEGs, 268 DEMs, and 386 DMGs between the tumor and normal samples. A total of 105, 29, 159, 5, and 6 genes/sites significantly correlated with prognosis were identified in the gene expression dataset (GABRD), miRNA‐seq dataset (miR‐1271), CNV dataset (RN7SKP247), DNA methylation dataset (cg09170112 methylation site [located in SFSWAP]), and TF dataset (SIX5), respectively. The prognostic model based on multi‐omics features was more effective than the models based on a single omics dataset. The number of lymph nodes, pathologic_M stage, and pathologic_T stage were the clinical phenotypes correlated with prognosis, and the nomogram model was constructed from these three phenotypes.

Conclusion: The prognostic model based on multi‐omics features and the nomogram model might be valuable for the prognostic prediction of CC.
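The feature-selection step in the Methods relies on the logrank test, which compares the survival of two patient groups (for example, patients split into high and low expression of one gene). The following is a minimal pure-Python sketch of a two-group logrank test, given only as an illustration. It is not the authors' code; the function name `logrank_test`, its arguments, and the toy follow-up times are all hypothetical.

```python
import math


def logrank_test(times_a, events_a, times_b, events_b):
    """Two-group logrank test (hypothetical helper, not from the paper).

    times_*  : follow-up times for each patient in the group
    events_* : 1 if the event (death) was observed, 0 if censored
    Returns (chi-square statistic, p-value) with 1 degree of freedom.
    """
    # Pool the observations, tagging each with its group (0 = A, 1 = B).
    data = [(t, e, 0) for t, e in zip(times_a, events_a)]
    data += [(t, e, 1) for t, e in zip(times_b, events_b)]
    # Only time points with at least one observed event contribute.
    event_times = sorted({t for t, e, _ in data if e})

    observed_minus_expected = 0.0
    variance = 0.0
    for t in event_times:
        # Patients still at risk just before time t, per group.
        n_a = sum(1 for ti, _, g in data if ti >= t and g == 0)
        n_b = sum(1 for ti, _, g in data if ti >= t and g == 1)
        # Events occurring exactly at time t, per group.
        d_a = sum(1 for ti, e, g in data if ti == t and e and g == 0)
        d_b = sum(1 for ti, e, g in data if ti == t and e and g == 1)
        n, d = n_a + n_b, d_a + d_b
        # Observed minus expected events in group A under the null hypothesis.
        observed_minus_expected += d_a - d * n_a / n
        if n > 1:
            # Hypergeometric variance of the event count in group A.
            variance += d * (n_a / n) * (n_b / n) * (n - d) / (n - 1)

    if variance == 0:
        return 0.0, 1.0
    chi2 = observed_minus_expected ** 2 / variance
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Toy data (made up): group A fails early, group B fails late.
chi2, p = logrank_test([1, 2, 3, 4, 5], [1, 1, 1, 1, 1],
                       [10, 11, 12, 13, 14], [1, 1, 1, 1, 1])
print(f"chi2 = {chi2:.2f}, p = {p:.4f}")
```

In a screening setting of the kind the abstract describes, a test like this would be run once per candidate feature, and features whose p-value falls below a chosen threshold would be kept. The threshold and the way patients are split into groups are assumptions here, since the abstract does not state them.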

Full Text