Abstract
Epithelial mesenchymal transition (EMT) process has been shown as highly relevant to cancer prognosis. However, although different biological network-based biomarker identification methods have been proposed to predict cancer prognosis, EMT network has not been directly used for this purpose. In this study, we constructed an EMT regulatory network consisting of 87 molecules and tried to select features that are useful for prognosis prediction in Lung Adenocarcinoma (LUAD). To incorporate multiple molecular profiles, we obtained four types of molecular data including mRNA-Seq, copy number alteration (CNA), DNA methylation, and miRNA-Seq data from The Cancer Genome Atlas. The data were mapped to the EMT network in three alternative ways: mRNA-Seq and miRNA-Seq, DNA methylation, and CNA and miRNA-Seq. Each mapping was employed to extract five different sets of features using discretization and network-based biomarker identification methods. Each feature set was then used to predict prognosis with SVM and logistic regression classifiers. We measured the prediction accuracy with AUC and AUPR values using 10 times 10-fold cross validation. For a more comprehensive evaluation, we also measured the prediction accuracies of clinical features, EMT plus clinical features, randomly picked 87 molecules from each data mapping, and using all molecules from each data type. Counter-intuitively, EMT features do not always outperform randomly selected features and the prediction accuracies of the five feature sets are mostly not significantly different. Clinical features are shown to give the highest prediction accuracies. In addition, the prediction accuracies of both EMT features and random features are comparable as using all features (more than 17,000) from each data type.
Highlights
Epithelial mesenchymal transition (EMT) process has been shown as highly relevant to cancer prognosis
Different biological network-based biomarker identification methods have been proposed to predict cancer prognosis, EMT network has not been directly used for this purpose
We constructed an EMT regulatory network consisting of 87 molecules and tried to select features that are useful for prognosis prediction in Lung Adenocarcinoma (LUAD)
Summary
Epithelial mesenchymal transition (EMT) process has been shown as highly relevant to cancer prognosis. In [29] the authors compare the performance of molecular biomarkers and network motif biomarkers in breast cancer prognosis classification They enumerate all three nodes network motifs using PPI, DNA-protein interaction, and a few pathway databases; assign a score to each motif; and use the scores to make predictions. In another study [26] the authors integrate PPI network and gene expression data to identify subnetwork markers for breast cancer metastasis prediction. In [28], researchers integrate CNA, microarray, patient clinical data and PPI network to identify subnetworks for predicting prognosis of OV They show that subnetwork biomarkers are able to achieve good prediction accuracy (74.47%) on independent test set and at the same time reveal biologically meaningful pathways. We use the word feature(s) in the context of machine learning and the word node(s) in the context of EMT network for clarity
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.