Abstract
BackgroundDrug sensitivity prediction and drug responsive biomarker selection on high-throughput genomic data is a critical step in drug discovery. Many computational methods have been developed to serve this purpose including several deep neural network models. However, the modular relations among genomic features have been largely ignored in these methods. To overcome this limitation, the role of the gene co-expression network on drug sensitivity prediction is investigated in this study.MethodsIn this paper, we first introduce a network-based method to identify representative features for drug response prediction by using the gene co-expression network. Then, two graph-based neural network models are proposed and both models integrate gene network information directly into neural network for outcome prediction. Next, we present a large-scale comparative study among the proposed network-based methods, canonical prediction algorithms (i.e., Elastic Net, Random Forest, Partial Least Squares Regression, and Support Vector Regression), and deep neural network models for drug sensitivity prediction. All the source code and processed datasets in this study are available at https://github.com/compbiolabucf/drug-sensitivity-prediction.ResultsIn the comparison of different feature selection methods and prediction methods on a non-small cell lung cancer (NSCLC) cell line RNA-seq gene expression dataset with 50 different drug treatments, we found that (1) the network-based feature selection method improves the prediction performance compared to Pearson correlation coefficients; (2) Random Forest outperforms all the other canonical prediction algorithms and deep neural network models; (3) the proposed graph-based neural network models show better prediction performance compared to deep neural network model; (4) the prediction performance is drug dependent and it may relate to the drug’s mechanism of action.ConclusionsNetwork-based feature selection method and prediction models improve the performance of the drug response prediction. The relations between the genomic features are more robust and stable compared to the correlation between each individual genomic feature and the drug response in high dimension and low sample size genomic datasets.
Highlights
Drug sensitivity prediction and drug responsive biomarker selection on high-throughput genomic data is a critical step in drug discovery
A non-small cell lung cancer (NSCLC) cell line RNA-seq gene expression dataset with 50 different drug treatments was applied to evaluate the performance [34]
Alternative methods for comparison and evaluation For more insight in the drug sensitivity prediction problem by using gene expression data, we compared our proposed methods with four canonical prediction algorithms that were used in DREAM 7 - Drug Sensitivity Prediction Challenge [4]: Random Forest, Support Vector Regression (SVR), Elastic Net, and Partial Least Squares Regression (PLSR)
Summary
Drug sensitivity prediction and drug responsive biomarker selection on high-throughput genomic data is a critical step in drug discovery. The modular relations among genomic features have been largely ignored in these methods To overcome this limitation, the role of the gene co-expression network on drug sensitivity prediction is investigated in this study. Powered by the high-throughput genomic technologies developed in the past two decades, personalized treatment has been enabled to understand complex diseases for individual patients. The question of effective translation of highthroughput omics data from patient samples into prognosis and personalized treatment still remains It needs a comprehensive study across many drugs, patients, diseases and profiling technologies, which is limited by time, expense and scope of the drugs that can be tested. The omics data includes but not limited to gene expression, mutation, and copy number variations
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.