KESVR: An Ensemble Model for Drug Response Prediction in Precision Medicine Using Cancer Cell Lines Gene Expression.

Abhishek Majumdar,Yaoqin Lu,Yueze Liu,Shaofeng Wu,Lijun Cheng

doi:10.3390/genes12060844

Abstract

Background: Cancer cell lines are frequently used in research as in-vitro tumor models. Genomic data and large-scale drug screening have accelerated the right drug selection for cancer patients. Accuracy in drug response prediction is crucial for success. Due to data-type diversity and big data volume, few methods can integrative and efficiently find the principal low-dimensional manifold of the high-dimensional cancer multi-omics data to predict drug response in precision medicine. Method: A novelty k-means Ensemble Support Vector Regression (kESVR) is developed to predict each drug response values for single patient based on cell-line gene expression data. The kESVR is a blend of supervised and unsupervised learning methods and is entirely data driven. It utilizes embedded clustering (Principal Component Analysis and k-means clustering) and local regression (Support Vector Regression) to predict drug response and obtain the global pattern while overcoming missing data and outliers’ noise. Results: We compared the efficiency and accuracy of kESVR to 4 standard machine learning regression models: (1) simple linear regression, (2) support vector regression (3) random forest (quantile regression forest) and (4) back propagation neural network. Our results, which based on drug response across 610 cancer cells from Cancer Cell Line Encyclopedia (CCLE) and Cancer Therapeutics Response Portal (CTRP v2), proved to have the highest accuracy (smallest mean squared error (MSE) measure). We next compared kESVR with existing 17 drug response prediction models based a varied range of methods such as regression, Bayesian inference, matrix factorization and deep learning. After ranking the 18 models based on their accuracy of prediction, kESVR ranks first (best performing) in majority (74%) of the time. As for the remaining (26%) cases, kESVR still ranked in the top five performing models. Conclusion: In this paper we introduce a novel model (kESVR) for drug response prediction using high dimensional cell-line gene expression data. This model outperforms current existing prediction models in terms of prediction accuracy and speed and overcomes overfitting. This can be used in future to develop a robust drug response prediction system for cancer patients using the cancer cell-lines guidance and multi-omics data.

Highlights

A novelty k-means Ensemble Support Vector Regression is developed to predict each drug response values for single patient based on cell-line gene expression data. kESVR’s origin stems from previous work interval SVR [22], where we separated a global nonlinear SVR predictor into interval subspaces and ran a SVR in each interval subspace
We demonstrate the steps of creation of kESVR model using zebularine drug response from Cancer Therapeutics Response Portal (CTRP) on 610 cancer cells from Cell Line Encyclopedia (CCLE)
We test the performance of kESVR on 5 random drugs out of the 481 drugs from CTRP

Summary

Introduction

Precision medicine aims to provide individually tailored cancer treatment by considering an individual’s genetic makeup, genomic makeup and clinical information. Generation Sequencing (NGS) techniques and large-scale cancer screening data helps in achieving this goal [1,2]. Databases such as the Cancer Cell Line Encyclopedia (CCLE) [2]. Provides public access to genomic data over 1000 cancer cell lines by RNA sequencing (RNA-seq; 1019 cell lines), whole-exome sequencing (326 cell lines), whole-genome sequencing (329 cell lines), and reverse-phase protein array (RPPA; 899 cell lines).

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Genes	Publication Date: May 30, 2021
Citations: 8	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

KESVR: An Ensemble Model for Drug Response Prediction in Precision Medicine Using Cancer Cell Lines Gene Expression.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Genes

Lead the way for us

Similar Papers

Super.FELT: supervised feature extraction learning using triplet loss for drug response prediction with multi-omics data
Sejin Park ... Hyunju Lee
BMC bioinformatics | VOL. 22
Sejin Park, et. al.Sejin Park ... Hyunju Lee
25 May 2021
BMC bioinformatics | VOL. 22

The application of artificial intelligence to drug sensitivity prediction
Xutong Li ... Kaixian Chen
Chinese Science Bulletin | VOL. 65
Xutong Li, et. al.Xutong Li ... Kaixian Chen
17 Jun 2020
Chinese Science Bulletin | VOL. 65

Tumor cell sensitivity to vemurafenib can be predicted from protein expression in a BRAF-V600E basket trial setting
Molly J Carroll ... David Page
BMC Cancer | VOL. 19
Molly J Carroll, et. al.Molly J Carroll ... David Page
31 Oct 2019
BMC Cancer | VOL. 19

Abstract 1520: Identifying differential dependency networks accounting for response to NEDD8-inhibitor in large-scale cancer cell line data
Gil Speyer ... Stuart Schreiber
Cancer Research | VOL. 76
Gil Speyer, et. al.Gil Speyer ... Stuart Schreiber
15 Jul 2016
Cancer Research | VOL. 76

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

KESVR: An Ensemble Model for Drug Response Prediction in Precision Medicine Using Cancer Cell Lines Gene Expression.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Genes