Improving prediction of phenotypic drug response on cancer cell lines using deep convolutional network

Pengfei Liu, Hongjian Li, Kwong-Sak Leung, Shuai Li

doi:10.1186/s12859-019-2910-6

Open Access

https://doi.org/10.1186/s12859-019-2910-6

Abstract

Background: Understanding the phenotypic drug response on cancer cell lines plays a vital role in anti-cancer drug discovery and re-purposing. The Genomics of Drug Sensitivity in Cancer (GDSC) database provides open data for researchers in phenotypic screening to build and test their models. Previously, most research in this area started from the molecular fingerprints or physicochemical features of drugs, instead of their structures.

Results: In this paper, a model called twin Convolutional Neural Network for drugs in SMILES format (tCNNS) is introduced for phenotypic screening. tCNNS uses one convolutional network to extract features for drugs from their simplified molecular input line entry specification (SMILES) format and another convolutional network to extract features for cancer cell lines from their genetic feature vectors. A fully connected network then predicts the interaction between the drugs and the cancer cell lines. When the training set and the testing set are divided based on the interaction pairs between drugs and cell lines, tCNNS achieves 0.826 and 0.831 for the mean and top quartile of the coefficient of determination (R2), and 0.909 and 0.912 for the mean and top quartile of the Pearson correlation (Rp). These results are significantly better than those of previous works (Ammad-Ud-Din et al., J Chem Inf Model 54:2347–9, 2014; Haider et al., PLoS ONE 10:0144490, 2015; Menden et al., PLoS ONE 8:61318, 2013). However, when the training set and the testing set are divided exclusively based on drugs or cell lines, the performance of tCNNS decreases significantly, and Rp and R2 drop to barely above 0.

Conclusions: Our approach is able to predict the drug effects on cancer cell lines with high accuracy, and its performance remains stable with less but high-quality data and with fewer features for the cancer cell lines. tCNNS can also solve the problem of outliers in other feature spaces. Besides achieving high scores in these statistical metrics, tCNNS also provides some insights into phenotypic screening. However, the performance of tCNNS drops in the blind test.
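The abstract does not spell out how a SMILES string is turned into input for a convolutional network. One common choice, assumed here purely for illustration, is one-hot encoding: each position in the string becomes a column, each symbol in the vocabulary becomes a row, and a convolution can then slide along the positions. The function names, the simplified tokenizer (which only keeps `Cl` and `Br` together), and the padding length are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List


def tokenize_smiles(smiles: str) -> List[str]:
    """Split a SMILES string into tokens.

    Simplifying assumption: only the two-letter atoms 'Cl' and 'Br' are
    kept together; every other character is its own token.
    """
    tokens = []
    i = 0
    while i < len(smiles):
        pair = smiles[i:i + 2]
        if pair in ("Cl", "Br"):
            tokens.append(pair)
            i += 2
        else:
            tokens.append(smiles[i])
            i += 1
    return tokens


def build_vocab(smiles_list: List[str]) -> Dict[str, int]:
    """Map every token seen in the drug set to a row index."""
    symbols = sorted({tok for s in smiles_list for tok in tokenize_smiles(s)})
    return {sym: idx for idx, sym in enumerate(symbols)}


def one_hot_encode(smiles: str, vocab: Dict[str, int], max_len: int) -> List[List[int]]:
    """Return a (vocabulary size) x (max_len) matrix of 0/1 values.

    Positions beyond the end of the string stay all-zero (padding).
    """
    tokens = tokenize_smiles(smiles)
    if len(tokens) > max_len:
        raise ValueError("SMILES string is longer than max_len")
    matrix = [[0] * max_len for _ in vocab]
    for pos, tok in enumerate(tokens):
        matrix[vocab[tok]][pos] = 1
    return matrix


if __name__ == "__main__":
    drugs = ["CCO", "c1ccccc1", "ClCCl"]  # toy molecules, not GDSC drugs
    vocab = build_vocab(drugs)
    for row in one_hot_encode("CCO", vocab, max_len=8):
        print(row)
```

Padding every drug to the same `max_len` gives all drugs an input of identical shape, which is what a convolutional layer with fixed input dimensions would need.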

Highlights

Understanding the phenotypic drug response on cancer cell lines plays a vital role in anti-cancer drug discovery and re-purposing
The performance of our model, tCNNS, is demonstrated under various data input settings.
The drug-cell line interaction pairs are divided into a training set, a validation set, and a testing set. tCNNS is trained on the training set, and the result on the testing set is reported.
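The difference between dividing the data by interaction pairs and dividing it exclusively by drugs (the blind test mentioned in the abstract) can be sketched as follows. This is a minimal illustration, not the authors' procedure: the tuple layout, the function names, the fractions, and the seeds are assumptions, and only a train/test division is shown; a validation set could be carved out of the training part in the same way.

```python
import random
from typing import List, Tuple

# Hypothetical record layout: (drug_id, cell_line_id, measured_response)
Pair = Tuple[str, str, float]


def split_by_pairs(pairs: List[Pair], test_fraction: float, seed: int = 0):
    """Shuffle interaction pairs and split them.

    The same drug (or cell line) may appear in both training and testing.
    """
    rng = random.Random(seed)
    shuffled = pairs[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def split_by_drugs(pairs: List[Pair], test_fraction: float, seed: int = 0):
    """Hold out whole drugs.

    No drug in the testing set is ever seen during training.
    """
    rng = random.Random(seed)
    drugs = sorted({p[0] for p in pairs})
    rng.shuffle(drugs)
    n_test = max(1, int(len(drugs) * test_fraction))
    held_out = set(drugs[:n_test])
    train = [p for p in pairs if p[0] not in held_out]
    test = [p for p in pairs if p[0] in held_out]
    return train, test


if __name__ == "__main__":
    # Toy data: 4 drugs x 5 cell lines with placeholder responses.
    toy = [(f"drug{d}", f"cell{c}", 0.0) for d in range(4) for c in range(5)]
    train, test = split_by_drugs(toy, test_fraction=0.25)
    print(len(train), len(test))
```

Splitting exclusively by cell lines would work the same way with `p[1]` in place of `p[0]`. The drug-exclusive split is the harder setting because the model has to generalize to molecules it has never seen.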

Summary

Introduction

Understanding the phenotypic drug response on cancer cell lines plays a vital role in anti-cancer drug discovery and re-purposing. In an earlier study [2], the authors used a neural network to analyze the response of cancer cell lines to drugs on the GDSC dataset. Their main result was a coefficient of determination of 0.72 and a Pearson correlation of 0.85. Two further studies followed: the first used kernelized Bayesian matrix factorization to conduct QSAR analysis on cancer cell lines and anti-cancer drugs, and the second used multivariate random forests. Neither matched the results in [2], which is therefore chosen as the baseline for our work.
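The two metrics quoted here can be computed as in the sketch below. It assumes the common definitions (coefficient of determination as 1 minus the ratio of residual to total sum of squares, and the sample Pearson correlation); the paper may define or aggregate them differently, and the function names are illustrative.

```python
import math
from typing import Sequence


def r_squared(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation: covariance over the product of standard deviations."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    observed = [1.0, 2.0, 3.0, 4.0]
    predicted = [2.0, 3.0, 4.0, 5.0]  # correct trend, constant offset of 1
    print(pearson_r(observed, predicted))  # correlation ignores the offset
    print(r_squared(observed, predicted))  # R2 penalizes the offset
```

The toy example shows why both numbers are reported: predictions that follow the right trend but are systematically shifted keep a perfect correlation while the coefficient of determination falls.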

Journal: BMC Bioinformatics	Publication Date: Jul 29, 2019
Citations: 107	License type: open-access
