Feature selection strategies for drug sensitivity prediction

Krzysztof Koras, Dilafruz Juraeva, Julian Kreis, Johanna Mazur, Eike Staub, Ewa Szczurek

doi:10.1038/s41598-020-65927-9


Open Access

https://doi.org/10.1038/s41598-020-65927-9


Journal: Scientific reports	Publication Date: Jun 10, 2020
Citations: 36	License type: open-access

Affiliation: University of Warsaw, Merck (Germany)

Abstract

Drug sensitivity prediction constitutes one of the main challenges in personalized medicine. Critically, the sensitivity of cancer cells to treatment depends on an unknown subset of a large number of biological features. Here, we compare standard, data-driven feature selection approaches to feature selection driven by prior knowledge of drug targets, target pathways, and gene expression signatures. We assess these methodologies on the Genomics of Drug Sensitivity in Cancer (GDSC) dataset, evaluating 2484 unique models. For 23 drugs, better predictive performance is achieved when the features are selected according to prior knowledge of drug targets and pathways. The best correlation of observed and predicted response using the test set is achieved for Linifanib (r = 0.75). Extending the drug-dependent features with gene expression signatures yields the most predictive models for 60 drugs, with Dabrafenib as the best-performing example. For many compounds, even a very small subset of drug-related features is highly predictive of drug sensitivity. Small feature sets selected using prior knowledge are more predictive for drugs targeting specific genes and pathways, while models with wider feature sets perform better for drugs affecting general cellular mechanisms. Appropriate feature selection strategies facilitate the development of interpretable models that are indicative for therapy design.
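
The abstract reports model quality as the correlation between observed and predicted response on a test set. A minimal sketch of that metric follows, assuming that r denotes the Pearson coefficient; the function name and the response values are hypothetical placeholders, not data from the paper.

```python
import math


def pearson_r(observed, predicted):
    """Pearson correlation between observed and predicted drug response."""
    n = len(observed)
    if n != len(predicted) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 values")
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    # Sums of cross-products and squared deviations from the means.
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_o == 0 or var_p == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_o * var_p)


# Hypothetical held-out responses and model predictions (illustration only).
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 1.8, 3.2, 3.9]
r = pearson_r(observed, predicted)
print(f"test-set correlation r = {r:.3f}")
```

Computing the coefficient on held-out cell lines only, as the abstract describes, keeps the score from rewarding models that merely memorize the training data.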

Highlights

Drug sensitivity prediction constitutes one of the main challenges in personalized medicine
We employed each of the feature selection approaches, which can be divided into two categories: biologically driven and automatic, data-driven selection methods
We considered the union of the direct target genes and the drug’s target pathway genes
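
The knowledge-driven selection named in the last highlight, the union of direct target genes and target pathway genes, can be sketched as below. This is only an illustration under assumptions: the gene lists, the expression values, and both helper functions are hypothetical and are not taken from the paper or from GDSC.

```python
def knowledge_driven_features(direct_targets, pathway_genes, available_genes):
    """Union of target and pathway genes, restricted to genes that were measured."""
    selected = set(direct_targets) | set(pathway_genes)
    # Preserve the column order of the expression matrix.
    return [gene for gene in available_genes if gene in selected]


def subset_columns(expression, available_genes, selected_genes):
    """Keep only the columns of `expression` that belong to `selected_genes`."""
    idx = [available_genes.index(gene) for gene in selected_genes]
    return [[row[i] for i in idx] for row in expression]


# Hypothetical inputs: column names of an expression matrix and two gene lists.
available_genes = ["BRAF", "EGFR", "MAP2K1", "MAPK1", "TP53"]
direct_targets = ["BRAF"]
pathway_genes = ["MAP2K1", "MAPK1", "RAF1"]  # RAF1 is not among the measured genes

# Rows are cell lines, columns follow `available_genes` (made-up values).
expression = [
    [2.1, 0.3, 1.7, 1.2, 0.9],
    [0.4, 1.8, 0.5, 0.7, 1.1],
]

selected = knowledge_driven_features(direct_targets, pathway_genes, available_genes)
reduced = subset_columns(expression, available_genes, selected)
print(selected)  # ['BRAF', 'MAP2K1', 'MAPK1']
print(reduced)
```

Genes that appear in the prior-knowledge lists but were not measured are silently dropped here; a real pipeline might prefer to report them.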

Summary

Introduction

Drug sensitivity prediction constitutes one of the main challenges in personalized medicine. We compare standard, data-driven feature selection approaches to feature selection driven by prior knowledge of drug targets, target pathways, and gene expression signatures. We assess these methodologies on the Genomics of Drug Sensitivity in Cancer (GDSC) dataset, evaluating 2484 unique models. A multi-task learning approach based on a Bayesian model for collaborative filtering was proposed[23], which allows for identifying general interactions between features of the drugs and features of the cell lines. It gives insights in the form of “activation of pathway Y will confer sensitivity to any drug targeting protein X”. Stability selection was proposed to mitigate this problem when regularized regression is applied[27], but it still comes without the guarantee of choosing the most biologically relevant predictive features.
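
The resampling idea behind stability selection can be sketched as follows: repeat a feature selection step on random half-samples and record how often each feature is chosen. Note the assumptions: to stay dependency-free, a simple top-k absolute-correlation filter is swapped in for the regularized regression mentioned above, the data are synthetic, and all names are hypothetical.

```python
import math
import random


def abs_pearson(x, y):
    """Absolute Pearson correlation; returns 0.0 if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return abs(cov) / math.sqrt(vx * vy)


def selection_frequencies(X, y, k, n_rounds=100, seed=0):
    """Fraction of random half-samples in which each feature ranks in the top k."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    counts = [0] * p
    for _ in range(n_rounds):
        idx = rng.sample(range(n), n // 2)  # subsample without replacement
        y_sub = [y[i] for i in idx]
        scores = [abs_pearson([X[i][j] for i in idx], y_sub) for j in range(p)]
        top_k = sorted(range(p), key=lambda j: scores[j], reverse=True)[:k]
        for j in top_k:
            counts[j] += 1
    return [c / n_rounds for c in counts]


# Synthetic data: only feature 0 drives the response (illustration only).
data_rng = random.Random(1)
n_samples, n_features = 40, 5
X = [[data_rng.gauss(0, 1) for _ in range(n_features)] for _ in range(n_samples)]
y = [2.0 * row[0] + data_rng.gauss(0, 0.1) for row in X]

freqs = selection_frequencies(X, y, k=2, n_rounds=50, seed=0)
print(freqs)
```

Features with a selection frequency close to 1 are stable across subsamples. As the text notes, such stability does not by itself guarantee biological relevance.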



Similar Papers

Abstract 2206: Genomics of Drug Sensitivity in Cancer (GDSC): A resource for therapeutic biomarker discovery in cancer cells.
Wanjuan Yang ... Ramaswamy Sridhar
15 Apr 2013
Cancer Research | VOL. 73

Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells
Wanjuan Yang ... Simon Forbes
22 Nov 2012
Nucleic Acids Research | VOL. 41

Abstract P1-07-04: Unique overlapping subtypes of triple-negative breast and ovarian cancers and sensitivity of “mesenchymal-like” cancers to HSP90 inhibition is revealed by integrated gene expression and drug sensitivity profiling
K Shee ... Tw Miller
14 Feb 2017
Cancer Research | VOL. 77

Ensembled machine learning framework for drug sensitivity prediction
Aman Sharma ... Rinkle Rani
01 Feb 2020
IET Systems Biology | VOL. 14
