Abstract

Basal gene expression levels have been shown to be predictive of cellular response to cytotoxic treatments. However, such analyses do not fully reveal complex genotype- phenotype relationships, which are partly encoded in highly interconnected molecular networks. Biological pathways provide a complementary way of understanding drug response variation among individuals. In this study, we integrate chemosensitivity data from a large-scale pharmacogenomics study with basal gene expression data from the CCLE project and prior knowledge of molecular networks to identify specific pathways mediating chemical response. We first develop a computational method called PACER, which ranks pathways for enrichment in a given set of genes using a novel network embedding method. It examines a molecular network that encodes known gene-gene as well as gene-pathway relationships, and determines a vector representation of each gene and pathway in the same low-dimensional vector space. The relevance of a pathway to the given gene set is then captured by the similarity between the pathway vector and gene vectors. To apply this approach to chemosensitivity data, we identify genes whose basal expression levels in a panel of cell lines are correlated with cytotoxic response to a compound, and then rank pathways for relevance to these response-correlated genes using PACER. Extensive evaluation of this approach on benchmarks constructed from databases of compound target genes and large collections of drug response signatures demonstrates its advantages in identifying compound-pathway associations compared to existing statistical methods of pathway enrichment analysis. The associations identified by PACER can serve as testable hypotheses on chemosensitivity pathways and help further study the mechanisms of action of specific cytotoxic drugs. More broadly, PACER represents a novel technique of identifying enriched properties of any gene set of interest while also taking into account networks of known gene-gene relationships and interactions.

Highlights

  • Large-scale cancer genomics projects, such as the Cancer Genome Atlas [1], the Cancer Genome project [2], and the Cancer Cell Line Encyclopedia project [3], and cancer pharmacology projects, such as the Genomics of Drug Sensitivity in Cancer project [2], have generated a large volume of genomics and pharmacological profiling data

  • Gene expression levels have been used to study the cellular response to drug treatments

  • We showed that using the prior knowledge encoded in molecular networks substantially improves pathway identification

Read more

Summary

Introduction

Large-scale cancer genomics projects, such as the Cancer Genome Atlas [1], the Cancer Genome project [2], and the Cancer Cell Line Encyclopedia project [3], and cancer pharmacology projects, such as the Genomics of Drug Sensitivity in Cancer project [2], have generated a large volume of genomics and pharmacological profiling data. There is an unprecedented opportunity to link pharmacological and genomic data to identify therapeutic biomarkers [4,5,6] In pursuit of this vision, significant efforts have been invested in identifying the genetic basis of drug response variation among individual patients. While significant efforts have focused on specific genes that interact with compounds and confer observed cellular phenotypes, there has been relatively little progress in studying the synergistic effects of genes. These effects are key factors in comprehensively deciphering the mechanisms of action of compounds and understanding complex phenotypes [8]. The associated pathways for certain drugs have been studied experimentally [10,11,12], in vitro pathway analysis is costly and inherently difficult, making it hard to scale to hundreds of compounds

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call