Abstract

BackgroundGenome-wide libraries of yeast deletion strains have been used to screen for genes that drive phenotypes such as stress response. A surprising observation emerging from these studies is that the genes with the largest changes in mRNA expression during a state transition are not those that drive that transition. Here, we show that integrating gene expression data with context-independent protein interaction networks can help prioritize master regulators that drive biological phenotypes.ResultsGenes essential for survival had previously been shown to exhibit high centrality in protein interaction networks. However, the set of genes that drive growth in any specific condition is highly context-dependent. We inferred regulatory networks from gene expression data and transcription factor binding motifs in Saccharomyces cerevisiae, and found that high-degree nodes in regulatory networks are enriched for transcription factors that drive the corresponding phenotypes. We then found that using a metric combining protein interaction and transcriptional networks improved the enrichment for drivers in many of the contexts we examined. We applied this principle to a dataset of gene expression in normal human fibroblasts expressing a panel of viral oncogenes. We integrated regulatory interactions inferred from this data with a database of yeast two-hybrid protein interactions and ranked 571 human transcription factors by their combined network score. The ranked list was significantly enriched in known cancer genes that could not be found by standard differential expression or enrichment analyses.ConclusionsThere has been increasing recognition that network-based approaches can provide insight into critical cellular elements that help define phenotypic state. Our analysis suggests that no one network, based on a single data type, captures the full spectrum of interactions. Greater insight can instead be gained by exploring multiple independent networks and by choosing an appropriate metric on each network. Moreover we can improve our ability to rank phenotypic drivers by combining the information from individual networks. We propose that such integrative network analysis could be used to combine clinical gene expression data with interaction databases to prioritize patient- and disease-specific therapeutic targets.Electronic supplementary materialThe online version of this article (doi:10.1186/s12918-015-0228-1) contains supplementary material, which is available to authorized users.

Highlights

  • Genome-wide libraries of yeast deletion strains have been used to screen for genes that drive phenotypes such as stress response

  • For each of the 79 strains that overlapped between the gene expression and growth rate datasets, we computed a score by taking the average of the growth rates under the seven rapamycin-related conditions reported in [24]

  • We found that incorporating the protein interactor information resulted in a significant improvement in enrichment among the top differentially expressed Transcription factor (TF)

Read more

Summary

Introduction

Genome-wide libraries of yeast deletion strains have been used to screen for genes that drive phenotypes such as stress response. We show that integrating gene expression data with context-independent protein interaction networks can help prioritize master regulators that drive biological phenotypes. The sequencing of the yeast genome in the 1990s provided a catalog of genes, most of which were of unknown function. To functionally characterize the many newly discovered protein-coding genes, a genome-wide library was constructed of single gene deleted yeast. Padi and Quackenbush BMC Systems Biology (2015) 9:80 perturbation are not the same genes that drive phenotypic changes in response to that perturbation. Characterizing multiple genetic perturbations would become exponentially complicated. As a consequence, such screens often fail to provide insight into why some genes are important and others are not. What is needed is a principled way to build models of driver genes in the context of their biological interactions

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.