Abstract

BackgroundMotif enrichment analysis of transcription factor ChIP-seq data can help identify transcription factors that cooperate or compete. Previously, little attention has been given to comparative motif enrichment analysis of pairs of ChIP-seq experiments, where the binding of the same transcription factor is assayed under different conditions. Such comparative analysis could potentially identify the distinct regulatory partners/competitors of the assayed transcription factor under different conditions or at different stages of development.ResultsWe describe a new methodology for identifying sequence motifs that are differentially enriched in one set of DNA or RNA sequences relative to another set, and apply it to paired ChIP-seq experiments. We show that, using paired ChIP-seq data for a single transcription factor, differential motif enrichment analysis identifies all the known key transcription factors involved in the transformation of non-cancerous immortalized breast cells (MCF10A-ER-Src cells) into cancer stem cells whereas non-differential motif enrichment analysis does not. We also show that differential motif enrichment analysis identifies regulatory motifs that are significantly enriched at constrained locations within the bound promoters, and that these motifs are not identified by non-differential motif enrichment analysis. Our methodology differs from other approaches in that it leverages both comparative enrichment and positional enrichment of motifs in ChIP-seq peak regions or in the promoters of genes bound by the transcription factor.ConclusionsWe show that differential motif enrichment analysis of paired ChIP-seq experiments offers biological insights not available from non-differential analysis. In contrast to previous approaches, our method detects motifs that are enriched in a constrained region in one set of sequences, but not enriched in the same region in the comparative set. We have enhanced the web-based CentriMo algorithm to allow it to perform the constrained differential motif enrichment analysis described in this paper, and CentriMo’s on-line interface (http://meme.ebi.edu.au) provides dozens of databases of DNA- and RNA-binding motifs from a full range of organisms. All data and output files presented here are available at http://research.imb.uq.edu.au/t.bailey/supplementary_data/Lesluyes2014.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-752) contains supplementary material, which is available to authorized users.

Highlights

  • motif for the ChIP-ed TF (Motif) enrichment analysis of transcription factor ChIP-ed transcription factor (ChIP)-seq data can help identify transcription factors that cooperate or compete

  • Given two sets of ChIP-seq peak regions for TF X from experiments A and B, known motifs differentially enriched in set A relative to set B may indicate that X is coregulating some of its targets in conjunction with different TFs in the two experiments

  • Using this new feature of the CentriMo algorithm, we showed that the differential analysis of ChIP-seq peaks for a single transcription factor under two different cellular conditions identifies several other transcription factors with pivotal roles in the distinguishing the two cellular states

Read more

Summary

Introduction

Motif enrichment analysis of transcription factor ChIP-seq data can help identify transcription factors that cooperate or compete. Advances in protein-binding microarrays and high-throughput variants of SELEX have recently been used to produce large compendia of both DNA [2,3,4] and RNA motifs [5] These two threads of technological advancement provide the necessary inputs for very productive analyses of the regulatory roles of sequence motifs associated with particular DNA- or RNA-binding molecules. Restricting attention to the curated set of motifs increases statistical power, allowing more subtle motif enrichments to be detected This latter advantage is a consequence of the huge number of possible sequence motifs that de novo motif discovery must consider

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call