Abstract

Deciphering the specific contribution of individual motifs within cis-regulatory modules (CRMs) is crucial to understanding how gene expression is regulated and how this process is affected by sequence variation. But despite vast improvements in the ability to identify where transcription factors (TFs) bind throughout the genome, we are limited in our ability to relate information on motif occupancy to function from sequence alone. Here, we engineered 63 synthetic CRMs to systematically assess the relationship between variation in the content and spacing of motifs within CRMs to CRM activity during development using Drosophila transgenic embryos. In over half the cases, very simple elements containing only one or two types of TF binding motifs were capable of driving specific spatio-temporal patterns during development. Different motif organizations provide different degrees of robustness to enhancer activity, ranging from binary on-off responses to more subtle effects including embryo-to-embryo and within-embryo variation. By quantifying the effects of subtle changes in motif organization, we were able to model biophysical rules that explain CRM behavior and may contribute to the spatial positioning of CRM activity in vivo. For the same enhancer, the effects of small differences in motif positions varied in developmentally related tissues, suggesting that gene expression may be more susceptible to sequence variation in one tissue compared to another. This result has important implications for human eQTL studies in which many associated mutations are found in cis-regulatory regions, though the mechanism for how they affect tissue-specific gene expression is often not understood.

Highlights

  • Gene expression is initiated by the binding of transcription factors (TFs) to cis-regulatory modules (CRMs) such as enhancer elements, which give rise to specific patterns of temporal and spatial activity [1]

  • Transcription is initiated through the binding of transcription factors (TFs) to specific motifs that are dispersed throughout the genome

  • While we cannot change this inherent property of TFs, we did try to increase the potential specificity of the motif used here for the particular TFs we are interested in by using motifs derived from in vivo occupancy data for nine of the ten factors [2,7]

Read more

Summary

Introduction

Gene expression is initiated by the binding of transcription factors (TFs) to cis-regulatory modules (CRMs) such as enhancer elements, which give rise to specific patterns of temporal and spatial activity [1]. Recent years have seen a dramatic increase in the ability to identify the location of regulatory elements using genome-wide information on TF occupancy [2,3,4,5,6,7], cofactor recruitment [8,9] and chromatin modifications [10,11,12,13,14,15] These studies have identified thousands to hundreds-of-thousands of regulatory elements that could be potentially active at a given time and in a given cell-type during development. The perimeter of the segmented regions was calculated (using Matlab function bwperim) and overlaid onto the CRM expression data

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call