Abstract

Background

Previous studies have shown that CpG dinucleotides are enriched in a subset of promoters and that the CpG content of promoters is positively correlated with gene expression levels. However, the relationship between divergence of CpG content and gene expression evolution has not been investigated. Here we calculate the normalized CpG (nCpG) content in DNA regions around the transcription start site (TSS) and transcription terminal site (TTS) of genes in nine organisms, and relate it to expression levels measured by RNA-seq.

Results

The nCpG content of the TSS shows a bimodal distribution in all organisms except platypus, whereas the nCpG content of the TTS has only a single peak. When nCpG contents are compared between different organisms, we observe different evolution patterns for the TSS and the TTS: compared with the TTS, the TSS exhibits a faster divergence rate between closely related species but is more conserved between distant species. More importantly, we demonstrate the link between gene expression evolution and nCpG content changes: up- or down-regulation of genes in an organism is accompanied by an increase or decrease, respectively, of the nCpG content in their TSS and TTS proximal regions.

Conclusions

Our results suggest that gene expression changes between different organisms are correlated with alterations in the normalized CpG content of promoters. Our analyses provide evidence for the impact of nCpG content on gene expression evolution.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-693) contains supplementary material, which is available to authorized users.

Highlights

  • Previous studies have shown that CpG dinucleotides are enriched in a subset of promoters and the CpG content of promoters is positively correlated with gene expression levels

  • We found a high correlation between the normalized CpG (nCpG) content of promoters and the expression level of transcription start sites (TSSs) quantified by Cap Analysis of Gene Expression (CAGE) in human cell lines [18]

  • We investigate the nCpG contents of all promoters (3 kb centered on the TSS) in nine vertebrate species


Summary

Introduction

Previous studies have shown that CpG dinucleotides are enriched in a subset of promoters and that the CpG content of promoters is positively correlated with gene expression levels. There is no satisfactory way to associate CpG islands (CGIs) with genes. To address this issue in the context of promoter studies, Saxonov et al. defined a metric called normalized CpG (nCpG) content: the ratio of the observed number of CpG dinucleotides to the expected number within a 3 kb region around the TSS of genes [13]. They found that human promoters displayed a bimodal distribution in their nCpG content and could be divided into two classes: high CpG promoters (HCPs) and low CpG promoters (LCPs).
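The nCpG definition above can be illustrated with a minimal Python sketch. The text here does not spell out how the expected CpG count is obtained, so the sketch assumes one common convention: the expected CpG frequency is ((fraction of C + G) / 2)^2, scaled by the number of dinucleotide positions in the window. The function names `promoter_window` and `ncpg`, the 0-based TSS coordinate, and the symmetric 1.5 kb flank are illustrative assumptions, not the authors' code.

```python
def promoter_window(chrom_seq: str, tss: int, flank: int = 1500) -> str:
    """Return the window of up to 2 * flank bases centered on a 0-based TSS index.

    The window is clipped at the sequence ends (hypothetical helper).
    """
    start = max(0, tss - flank)
    end = min(len(chrom_seq), tss + flank)
    return chrom_seq[start:end]


def ncpg(seq: str) -> float:
    """Observed/expected CpG ratio for one sequence window.

    Assumption: expected CpG count = ((C + G fraction) / 2) ** 2 * (n - 1),
    where n - 1 is the number of dinucleotide positions in the window.
    """
    s = seq.upper()
    n = len(s)
    if n < 2:
        return 0.0
    # "CG" cannot overlap itself, so str.count gives the exact dinucleotide count.
    observed = s.count("CG")
    gc_fraction = (s.count("C") + s.count("G")) / n
    expected = (gc_fraction / 2) ** 2 * (n - 1)
    # Windows with no C or G have no expected CpGs; report 0.0 instead of dividing by zero.
    return observed / expected if expected > 0 else 0.0


if __name__ == "__main__":
    toy_chromosome = "AT" * 20 + "CG" * 10 + "AT" * 20  # toy sequence, not real data
    window = promoter_window(toy_chromosome, tss=50, flank=15)
    print(len(window), ncpg(window))
```

Applying `ncpg` to the 3 kb window of every annotated promoter and plotting a histogram of the values is one way to look for the bimodal HCP/LCP pattern described above; any cutoff separating the two classes would have to be chosen from that distribution.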
