Abstract

Increasing knowledge of chromatin structure in various cell types raises the challenge of deciphering the contribution of epigenetic modifications to the regulation of nuclear functions in mammals. In a recent study, we have analysed the genome-wide distributions of thirteen epigenetic marks in the human cell line K562 at 100 kb resolution of Mean Replication Timing (MRT) data. Using classical clustering techniques, we have shown that the combinatorial complexity of these epigenetic data can be reduced to four predominant chromatin states that replicate at different periods of the S-phase. C1 is an early replicating transcriptionally active euchromatin state, C2 a mid-S repressive type of chromatin associated with Polycomb complexes, C3 a silent chromatin with lack of chromatin marks that replicates later than C2 but before C4, a HP1-associated heterochromatin state that replicates at the end of S-phase. These four chromatin states display remarkable similarities with those recently reported in fly, worm and plants at higher ∼ 1 kb resolution of gene expression data. Here, we extend our integrative analysis of epigenetic data in the K562 human cell line to this smaller scale by focusing on gene promoters (±3 kb around transcription start sites). We show that these promoters can similarly be classified into four main chromatin states: P1 regroups all the marks of transcriptionally active chromatin and corresponds to CpG rich promoters of highly expressed genes; P2 is notably associated with the histone modification H3K27me3 that is the mark of a polycomb repressed chromatin state; P3 corresponds to promoters that are not enriched for any available marks as the signature of a ‘null’ or ‘black’ silent heterochromatin state and P4 characterizes the few gene promoters that contain only the constitutive heterochromatin histone modification H3K9me3. When investigating the coherence between promoter activity (P1, P2, P3 or P4) and the large-scale chromatin environment (C1, C2, C3 or C4), we find that the higher the gene density in a considered 100 kb-window, the higher (resp. the lower) the probability of a P1 active promoter (resp. silent P2, P3 and P4 promoters) to be surrounded by an open euchromatin C1 (resp. facultative C2, black C3 or HP1-associated C4 heterochromatin) environment. From large to small scales, it is mainly C4 and to a lesser extent C3 heterochromatin environments both corresponding to gene poor regions, that strongly conditions promoters to belong to the inactive P3 and P4 classes. If C1 (resp. C2) environment surrounds a majority of corresponding active P1 (resp. P2) promoters, it also contains a non-negligible proportion of inactive P2 and P3 (resp. active P1 and inactive P3) promoters. When further investigating the large-scale organization of human genes with respect to ‘master’ replication origins that were shown to border megabase-sized U-shaped MRT domains, we reveal some significant enrichment of highly expressed P1 genes in a closed neighbourhood of these early initiation zones consistently with the gradient of chromatin states observed from C1 at U-domain borders followed by C2, C3 and C4 at U-domain centers. On the contrary to P2 promoters that are mainly found in the C2 environment at finite distance (∼200–300 kb) from U-domain borders, the inactive P3 and P4 promoters are distributed rather homogeneously inside U-domains. The generalization of our study to different cell types including ES, somatic and cancer cells is likely to provide new insight on the global reorganization of replication domains during differentiation (or disease) in relation to coordinated changes in chromatin environment and gene expression.

Highlights

  • Our goal is to extend our integrative analysis of epigenetic data in the K562 human cell line from the 100 kb scale of Mean Replication Timing (MRT) data to a few kb scale characteristic of gene promoters as previously performed in plants (Roudier et al 2011), worm (Liu et al 2011) and fly (Filion et al 2010; Sexton et al 2012) by investigating the coherence between the chromatin states obtained at these two scales, we will be in a position to study to what extent the promoter activity does condition its large-scale chromatin environment and vice versa

  • We perform a combinatorial analysis of chromatin marks in K562 and we describe the epigenetic content of the four prevalent chromatin states at gene promoters

  • We study the coherence between promoter activity, as characterized by their ‘small-scale’ chromatin state, and the ‘large-scale’ chromatin environment (namely the C1, C2, C3 and C4 chromatin states found in Julienne et al (2013))

Read more

Summary

Introduction

It is increasingly recognized that the dynamics of DNA folding and unfolding within the nucleus of eukaryotic cells plays a major role in the regulation of nuclear functions including gene expression and DNA replication (Belmont et al 1999; Cook 1999, 2001; Cremer & Cremer 2001; Berezney 2002; Chakalova et al 2005; Gilbert et al 2005; Branco & Pombo 2007; Fraser & Bickmore 2007; Kouzarides 2007; Misteli 2007; Sexton et al 2007; Gilbert 2010; Maric & Prioleau 2010; Arneodo et al 2011; Zhou et al 2011; Bickmore & van Steensel 2013). This study reveals that the huge combinatorial epigenetic complexity can be reduced to a rather small number of predominant chromatin states that interestingly share strong similarities with the ones previously found in Arabidopsis thaliana (Roudier et al 2011), Caenorhabditis elegans (Liu et al 2011) and Drosophila (Filion et al 2010; Sexton et al 2012) These four main chromatin states were further shown to correlate with MRT, namely from early to late replicating, a transcriptionally active euchromatin state (C1) enriched in insulator binding protein CTCF, a polycomb repressed facultative heterochromatin state (C2), a silent heterochromatin state (C3) not enriched in any available marks and a HP1-associated heterochromatin state (C4). In the final section, by discussing some perspectives for further studies in different cell types, in other mammalian genomes in both health and disease

Annotation and expression data
Read density computation around promoters
Rank transformation and Spearman correlation matrix
Principal component analysis
Definition of promoter chromatin states
Combinatorial analysis of chromatin marks at human gene promoters
Principal promoter chromatin states
Epigenetic content of the four prevalent promoter chromatin states
Gene expression
Findings
Repartition of promoter chromatin states along human chromosomes
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.