Abstract

Characterization of distinct histone methylation and acetylation binding patterns in promoters and prediction of novel regulatory regions remains an important area of genomic research, as it is hypothesized that distinct chromatin signatures may specify unique genomic functions. However, methods that have been proposed in the literature are either descriptive in nature or are fully parametric and hence more restrictive in pattern discovery. In this article, we propose a two-step non-parametric statistical inference procedure to characterize unique histone modification patterns and apply it to analyzing the binding patterns of four histone marks, H3K4me2, H3K4me3, H3K9ac, and H4K20me1, in human B-lymphoblastoid cells. In the first step, we used a functional principal component analysis method to represent the concatenated binding patterns of these four histone marks around the transcription start sites as smooth curves. In the second step, we clustered these curves to reveal several unique classes of binding patterns. These uncovered patterns were used in turn to scan the whole-genome to predict novel and alternative promoters. Our analyses show that there are three distinct promoter binding patterns of active genes. Further, 19654 regions not within known gene promoters were found to overlap with human ESTs, CpG islands, or common SNPs, indicative of their potential role in gene regulation, including being potential novel promoter regions.

Highlights

  • The importance of characterizing histone modification patterns in the promoter regions has been elucidated in Heintzman et al [1] and other research after this seminal publication [2,3,4,5,6,7]

  • For dimension reduction to better handle noise, we chose seven functional principal components (i.e. q = 7) that account for 90% of the variance in the data

  • After the intensity count data over all four histones are represented as smooth curves using functional principal component analysis (FPCA), we clustered them to uncover the underlying histone modification patterns

Read more

Summary

Introduction

The importance of characterizing histone modification patterns in the promoter regions has been elucidated in Heintzman et al [1] and other research after this seminal publication [2,3,4,5,6,7]. Found in the cell nucleus, on which DNA is wrapped around; they are epigenetic marks that play an important role in gene expression regulation. A common form of histone modification is histone methylation, in which one, two, or three methyl groups may be transferred to lysine. The mono-, di-, ad tri-methylation of lysine 4 on histone H3, denoted as H3K4me, H3K4me, and H3K4me, are the addition of one, two, or three. Characterization of histone modification patterns and prediction of novel promoters analysis, decision to publish, or preparation of the manuscript. There was no additional external funding received for this study

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call