Combinatorial epigenetic patterns as quantitative predictors of chromatin biology

Marcin Cieślik,Stefan Bekiranov

doi:10.1186/1471-2164-15-76

Abstract

BackgroundChromatin immunoprecipitation followed by deep sequencing (ChIP-seq) is the most widely used method for characterizing the epigenetic states of chromatin on a genomic scale. With the recent availability of large genome-wide data sets, often comprising several epigenetic marks, novel approaches are required to explore functionally relevant interactions between histone modifications. Computational discovery of "chromatin states" defined by such combinatorial interactions enabled descriptive annotations of genomes, but more quantitative approaches are needed to progress towards predictive models.ResultsWe propose non-negative matrix factorization (NMF) as a new unsupervised method to discover combinatorial patterns of epigenetic marks that frequently co-occur in subsets of genomic regions. We show that this small set of combinatorial "codes" can be effectively displayed and interpreted. NMF codes enable dimensionality reduction and have desirable statistical properties for regression and classification tasks. We demonstrate the utility of codes in the quantitative prediction of Pol2-binding and the discrimination between Pol2-bound promoters and enhancers. Finally, we show that specific codes can be linked to molecular pathways and targets of pluripotency genes during differentiation.ConclusionsWe have introduced and evaluated a new computational approach to represent combinatorial patterns of epigenetic marks as quantitative variables suitable for predictive modeling and supervised machine learning. To foster widespread adoption of this method we make it available as an open-source software-package – epicode at https://github.com/mcieslik-mctp/epicode.

Highlights

Chromatin immunoprecipitation followed by deep sequencing (ChIP-seq) is the most widely used method for characterizing the epigenetic states of chromatin on a genomic scale
We have shown that negative matrix factorization (NMF) applied to epigenetic marks yield sparse codes with an important nesting property
Further we have demonstrated the benefits of using codes over individual marks in predictive modeling of Pol2binding

Summary

Introduction

Chromatin immunoprecipitation followed by deep sequencing (ChIP-seq) is the most widely used method for characterizing the epigenetic states of chromatin on a genomic scale. With the recent availability of large genome-wide data sets, often comprising several epigenetic marks, novel approaches are required to explore functionally relevant interactions between histone modifications. Chromatin immunoprecipitation followed by deep sequencing (ChIP-seq) is becoming the standard method for the genome-wide mapping of histone modifications and transcription factor (TF) binding sites [2]. Most of the existing analysis tools are focused on the delineation of enriched sites from a single sample with optional “input control” [4] For histone modifications this task becomes more challenging as their enrichments are often weaker and less localized. None of the standard peak-based method deals with multiple marks and the reconciliation of several sets of peaks is an added challenge [11,12,13]

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Genomics	Publication Date: Jan 1, 2014
Citations: 83	License type: cc-by

R Discovery Prime

R Discovery Prime

Combinatorial epigenetic patterns as quantitative predictors of chromatin biology

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics

Lead the way for us

Similar Papers

Quantitative Assessment of Chromatin Immunoprecipitation Grade Antibodies Directed against Histone Modifications Reveals Patterns of Co-occurring Marks on Histone Protein Molecules
Sally E Peach ... Namrata D Udeshi
Molecular & Cellular Proteomics | VOL. 11
Sally E Peach, et. al.Sally E Peach ... Namrata D Udeshi
21 Mar 2012
Molecular & Cellular Proteomics | VOL. 11

An Integrated Platform for Genome-wide Mapping of Chromatin States Using High-throughput ChIP-sequencing in Tumor Tissues
Ayush Raman ... Zhiyi Liu
Journal of Visualized Experiments | VOL. -
Ayush Raman, et. al.Ayush Raman ... Zhiyi Liu
05 Apr 2018
Journal of Visualized Experiments | VOL. -

An Integrated Platform for Genome-wide Mapping of Chromatin States Using High-throughput ChIP-sequencing in Tumor Tissues.
Christopher Terranova ... Kunal Rai
Journal of Visualized Experiments | VOL. 2018
Christopher Terranova, et. al.Christopher Terranova ... Kunal Rai
05 Apr 2018
Journal of Visualized Experiments | VOL. 2018

RiceENCODE: A comprehensive epigenomic database as a rice Encyclopedia of DNA Elements
Liang Xie ... Guoliang Li
Molecular Plant | VOL. 14
Liang Xie, et. al.Liang Xie ... Guoliang Li
27 Aug 2021
Molecular Plant | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Combinatorial epigenetic patterns as quantitative predictors of chromatin biology

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics