Detecting broad domains and narrow peaks in ChIP-seq data with hiddenDomains.

Joshua Starmer,Terry Magnuson

doi:10.1186/s12859-016-0991-z

Joshua Starmer, Terry Magnuson

Open Access

https://doi.org/10.1186/s12859-016-0991-z

Copy DOI

Journal: BMC Bioinformatics	Publication Date: Mar 24, 2016
Citations: 54	License type: CC BY 4.0

Affiliation: University of North Carolina at Chapel Hill

Abstract

BackgroundCorrectly identifying genomic regions enriched with histone modifications and transcription factors is key to understanding their regulatory and developmental roles. Conceptually, these regions are divided into two categories, narrow peaks and broad domains, and different algorithms are used to identify each one. Datasets that span these two categories are often analyzed with a single program for peak calling combined with an ad hoc method for domains.ResultsWe developed hiddenDomains, which identifies both peaks and domains, and compare it to the leading algorithms using H3K27me3, H3K36me3, GABP, ESR1 and FOXA ChIP-seq datasets. The output from the programs was compared to qPCR-validated enriched and depleted sites, predicted transcription factor binding sites, and highly-transcribed gene bodies. With every method, hiddenDomains, performed as well as, if not better than algorithms dedicated to a specific type of analysis.ConclusionshiddenDomains performs as well as the best domain and peak calling algorithms, making it ideal for analyzing ChIP-seq datasets, especially those that contain a mixture of peaks and domains.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-016-0991-z) contains supplementary material, which is available to authorized users.

Highlights

Identifying genomic regions enriched with histone modifications and transcription factors is key to understanding their regulatory and developmental roles
Using ChIP-seq datatsets for H3K27me3, GA-binding protein (GABP), Estrogen Receptor 1 (ESR1) and Forkhead Box A1 (FOXA1), we have shown that hiddenDomains’s sensitivities and specificities are among the best, if not better than, methods that are dedicated to identifying broad domains or narrow peaks
We have shown that a larger percentage of hiddenDomains’s GABP, ESR1 and FOXA1 results overlap predicted binding sites than any other method using the default bin size (1 kb) and much smaller, 212 and 200 bp, bin sizes

Summary

Introduction

Identifying genomic regions enriched with histone modifications and transcription factors is key to understanding their regulatory and developmental roles. These regions are divided into two categories, narrow peaks and broad domains, and different algorithms are used to identify each one. Datasets that span these two categories are often analyzed with a single program for peak calling combined with an ad hoc method for domains. ChIP-seq analysis algorithms have specialized in identifying one of two types of enrichment: broad domains (i.e. histone modifications that cover entire gene bodies) or narrow peaks (i.e. a transcription factor bound to an enhancer). A program that accurately identifies both broad domains and narrow peaks simultaneously would greatly simplify these analyses

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Detecting broad domains and narrow peaks in ChIP-seq data with hiddenDomains.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Broad-Enrich: functional interpretation of large sets of broad genomic regions.
Raymond G Cavalcante ... Laura J Scott
Bioinformatics | VOL. 30
Raymond G Cavalcante, et. al.Raymond G Cavalcante ... Laura J Scott
22 Aug 2014
Bioinformatics | VOL. 30

RECAP reveals the true statistical significance of ChIP-seq peak calls.
Justin G Chitpin ... Theodore J Perkins
Bioinformatics | VOL. 35
Justin G Chitpin, et. al.Justin G Chitpin ... Theodore J Perkins
01 Mar 2019
Bioinformatics | VOL. 35

OccuPeak: ChIP-Seq Peak Calling Based on Internal Background Modelling
Bouke A De Boer ... Vincent M Christoffels
PLoS ONE | VOL. 9
Bouke A De Boer, et. al.Bouke A De Boer ... Vincent M Christoffels
17 Jun 2014
PLoS ONE | VOL. 9

Accounting for GC-content bias reduces systematic errors and batch effects in ChIP-seq data.
Mingxiang Teng ... Rafael A Irizarry
Genome Research | VOL. 27
Mingxiang Teng, et. al.Mingxiang Teng ... Rafael A Irizarry
12 Oct 2017
Genome Research | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detecting broad domains and narrow peaks in ChIP-seq data with hiddenDomains.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics