Abstract

In recent years, epigenetic research has enjoyed explosive growth as high-throughput sequencing technologies become more accessible and affordable. However, this advancement has not been matched with similar progress in data analysis capabilities from the perspective of experimental biologists not versed in bioinformatic languages. For instance, chromatin immunoprecipitation followed by next-generation sequencing (ChIP-seq) is at present widely used to identify genomic loci of transcription factor binding and histone modifications. Basic ChIP-seq data analysis, including read mapping and peak calling, can be accomplished through several well-established tools, but more sophisticated analyzes aimed at comparing data derived from different conditions or experimental designs constitute a significant bottleneck. We reason that the implementation of a single comprehensive ChIP-seq analysis pipeline could be beneficial for many experimental (wet lab) researchers who would like to generate genomic data. Here we present ChIPdig, a stand-alone application with adjustable parameters designed to allow researchers to perform several analyzes, namely read mapping to a reference genome, peak calling, annotation of regions based on reference coordinates (e.g. transcription start and termination sites, exons, introns, and 5' and 3' untranslated regions), and generation of heatmaps and metaplots for visualizing coverage. Importantly, ChIPdig accepts multiple ChIP-seq datasets as input, allowing genome-wide differential enrichment analysis in regions of interest to be performed. ChIPdig is written in R and enables access to several existing and highly utilized packages through a simple user interface powered by the Shiny package. Here, we illustrate the utility and user-friendly features of ChIPdig by analyzing H3K36me3 and H3K4me3 ChIP-seq profiles generated by the modENCODE project as an example. ChIPdig offers a comprehensive and user-friendly pipeline for analysis of multiple sets of ChIP-seq data by both experimental and computational researchers. It is open source and available at https://github.com/rmesse/ChIPdig.

Highlights

  • Interactions between nuclear proteins and DNA are vital for cell and organism function

  • Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) is a powerful method for assessing such interactions and has been widely used in recent years to map the location of post-translationally modified histones, transcription factors, chromatin modifiers and other non-histone DNA-associated proteins in a genome-wide manner

  • In light of this progress, ChIP-seq data sets are continuously deposited in publicly-accessible databases, such as the National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (GEO) and the Encyclopedia of DNA elements (ENCODE) consortium portal[1]

Read more

Summary

31 Jul 2019 report report report

2. Hinrich Gronemeyer , University of Strasbourg, Illkirch, France Marco Antonio Mendoza-Parra, University of Evry-val-d’Essonne, University Paris-Saclay, Évry, France. Any reports and responses or comments on the article can be found at the end of the article. Keywords ChIP-seq, read mapping, peak calling, genomic region annotation, differential enrichment analysis, heatmaps, metaplots. This article is included in the RPackage gateway

Introduction
Methods
Conclusions
Langmead B
20. Gu Z: EnrichedHeatmap
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call