Heat*seq: an interactive web tool for high-throughput sequencing experiment comparison with public data.

Guillaume Devailly, Anagha Joshi, Anna Mantsoki

doi:10.1093/bioinformatics/btw407


Open Access

https://doi.org/10.1093/bioinformatics/btw407


Journal: Bioinformatics	Publication Date: Jul 4, 2016
Citations: 8	License type: CC BY 4.0

Affiliation: Roslin Institute, University of Edinburgh

Abstract

Summary: Better protocols and decreasing costs have made high-throughput sequencing experiments accessible even to small experimental laboratories. However, comparing one or a few experiments generated by an individual lab to the vast amount of relevant data freely available in the public domain can be limited by a lack of bioinformatics expertise. Though several tools, including genome browsers, allow such comparison at the single-gene level, they do not provide a genome-wide view. We developed Heat*seq, a web tool that allows genome-scale comparison of high-throughput experiments (chromatin immuno-precipitation followed by sequencing, RNA-sequencing and Cap Analysis of Gene Expression) provided by a user to the data in the public domain. Heat*seq currently contains over 12 000 experiments across diverse tissues and cell types in human, mouse and drosophila. Heat*seq displays interactive correlation heatmaps, with an ability to dynamically subset datasets to contextualize user experiments. High-quality figures and tables are produced and can be downloaded in multiple formats.

Availability and Implementation: Web application: http://www.heatstarseq.roslin.ed.ac.uk/. Source code: https://github.com/gdevailly.

Contact: Guillaume.Devailly@roslin.ed.ac.uk or Anagha.Joshi@roslin.ed.ac.uk

Supplementary information: Supplementary data are available at Bioinformatics online.

Highlights

High-throughput sequencing is becoming routine for many biological assays, including transcriptome analysis through RNA sequencing (RNA-seq) and identification of transcription factor (TF) binding sites through chromatin immuno-precipitation followed by sequencing (ChIP-seq).
Comparing an oestrogen receptor alpha (ERα) ChIP-seq experiment in MCF7 cells (Zhuang et al, 2015) with the ENCODE TFBS dataset, after sub-selecting the ENCODE ER ChIP-seq experiments, revealed that the binding pattern of ERα in MCF7 cells was more similar to its binding pattern in T-47D cells than in ECC-1 cells (Figure 1A).
MCF7 and T-47D were derived from mammary tumours, while ECC-1 is an endometrial cell line.

Summary

Introduction

High-throughput sequencing is becoming routine for many biological assays, including transcriptome analysis through RNA sequencing (RNA-seq) and identification of transcription factor (TF) binding sites through chromatin immuno-precipitation followed by sequencing (ChIP-seq). Collaborative projects such as Bgee (Bastian et al.), ENCODE (Bernstein et al, 2012), and Roadmap Epigenomics (Kundaje et al, 2015) have generated genome-wide datasets across hundreds of cell types or tissues. Although these large datasets are freely available in the public domain, the lack of computational tools accessible to experimental scientists with no or only elementary computational skills prevents the data from being used to their full potential for discovery. Heat*seq is an interactive web tool that allows users to contextualise their sequencing data with respect to vast amounts of public data in a few minutes, without requiring any programming skills.
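To make the idea of a correlation heatmap concrete, the following is a minimal sketch of how a user experiment could be placed among public experiments by pairwise correlation. It is an illustration only, not the Heat*seq implementation: the choice of Pearson correlation, the function names `pearson` and `correlation_matrix`, and the toy profiles are all assumptions made for this example. Each profile is assumed to hold one numeric value per gene, in the same gene order for every experiment.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return cov / math.sqrt(var_x * var_y)


def correlation_matrix(profiles):
    """Return (names, matrix) holding all pairwise Pearson correlations."""
    names = list(profiles)
    matrix = [[pearson(profiles[a], profiles[b]) for b in names] for a in names]
    return names, matrix


# Hypothetical toy data: one value per gene (for example, log expression).
public = {
    "public_A": [1.0, 2.0, 3.0, 4.0],
    "public_B": [4.0, 3.0, 2.0, 1.0],
}
user = {"user_sample": [1.1, 2.1, 2.9, 4.2]}

# The matrix below is what a heatmap would colour, cell by cell.
names, matrix = correlation_matrix({**public, **user})

# Rank the public experiments by similarity to the user experiment.
user_row = matrix[names.index("user_sample")]
ranked = sorted(
    ((name, user_row[i]) for i, name in enumerate(names) if name != "user_sample"),
    key=lambda pair: pair[1],
    reverse=True,
)
print(ranked)
```

In this sketch the ranked list plays the role of reading one row of the heatmap: the public experiment at the top is the one whose profile most resembles the user's. Sub-selecting experiments, as described for the tool, would correspond to filtering the `public` dictionary before building the matrix.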

Methods

Results

User data quality control

Cell context identification

New hypotheses by data integration

Public data assessment

Conclusion
