Abstract

BackgroundEpigenetic phenomena are crucial for explaining the phenotypic plasticity seen in the cells of different tissues, developmental stages and diseases, all holding the same DNA sequence. As technology is allowing to retrieve epigenetic information in a genome-wide fashion, massive epigenomic datasets are being accumulated in public repositories. New approaches are required to mine those data to extract useful knowledge. We present here an automatic approach for detecting genomic regions with epigenetic variation patterns across samples related to a grouping of these samples, as a way of detecting regions functionally associated to the phenomenon behind the classification.ResultsWe show that the regions automatically detected by the method in the whole human genome associated to three different classifications of a set of epigenomes (cancer vs. healthy, brain vs. other organs, and fetal vs. adult tissues) are enriched in genes associated to these processes.ConclusionsThe method is fully automatic and can exhaustively scan the whole human genome at any resolution using large collections of epigenomes as input, although it also produces good results with small datasets. Consequently, it will be valuable for obtaining functional information from the incoming epigenomic information as it continues to accumulate.

Highlights

  • Epigenetic phenomena are crucial for explaining the phenotypic plasticity seen in the cells of different tissues, developmental stages and diseases, all holding the same DNA sequence

  • The expression of the information encoded in the genome is determined by its DNA sequence but by many other factors that affect the complex process of gene expression [1]

  • We applied the procedure described in Methods to the three classification schemas in order to detect the genomic regions associated to the phenomena behind the classifications

Read more

Summary

Introduction

Epigenetic phenomena are crucial for explaining the phenotypic plasticity seen in the cells of different tissues, developmental stages and diseases, all holding the same DNA sequence. Different consortia at national and international levels were assembled with the objective of obtaining this genome-wide epigenomic information for hundreds of samples (“epigenomes”), including different organs, developmental states and cancer lines [3] Examples of such initiatives are ENCODE [9], BLUEPRINT [10] and Roadmap Epigenomics [11]. This massive epigenomic information is starting to be mined in a similar way as genomic data It is being used, alone or in combination with other evidences, to reconstruct the 3D structure of the chromatin [12,13,14], detect functionally related genes [15], interpreting the results of “genomewide association studies” (GWAS) [16] and non-coding variants in general [17]. This genome-wide epigenetic information is being used for epigenomic-GWAS

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call