Abstract

BackgroundCurrent technology allows rapid assessment of DNA sequences and methylation levels at a single-site resolution for hundreds of thousands of sites in the human genome, in thousands of individuals simultaneously. This has led to an increase in epigenome-wide association studies (EWAS) of complex traits, particularly those that are poorly explained by previous genome-wide association studies (GWAS). However, the genome and epigenome are intertwined, e.g., DNA methylation is known to affect gene expression through, for example, genomic imprinting. There is thus a need to go beyond single-omics data analyses and develop interaction models that allow a meaningful combination of information from EWAS and GWAS.ResultsWe present two new methods for genetic association analyses that treat offspring DNA methylation levels as environmental exposure. Our approach searches for statistical interactions between SNP alleles and DNA methylation (G ×Me) and between parent-of-origin effects and DNA methylation (PoO ×Me), using case-parent triads or dyads. We use summarized methylation levels over nearby genomic region to ease biological interpretation. The methods were tested on a dataset of parent–offspring dyads, with EWAS data on the offspring. Our results showed that methylation levels around a SNP can significantly alter the estimated relative risk. Moreover, we show how a control dataset can identify false positives.ConclusionsThe new methods, G ×Me and PoO ×Me, integrate DNA methylation in the assessment of genetic relative risks and thus enable a more comprehensive biological interpretation of genome-wide scans. Moreover, our strategy of condensing DNA methylation levels within regions helps overcome specific disadvantages of using sparse chip-based measurements. The methods are implemented in the freely available R package Haplin (https://cran.r-project.org/package=Haplin), enabling fast scans of multi-omics datasets.

Highlights

  • Current technology allows rapid assessment of DNA sequences and methylation levels at a single-site resolution for hundreds of thousands of sites in the human genome, in thousands of individuals simultaneously

  • For each Single-nucleotide polymorphism (SNP), we investigate DNA methylation (DNAm) in various genomic regions and at a single Cytosine-guanine dinucleotide (CpG) site because correlated CpGs are known to exert their effects over long stretches of DNA [15,16,17]

  • Implementation of the methods As mentioned, the main idea behind our methods is to treat the level of DNAm as environmental exposure in the G×E or PoO×E analyses to estimate the change of Relative risk (RR) of a disease, depending on DNAm levels

Read more

Summary

Introduction

Current technology allows rapid assessment of DNA sequences and methylation levels at a single-site resolution for hundreds of thousands of sites in the human genome, in thousands of individuals simultaneously This has led to an increase in epigenome-wide association studies (EWAS) of complex traits, those that are poorly explained by previous genome-wide association studies (GWAS). Among several epigenetic modifications characterized to date, DNAm is by far the most studied epigenetic mark in humans It is a process by which a methyl group binds to the cytosine (C) at a CpG dinucleotide, resulting in activation or repression of gene expression through mechanisms that are highly region- and context-dependent [3,4,5]. It has been shown that the state of methylation is controlled by several enzymes [6] and is influenced by environmental exposures [7]

Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.