Abstract

Expression quantitative trait (eQTL) studies are a powerful tool for identifying genetic variants that affect levels of messenger RNA. Since gene expression is controlled by a complex network of gene-regulating factors, one way to identify these factors is to search for interaction effects between genetic variants and mRNA levels of transcription factors (TFs) and their respective target genes. However, identification of interaction effects in gene expression data pose a variety of methodological challenges, and it has become clear that such analyses should be conducted and interpreted with caution. Investigating the validity and interpretability of several interaction tests when screening for eQTL SNPs whose effect on the target gene expression is modified by the expression level of a transcription factor, we characterized two important methodological issues. First, we stress the scale-dependency of interaction effects and highlight that commonly applied transformation of gene expression data can induce or remove interactions, making interpretation of results more challenging. We then demonstrate that, in the setting of moderate to strong interaction effects on the order of what may be reasonably expected for eQTL studies, standard interaction screening can be biased due to heteroscedasticity induced by true interactions. Using simulation and real data analysis, we outline a set of reasonable minimum conditions and sample size requirements for reliable detection of variant-by-environment and variant-by-TF interactions using the heteroscedasticity consistent covariance-based approach.

Highlights

  • Gene-gene and gene-environment interaction effects on common human traits and diseases have been difficult to identify [1]

  • We examined the impact of non-linear transformations and rank-based inverse normal transformations in particular, as the latter approach is commonly applied in eQTL studies

  • In this study we attempted to screen for single nucleotide polymorphisms (SNP) by transcription factors (TFs) interaction on gene-expression using data from the ECLIPSE study, and we describe two methodological issues related to the detection and interpretation of statistical genomic interactions

Read more

Summary

Introduction

Gene-gene and gene-environment interaction effects on common human traits and diseases have been difficult to identify [1]. Part of the challenge is the small effect size of genetic variants on macro-phenotypes (e.g. disease status or anthropometric traits). R01NR013377, R01HL125583 (CPH), and R01HL113264 (EKS and MHC). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.