Abstract

Genome-wide association studies (GWASs) have identified a large number of noncoding associations, calling for systematic mapping to causal regulatory variants and their distal target genes. A widely used method, quantitative trait loci (QTL) mapping for chromatin or expression traits, suffers from sample-to-sample experimental variation and trans-acting or environmental effects. Instead, alleles at heterozygous loci can be compared within a sample, thereby controlling for those confounding factors. Here we introduce a method for chromatin structure-based allele-specific pairing of regulatory variants and target transcripts. With phased genotypes, much of allele-specific expression could be explained by paired allelic cis-regulation across a long range. This approach showed approximately two times greater sensitivity than QTL mapping. There are cases in which allele imbalance cannot be tested because heterozygotes are not available among reference samples. Therefore, we employed a machine learning method to predict missing positive cases based on various features shared by observed allele-specific pairs. We showed that only 10 reference samples are sufficient to achieve high prediction accuracy with a low sampling variation. In conclusion, our method enables highly sensitive fine mapping and target identification for trait-associated variants based on a small number of reference samples.

Highlights

  • Most disease associations discovered by genome-wide association studies (GWASs) are distant from coding genes

  • As a set of test variants, we obtained 2,351 single-nucleotide polymorphisms (SNPs) associated with autoimmune diseases, allergic diseases, inflammation-related diseases, and laboratory results for immune cells (S2 Table) from the GWAS catalogue

  • For a given set of trait-associated variants, our method enables the identification of causal regulatory variants in linkage disequilibrium (LD) and their functional target genes

Read more

Summary

Introduction

Most disease associations discovered by genome-wide association studies (GWASs) are distant from coding genes. It was shown that these noncoding variants are concentrated in regulatory DNA marked by DNase I hypersensitivity[1] or histone modifications[2]. This enables epigenetic fine mapping of noncoding GWAS single-nucleotide polymorphisms (SNPs)[3]. In this regard, quantitative trait loci (QTL) mapping and more recently, allele-specific analyses, are used to test the functional differentiation of different alleles in terms of chromatin accessibility[4,5], histone modification levels[6,7,8,9,10,11], or transcription factor binding[12]

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.