Abstract

Genome-wide association studies have seen unprecedented success in identifying genetic loci that correlate with disease susceptibility and severity. Early phases of these studies have predominantly been performed in the Caucasian populations. The next phase in medical genetics is to extend the exploration across genetically diverse populations to leverage on larger sample sizes for locating smaller effects that may be present in most human populations. However, discoveries from these studies do not actually reveal the underlying functional changes to the human genome, but only point to broad regions stipulated by the extent of linkage disequilibrium (LD). Fine-mapping the functional variants can, however, be hampered by extensive LD, which can yield multiple perfect surrogates that are not distinguishable from the underlying causal variants, although several studies have illustrated the value of relying on multiple genetically diverse populations to narrow the candidate regions where the functional variants can be found in. Here, we explore the efficiency of trans-ethnic meta-analysis in discovering genetic association and in fine-mapping the causal variants by asking: are there any population diversity metrics that will be useful for: (i) identifying the populations or genomic regions where meta-analysis are likely to be more successful for discovering associations?; (ii) identifying the populations or loci to perform deep targeted sequencing for the purpose of fine-mapping causal variants? Our results indicate that simple metrics like the F(ST) or the population specificity of haplotypes are useful in trans-ethnic meta-analyses, while the degree of haplotype sharing and LD variation are informative of the efficiency in trans-ethnic fine-mapping.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call