Abstract

Genome-wide association (GWA) studies for fine-scale mapping of disease loci are conceptually based on the occurrence and history of non-random association of alleles at different loci in populations (or linkage disequilibrium). Thus, a new disease mutation arises on a single ancestral haplotype of a given population in complete linkage disequilibrium with syntenic polymorphic markers. Owing to historical recombination events, markers close to the disease locus will tend to be in strongest disequilibrium than distant markers when the presence of evolutionary mechanisms generating non-uniform disequilibrium patterns is ignored. Therefore, the use of optimal measures to assess gradients of the strength of disequilibrium along chromosomes is crucial for fine mapping of disease loci. In practice, an ideal disequilibrium measure for fine-scale mapping should be a monotone function of the recombination fraction between the marker and the disease locus. In an influential paper, Devlin and Risch (1995, henceforth referred to as “DR also described in the literature as r), Yule's Q (Nei and Li, 1980), and Kaplan and Weir's d (Kaplan and Weir, 1992). It was concluded that the measures δ and D' outperform all other measures for fine-scale mapping. In particular, D&R established analytically that δ is the ideal measure for fine mapping because changes of δ over generations are directly related to the recombination fraction (θ) between the disease and the marker locus. In contrast, they found that D' not only depends on θ, but also on the haplotype frequencies. This supposed advantage of δ over D' for fine-scale mapping of disease loci has become a classical paradigm in the literature on disequilibrium. Nevertheless, D&R's conclusion about the relative merits of δ and D' for fine-scale mapping based on analytic results is erroneous. As is shown below, the source of the error lies in a misinterpretation of the formulas used by D&R to assess the relationship between the decay of disequilibrium by recombination and the measures δ and D'. Despite the time elapsed, this misconception remains uncorrected. Following DR where D = π11π22 − π12π21 and Dmax is the lesser of π1+π+2 and π+1π2+ when D > 0, or the lesser of π1+π+1 and π2+π+2 when D 0 at generation 0; and θ between the disease and the marker locus is constant along generations. After n generations, the decay of the initial disequilibrium (D0) can be obtained as a function of θ by means of the expression Dn = (1 − θ)nD0, where Dn is the value of disequilibrium in the nth generation (Hedrick, 2005). Expressing the measures δ and D' in terms of (1 − θ)n, D&R obtained then the following mathematical relationship: (1−θ)n=Dn/D0           =⌊π11π22−π12π21⌋/π+1π22=δ (1) where π+1π22 was considered to be the best estimate of D0, the initial amount of linkage disequilibrium, given that π21 = 0 at generation 0 and hence π11 = π+1. In contrast: (1−θ)n=D′[1+(π21/π22)] (2) DR the formula (1) can be then rewritten as follows: (1−θ)n=Dn/D0           =⌊π11(n)π22(n)−π12(n)π21(n)⌋/π+1π22(0)=δ[π22(n)/π22(0)] Therefore, the relationship between δ and θ depends on the frequencies of the haplotype π22 at generations n and 0. In terms of D', the formula (1) can be rewritten as: (1−θ)n=Dn/D0           =Dn/π+1π22(0)           =Dn/π+1π2+           =Dn/Dmax=D′n given that π22(0) = π2+ and Dmax = π+1π2+ when D > 0 and π+1π2+ < π1+π+2. It is shown, therefore, that D' is directly related to the recombination fraction. Likewise, the formula (2) can be rewritten as: (1−θ)n=D′n{1+[π21(0)/π22(0)]}=D′n given that π21(0) = 0. D&R studied the relationship between D' and θ when π1+ < π+1. Note, however, that π1+ = [π11(0) + π12(0)] cannot be lower than π+1 = [π11(0) + π21(0)] = π11 (0). Table 1 Layout and notation for population haplotypes, marker alleles, and disease allele frequencies in a 2 × 2 table. Our reanalysis on the relationship of the recombination fraction with the measures of disequilibrium δ and D' thus, contradicts the contention by D&R that δ outperforms D' for fine-scale mapping because changes of δ over generations are directly related to the recombination fraction. In fact, we reached the opposite conclusion to that of D&R. This finding reinforces the view that D' exhibits better statistical properties as a general measure of linkage desequilibrium than other commonly used measures (Zapata, 2011). In particular, D' seems to be an optimal measure for mapping of marker association and localization of disease loci sensu D&R.

Highlights

  • Genome-wide association (GWA) studies for fine-scale mapping of disease loci are conceptually based on the occurrence and history of non-random association of alleles at different loci in populations

  • The use of optimal measures to assess gradients of the strength of disequilibrium along chromosomes is crucial for fine mapping of disease loci

  • Devlin and Risch (1995, referred to as “D&R”) compared the efficiency of five measures of linkage disequilibrium used to refine the location of disease loci: the robust formulation of the population attributable risk δ (Lewin and Bertell, 1978), Lewontin’s D’ (Lewontin, 1964), the correlation coefficient (Hill and Robertson, 1968; described in the literature as r), Yule’s Q (Nei and Li, 1980), and Kaplan and Weir’s d (Kaplan and Weir, 1992)

Read more

Summary

Introduction

A comparison of linkage disequilibrium measures for fine-scale mapping by Devlin, B., and Risch, N. Genome-wide association (GWA) studies for fine-scale mapping of disease loci are conceptually based on the occurrence and history of non-random association of alleles at different loci in populations (or linkage disequilibrium). A new disease mutation arises on a single ancestral haplotype of a given population in complete linkage disequilibrium with syntenic polymorphic markers.

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call