The Disequilibrium Maximum-Likelihood–Binomial Test Does Not Replace the Transmission/Disequilibrium Test

Christine Windemuth, Steve Horvath, Michael Knapp

doi:10.1086/303014

Abstract

To the Editor: In a previous issue of the Journal, Huang and Jiang (1999) introduced the disequilibrium maximum-likelihood–binomial test (DMLB) for affected-sibship data. The DMLB is supposed to combine the advantages of the mean test (Blackwelder and Elston 1985) and the transmission/disequilibrium test (TDT) (Terwilliger and Ott 1992; Spielman et al. 1993): it is claimed to perform well when linkage disequilibrium (LD) is low and to have power higher than or equal to that of the TDT when the LD ranges from moderate to strong. If this claim were correct, the TDT would be obsolete. In this letter, we show how to compute exact P values and exact critical values for the DMLB (and for the TDT), and we show that, when these exact critical values are used, the DMLB is never significantly more powerful than the TDT when there is complete LD. The opposite is true: the TDT is often significantly more powerful than the DMLB. Even when LD is at 80% of its maximum, the TDT still outperforms the DMLB when the marker- and disease-allele frequencies are identical.
The asymptotic approximation used by Huang and Jiang (1999) can be inaccurate. We show that their choice of the critical value for the DMLB (cDMLB) is often anticonservative (i.e., the actual false-positive rate exceeds the nominal level), whereas their choice of the critical value for the TDT (cTDT) tends to be overly conservative. The exact critical values depend on the number of heterozygous parents in the sample, and we are making available (contact the corresponding author) a SAS program (SAS Institute 1990) that computes exact critical values.
Huang and Jiang (1999) introduce DMLB tests for two different types of hypothesis. For the sake of brevity, we focus only on the more important two-sided hypothesis, which is relevant when there is no prior knowledge about which marker allele is in LD with the disease. We first give a brief description of the TDT and the DMLB for families with two affected children.
Suppose that there are n2 heterozygous B1B2 parents in the data set. Let n22 denote the number of heterozygous parents who transmitted allele B1 to both children, let n21 denote the number who transmitted B1 to one child and B2 to the other child, and let n20 denote the number who transmitted B2 to both children, so that n22 + n21 + n20 = n2. These parents transmit B1 a total of 2n22 + n21 times and B2 a total of 2n20 + n21 times, so the TDT statistic is given by
TDT = 2(n22 − n20)² / n2,
with an asymptotic χ²₁ distribution under the null hypothesis of no linkage. The score-statistic version of the DMLB is given by [equation missing]. Incidentally, we note that [expression missing] equals the mean test for these data.
Huang and Jiang (1999) show that, under the null hypothesis of no linkage, the DMLB has the asymptotic distribution .5χ²₁ + .5χ²₂. They use this asymptotic distribution to compute the critical value cDMLB = 17.38, corresponding to a false-positive rate of α = .0001. Similarly, the asymptotic χ²₁ distribution of the TDT under the null hypothesis gives, for the same false-positive rate, the critical value cTDT = 15.14.
These critical values are not ideal, as can be seen from table 1, which lists the exact error rates as a function of the number of heterozygous parents n2. Fortunately, one does not need to rely on asymptotic approximations, since, under the null hypothesis, one can easily compute exact P values for both tests. Even if one is not interested in exact P values, one can easily compute the exact critical values that should be used, for families with two affected offspring, to maintain the correct type I error rate. The key observation for these calculations is that, under the null hypothesis, (n22, n21, n20) has a multinomial distribution with parameters n2 and (p2, p1, p0) = (.25, .5, .25), and both test statistics are simple functions of this low-dimensional count vector. These null distributions can be used to compute the exact critical values for both tests, some of which are listed in table 2.
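As a minimal sketch of the exact calculation described above (it is not the authors' SAS program), the following Python code enumerates all outcomes (n22, n21, n20) for a given n2, weights each by its multinomial null probability, and accumulates the exact null distribution of a test statistic. The function names and the convention of rejecting when the statistic is greater than or equal to the critical value are assumptions made for illustration. The TDT is written as 2(n22 − n20)²/n2, which follows from counting B1 and B2 transmissions; any other statistic of (n22, n21, n20), such as the DMLB, could be passed in the same way.

```python
from math import exp, lgamma, log


def null_pmf(n22, n21, n20):
    """Multinomial probability of (n22, n21, n20) with cell probabilities (.25, .5, .25)."""
    n2 = n22 + n21 + n20
    log_coef = lgamma(n2 + 1) - lgamma(n22 + 1) - lgamma(n21 + 1) - lgamma(n20 + 1)
    return exp(log_coef + (n22 + n20) * log(0.25) + n21 * log(0.5))


def tdt(n22, n21, n20):
    """TDT statistic for two affected children per family; requires n2 >= 1."""
    n2 = n22 + n21 + n20
    return 2.0 * (n22 - n20) ** 2 / n2


def exact_null_distribution(stat, n2):
    """Map each attainable value of `stat` to its exact null probability."""
    dist = {}
    for n22 in range(n2 + 1):
        for n21 in range(n2 - n22 + 1):
            n20 = n2 - n22 - n21
            value = stat(n22, n21, n20)
            dist[value] = dist.get(value, 0.0) + null_pmf(n22, n21, n20)
    return dist


def exact_error_rate(stat, n2, critical_value):
    """Exact null probability that stat >= critical_value (assumed rejection rule)."""
    dist = exact_null_distribution(stat, n2)
    return sum(p for value, p in dist.items() if value >= critical_value)


def exact_critical_value(stat, n2, alpha):
    """Smallest attainable c with exact P(stat >= c) <= alpha, or None if none exists."""
    dist = exact_null_distribution(stat, n2)
    tail = 0.0
    best = None
    for value in sorted(dist, reverse=True):
        tail += dist[value]
        if tail > alpha:
            break
        best = value
    return best
```

For example, `exact_error_rate(tdt, 50, 15.14)` evaluates the exact type I error of the asymptotic TDT cutoff when n2 = 50, and `exact_critical_value(tdt, 50, 1e-4)` returns a nonrandomized exact cutoff for the same n2. The enumeration visits on the order of n2² outcomes, which is adequate for the sample sizes discussed here.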
The critical values depend on the sample size, but there is no monotonic relationship between the number of heterozygous parents n2 and the critical values. Since interpolation between the different values of n2 is difficult, we are making available (contact the corresponding author) a SAS program (SAS Institute 1990) that calculates the critical values for both tests.
Table 1 Exact Error Rates of the DMLB and the TDT Test Statistics When the Critical Values (Corresponding to α=.0001) Proposed by Huang and Jiang (1999) Are Used
Table 2 Exact Critical Values for the TDT and the DMLB Corresponding to α, as a Function of n2
To compare the power of the two tests, we conducted simulation studies for the genetic models studied by Huang and Jiang (1999). We considered four genetic models: additive, dominant, multiplicative, and recessive. Let f0, f1, and f2 be the penetrances of disease genotypes dd, Dd, and DD, respectively, where D is the disease-causing allele. The genotypic relative risks (GRRs) are defined as r1=f1/f0 and r2=f2/f0. Like Huang and Jiang, we considered the following GRR values in the power calculation: (1) for the additive model, r1=4, r2=7; (2) for the dominant model, r1=4, r2=4; (3) for the multiplicative model, r1=4, r2=16; and (4) for the recessive model, r1=1, r2=4. We assumed that the biallelic marker locus and the disease locus are tightly linked (θ=0), and we studied two marker-allele frequencies m (.2 and .5) and three disease-allele frequencies p (.1, .2, and .5). We looked at four different values (1, .80, .50, and .30) of the normalized LD δp=Δ/Δmax, where Δ=P(B1D)−mp and Δmax=min(m,p)−mp.
For each genetic model, we determined the approximate number of families N required to yield 80% power for the TDT (Knapp 1999). If N > 1,000, then each sample was limited to 1,000 families. Both tests were evaluated on the same replicates.
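To make the LD parameterization concrete, the following sketch converts a marker-allele frequency m, a disease-allele frequency p, and a normalized LD value δp into the four haplotype frequencies. It assumes that m is the frequency of B1, that p is the frequency of D, and that Δmax = min(m, p) − mp, the usual upper bound on a positive Δ; the function name and the dictionary keys are hypothetical.

```python
def haplotype_frequencies(m, p, delta_p):
    """Return the four marker-disease haplotype frequencies for normalized LD delta_p.

    Assumes m = P(B1), p = P(D), and delta_max = min(m, p) - m * p (positive LD).
    """
    delta_max = min(m, p) - m * p
    delta = delta_p * delta_max
    return {
        "B1D": m * p + delta,
        "B1d": m * (1 - p) - delta,
        "B2D": (1 - m) * p - delta,
        "B2d": (1 - m) * (1 - p) + delta,
    }
```

With m = p and δp = 1, the haplotypes B1d and B2D have frequency zero, so the marker allele B1 and the disease allele D always occur together. This is the candidate-gene situation in which the typed marker itself affects the disease risk.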
For each replicate, we determined the number n2 of heterozygous parents in the sample and then used it to compute exact critical values for both tests. Since both tests have a discrete distribution, we used a randomized test to reject at an exact false-positive rate of α=.0001.
Table 3 lists the results of our simulation studies. When the marker-allele frequency equals the disease-allele frequency (m=p), the TDT has more power than the DMLB when δp⩾.8. Even when δp=.5, the DMLB is not consistently more powerful than the TDT.
Table 3 Comparison of the Power of the DMLB with That of the TDT, When α=.0001
When m≠p and δp=1, the TDT is more powerful than the DMLB in all but one case (multiplicative, p=.5, m=.2). However, when δp=.8, the DMLB is, “on average,” more powerful than the TDT. When δp⩽.5, the DMLB is usually more powerful than the TDT. However, in many cases in which the DMLB is significantly more powerful than the TDT, the required sample sizes are unrealistic (>1,000 families), so neither test would be useful in such a setting.
We conclude that, although tests that adapt to the degree of LD are a good idea, the DMLB usually is not more powerful than the TDT when LD is strong (δp⩾.80). For a candidate-gene study in which the typed marker affects the disease risk (i.e., m=p and δp=1), the TDT is preferable to the DMLB. In their study, Huang and Jiang (1999) showed that, when the LD is very weak, the mean test has more power than the DMLB. Therefore, the DMLB is most useful when there is moderate LD between marker and disease locus. Unfortunately, in practice, the amount of LD is usually unknown.
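The randomized rejection rule used in the simulations can be sketched as follows for the TDT. This is an illustration under stated assumptions, not the authors' code: it uses the fact that, under the null hypothesis, the number b of B1 transmissions from n2 heterozygous parents to two children each is binomial with parameters 2n2 and 1/2, so that TDT = 2(b − n2)²/n2. The function names are hypothetical. The rule rejects when the statistic exceeds c and rejects with probability γ when it equals c, with γ chosen so that the overall size is exactly α.

```python
from math import comb


def tdt_null_distribution(n2):
    """Exact null distribution of TDT = 2*(b - n2)**2 / n2, b ~ Binomial(2*n2, 1/2)."""
    total = 2 ** (2 * n2)
    dist = {}
    for b in range(2 * n2 + 1):
        value = 2.0 * (b - n2) ** 2 / n2
        dist[value] = dist.get(value, 0.0) + comb(2 * n2, b) / total
    return dist


def randomized_rule(dist, alpha):
    """Return (c, gamma) such that P(T > c) + gamma * P(T == c) equals alpha."""
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    tail = 0.0  # P(T > value) accumulated over the larger attainable values
    for value in sorted(dist, reverse=True):
        if tail + dist[value] > alpha:
            return value, (alpha - tail) / dist[value]
        tail += dist[value]
    raise ValueError("no critical value found; check that dist sums to 1")


def reject(t, c, gamma, u):
    """Decide for an observed statistic t, given a uniform(0, 1) draw u."""
    return t > c or (t == c and u < gamma)
```

In a simulation, `c` and `gamma` would be recomputed for the n2 observed in each replicate, and `u` would be a fresh uniform random number, for example from `random.random()`.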
