Estimation of Denominator Degrees of Freedom of F-Distributions for Assessing Wald Statistics for Fixed-Effect Factors in Unbalanced Mixed Models
SUMMARY Tests for fixed-effect factors in unbalanced mixed models have previously used t-tests on a contrastby-contrast basis or Wald statistics without a universally accepted method of calculating the denominator degrees of freedom. This situation has arisen because the variances of different contrasts are differently weighted sums of the variance components with associated degrees of freedom that are not necessarily equal. A simultaneous F-test for differences between all levels of a fixed-effect factor can be derived by forming new contrasts, by rotation of the original contrasts, with variances that are close to being the same weighted sum of variance components. The associated degrees of freedom for these new contrasts are nearly equal. A small simulation study shows the appropriateness of a X2 approximation to the distribution of the weighted sums of variance components. Three simple examples are used to demonstrate the effects of rotation. The last of these examples is also used to compare the proposed simultaneous F-test with the distribution of the Wald statistic obtained by numerical simulation. The method of rotations is then applied to data on the range size of mountain hares (Lepus timidus) to assess the evidence for a two-way interaction between season and habitat. Analysis of data from balanced experiments is traditionally performed by an analysis of variance in which the total sum of squared deviations of observations about their mean is partitioned into sums of squares attributable to either the different treatment effects or the different random effects. Under standard distributional assumptions, each random effect sum of squares follows a multiple of a X2-distribution, this multiple being the product of the degrees of freedom (the number of independent error contrasts whose square has the required expectation) and a linear combination of the different variance components, the coefficients in this linear combination being fixed by the design. Under any null hypothesis concerning the absence of treatment effects, the corresponding treatment sum of squares also follows such a multiple of a X2-distribution. Furthermore, the linear combination of variance components in the multiple for the treatment sum of squares is identical to the linear combination for one of the random-effect sum of squares; hence, hypotheses about the treatment effects can be tested by dividing treatment mean squares by the appropriate error mean square to form a variance ratio with an F-distribution. The benefits of experimental design accrue, first, through the variance of all contrasts for any given treatment factor or interaction having the same expectation under the null hypothesis; second, through this variance being as small as possible; and, third, through there being as many error contrasts as possible whose variance is the same as that for the treatment factor or interaction. Analysis of data that lacks these properties of balance is much less straightforward, yet there are many areas in which, due to the nature of experimental material, the treatment effects cannot be applied in a balanced fashion, and so such unbalanced data are the norm rather than the
- Research Article
23
- 10.1016/0047-259x(92)90080-y
- Jul 1, 1992
- Journal of Multivariate Analysis
Nonnegative estimation of variance components in unbalanced mixed models with two variance components
- Research Article
29
- 10.5860/choice.46-5082
- May 1, 2009
- Choice Reviews Online
Data analysis for experimental design
- Research Article
31
- 10.1198/016214504000000322
- Jun 1, 2004
- Journal of the American Statistical Association
We consider the problem of setting a confidence interval or bound for a linear combination of variance components related to a multivariate normal distribution, which includes important applications such as comparing variance components and testing the bioequivalence between two drug products. The lack of an exact confidence interval for a general linear combination of variance components spurred the development of a modified large-sample (MLS) method that was shown to be superior to many other approximation methods. But existing MLS method requires the use of independent estimators of variance components. Using a chi-squared representation of a quadratic form of a multivariate normal vector, we extend the MLS method to situations in which estimators of variance components are dependent. Using Edgeworth and Cornish–Fisher expansions, we explicitly derive the second-order asymptotic coverage error of the MLS confidence bound. Our results show that the MLS confidence bound is not second-order accurate in general, but is much better than the confidence bound based on normal approximation and is nearly second-order accurate in some special cases. Our results also show how to construct an MLS confidence bound that is second-order accurate. As an application, we discuss the use of the MLS method in assessing population bioequivalence, with some simulation results and an example.
- Research Article
1
- 10.3934/math.20241601
- Jan 1, 2024
- AIMS Mathematics
<p>This study introduced a novel exact-scheme analysis of variance to tackle the challenge of incomplete data within the Greco-Latin square experimental design (GLSED), specifically for scenarios with a single missing observation across any treatment and block level, thus eliminating the need for conventional data imputation methods. This approach innovatively addresses and mitigates the bias in the treatment sum of squares, a significant drawback of traditional missing plot techniques, by providing a precise, exact-scheme-based formula for calculating the treatment sum of squares in fixed-effect GLSED contexts with unrecorded values. Moreover, it offers a method for correcting biased treatment sum of squares values, presenting an adjustment mechanism for instances where the least squares method was previously employed to estimate missing values. This comprehensive strategy not only enhances the methodological accuracy and integrity of GLSED studies but also contributes significantly to the field by offering a solution to navigate the complexities of incomplete datasets without resorting to data imputation, thus improving the rigor and validity of experimental designs in the face of missing data challenges.</p>
- Research Article
22
- 10.2307/2527880
- Sep 1, 1958
- Biometrics
In a previous paper [10] based on the fundamental paper by Yates [11], the writer has discussed the estimation of missing observations in incomplete data. When the data are completed with these estimates, standard calculations give the correct estimates of treatment effects, etc., and a standard analysis of variance yields the correct residual sum of squares for the estimation of error. However, as Yates [11] observed, other component sums of squares in the standard analysis are incorrect (though perhaps not seriously so). For the randomized blocks design, and for a Latin square with one missing value, Yates derived formulae for adjusting the treatment sum of squares to its correct value. Cornish [2, 3] gave the corresponding results for incomplete block designs, and for designs such as lattice squares, with one missing value. The present paper, following on from the previous paper [10], deals with correcting the standard analysis of variance, and derives a general formula for the necessary corrections. Specific formulae are given which, in particular, provide the necessary correction of the treatment sum of squares when several observations are missing, for the designs with two-way restriction. The standard formulae for determining variances and covariances of treatment comparisons, etc., will also need to be adjusted when the comparisons involve missing values. Tocher [8] gave a general formula for this purpose, which is here extended to cover the singular cases. The principles discussed are illustrated by application to the analysis of a Latin square experiment (numerical example), and in the derivation of an analysis for B.I.B. designs with missing blocks.
- Research Article
8
- 10.2307/2985719
- Nov 1, 1960
- Applied Statistics
In experimental work the results of one or more observations may be missing. An agricultural plot may be trampled, an animal may die, a test tube may be dropped, or a machine may break down. When missing values occur, the usual method of computing the various sums of squares cannot be used unless the missing values are first estimated from the existing data. Cornish' obtained a specific formula for a single missing value and an iterative procedure for several missing values by minimising the error sum of squares. He also showed that the resulting treatment sum of squares had a positive bias and how to eliminate it. Wilkinson3 obtained a general procedure for estimating missing values which involves the solutions of matrix equations. The method of Glenn and Kramer2 may also be used by considering that a balanced incomplete block design is a randomised block design with missing observations, but this procedure would require the estimation of a large number of missing values. This paper will deal with the estimation of several missing values in a balanced incomplete block design by minimising the error sum of squares. Explicit formulae for each value missing will be derived for many special cases and a completely general solution will be given. These formulae may prove to be less tedious in application than the ones now available. A direct method of analysis of the augmented data, not requiring a correction for bias in the treatment sum of squares, will be given. It should be noted that, by estimating missing values, the symmetry in the design is restored, but the estimates serve only to expedite analysis of the data; they do not by any means restore the information lost in the missing observations.
- Research Article
3149
- 10.2307/3001469
- Sep 1, 1956
- Biometrics
In many fields of research, one is faced with the task of comparing the effects of treatments which have been replicated unequally. This happens for a number of reasons. In an experiment on animals, some may get sick and have to be removed from the experiment. In some experiments, the amount of material available for certain treatments may not be as much as for other treatments. If the experimenter has specified orthogonal contrasts that he is interested in before he runs the experiment, one can test the various treatment effects by an F-test after the treatment sum of squares has been partitioned into individual degrees of freedom for each orthogonal contrast. If the experimenter has not specified orthogonal contrasts, one is faced with the problem of deciding which treatments are significantly different. Several writers, including Duncan, Keuls, Newman, and Tukey, have developed multiple range tests to show differences among treatments that have been replicated the same number of times and when nothing was specified concerning the treatments. Duncan [1] compares the above methods and gives citations. This extension to unequal numbers of replications will be exemplified with reference to Duncan's New Multiple Range Test, but is applicable to any of the above writers' tests; all one has to do is use their tabled ranges. In Duncan's test for an equal number of replications, the difference between any two ranked means is significant if the difference exceeds a shortest significant range. This shortest significant range is designated by R, and is obtained by multiplying the standard error of a mean, s,, by a given value, zn2, obtained from a table of significant studentized ranges which Duncan has tabled for both the 5% and 1% test. In Duncan's terminology, n2 is the degrees of freedom of the error mean square and p = 1, 2, * *, t is the number of means concerned. Consider an experiment with five treatments, A, B. C, D, and E, each replicated n times. Suppose on ranking the means from low to high one obtains
- Research Article
1
- 10.1080/01621459.1965.10480791
- Mar 1, 1965
- Journal of the American Statistical Association
In this paper the first two moments of the ratio (treatment sum of squares)/(treatment sum of squares + error sum of squares) over all possible random assignment of treatments to the experimental plots, for a class of 2 associate PBIBD has been obtained. These two monemts are compared with the corresponding moments of a continuous beta distribution to settle the question of approximating the randomization test by the usual F-test. It has been shown that a reasonable approximation to the randomization test based on the statistic F is equivalent to modifying the normal theory test by multiplying the numbers of d.f. of the F-distribution by a factor depending on the heterogeneity of the blocks.
- Research Article
3
- 10.1111/j.2517-6161.1972.tb00890.x
- Sep 1, 1972
- Journal of the Royal Statistical Society Series B: Statistical Methodology
Summary A general method of analysis is given for a design with a set of treatments added to a p-way orthogonal classification. If there is any grouping within the treatments, the treatment sum of squares may be partitioned; the partitioning is assisted by expressing this sum of squares as a quadratic form in the estimated treatment parameters. When the treatments fall into two groups, possibly with unequal replication, the complete partition of the treatment sum of squares is derived. Designs on three-way classifications are considered in some detail. The more useful ones are mostly on single Latin squares, and new ways of adding various numbers of treatments to 5 times 5 and 6times6 Latin squares are described, examples of possible designs being given. Lack of balance of the treatments with respect to one of the orthogonal classifications may be compensated for in another classification so that some designs are better balanced with three classifications than with one or two.
- Research Article
22
- 10.1214/13-sts454
- May 1, 2014
- Statistical Science
The Neyman-Fisher controversy considered here originated with the 1935 presentation of Jerzy Neyman's Statistical Problems in Agricultural Experimentation to the Royal Statistical Society. Neyman asserted that the standard ANOVA F-test for randomized complete block designs is valid, whereas the analogous test for Latin squares is invalid in the sense of detecting differentiation among the treatments, when none existed on average, more often than desired (i.e., having a higher Type I error than advertised). However, Neyman's expressions for the expected mean residual sum of squares, for both designs, are generally incorrect. Furthermore, Neyman's belief that the Type I error (when testing the null hypothesis of zero average treatment effects) is higher than desired, whenever the expected mean treatment sum of squares is greater than the expected mean residual sum of squares, is generally incorrect. Simple examples show that, without further assumptions on the potential outcomes, one cannot determine the Type I error of the F-test from expected sums of squares. Ultimately, we believe that the Neyman-Fisher controversy had a deleterious impact on the development of statistics, with a major consequence being that potential outcomes were ignored in favor of linear models and classical statistical procedures that are imprecise without applied contexts.
- Research Article
29
- 10.1080/00401706.1984.10487938
- May 1, 1984
- Technometrics
A problem that can occur in a variance component analysis is the estimation and construction of confidence intervals for linear combinations of the variance components. This article considers the unbalanced one-way classification model and develops a procedure that can be used to construct a confidence interval on a linear combination of the among- and within-group variances. A simulation study suggests that the procedure provides an interval that in most cases has an achieved confidence coefficient at least as great as the stated level.
- Research Article
21
- 10.1080/01621459.1978.10480068
- Sep 1, 1978
- Journal of the American Statistical Association
Methodology is proposed for the construction of exact confidence intervals on nonnegative linear combinations of variance components from nested classification models. Examples are given for the one-fold and two-fold classifications. The robustness of these confidence intervals to model breakdown is also discussed.
- Research Article
14
- 10.2307/1268107
- May 1, 1984
- Technometrics
A problem that can occur in a variance component analysis is the estimation and construction of confidence intervals for linear combinations of the variance components. This article considers the unbalanced one-way classification model and develops a procedure that can be used to construct a confidence interval on a linear combination of the among- and within-group variances. A simulation study suggests that the procedure provides an interval that in most cases has an achieved confidence coefficient at least as great as the stated level.
- Research Article
5
- 10.1080/02331888708802011
- Jan 1, 1987
- Statistics
It is shown that, by a reparametrization, the problems of estimationg a linear combination of variance components can be reduced to that of estimating a single variance component. Such a reduction is used to obtain some characterizations of nonnegatively estimable linear combinations of varaince components. Characterization of nonnegative estimability using MINQUE is also discussed
- Research Article
1
- 10.33140/pcii.06.02.06
- Apr 20, 2023
- Petroleum and Chemical Industry International
This study focuses on optimizing the process of biofuel production from citrus peel using the Design of Experiments (DOE) technique. This study aims to determine the optimal values for the variables that have a significant impact on the production of biofuel. The variance within and between data groups was determined using the analysis of variance (ANOVA) table. The ANOVA table shows how much of the response variable's variation (biofuel production) can be explained by the independent variables (A, B, C, D, E, AB, AC, AD, AE, and BJ) and how much is caused by random error. The ANOVA table comprises of three primary parts: the F-statistic, the p-value, the df, the mean square (MS), the source of variation, and the sum of squares (SS). The wellspring of variety alludes to the beginning of the information variety, which can be either the lingering or the model. The amount of squares estimates the information's changeability, with the absolute amount of squares addressing the amount of the squared deviations of the genuine qualities from the mean worth. The residual is the sum of the squared deviations from the predicted values of the actual values, while the model's sum of squares is the sum of the squared deviations from the mean of the predicted values. The model has 10 degrees of freedom (the number of independent variables) and the residual has 4 degrees of freedom (the number of observations minus the number of independent variables). These degrees of freedom represent the number of independent pieces of information used to estimate a parameter. The mean square, which indicates the typical amount of variation for each variation source, is calculated by dividing the sum of squares by the degrees of freedom. The degree to which the model explains the variation in the data is indicated by the F-statistic, which is the ratio of the model's mean square to the residual's mean square. The probability of obtaining an F-statistic that is as large as the one observed if the null hypothesis is true is represented by the p-value. The independent variables' insignificant impact on biofuel production is the null hypothesis in this instance. The model's p-esteem in this study is under 0.05, demonstrating that the free factors essentially affect biofuel creation and that the model is genuinely huge. In addition, the model is significant because the F-statistic is relatively large in comparison to the F-distribution for the 10 and 4 degrees of freedom, respectively. The estimated coefficients for the linear regression model used to investigate the production of biofuel from citrus peel can be found in the ANOVA coefficients table. The table provides a list of the intercept and independent variables' coefficients, standard errors, t-values, and p-values. When all of the independent variables are zero, the intercept has a coefficient of 0.0672, indicating the estimated value of the response variable. The fact that the intercept does not differ significantly from zero is supported by the fact that its p-value is not significant. The fact that the coefficients of the independent variables A, E, AC, AD, AE, and BJ are not statistically significant indicates that these variables have little impact on the response variable. On the other hand, the positive coefficients and significant p-values of the independent variables B and C suggest that an increase in their values could result in an increase in the production of biofuel from citrus peel. In conclusion, the key variables that influence the production of biofuel from citrus peel have been identified thanks to the use of the Design of Experiments (DOE) method. According to the findings of this study, an increase in the production of biofuel from citrus peel may result from an increase in the values of the independent variables B and C. The development of environmentally friendly energy sources and the optimization of biofuel production processes will benefit greatly from these findings