Statistical power of EDF tests of normality and the sample size required to distinguish geometric-normal (lognormal) from arithmetic-normal distributions of low variability

Philip D Gingerich

doi:10.1006/jtbi.1995.0050

Abstract

Many biological variables are distributed geometrically (proportionally) rather than arithmetically, and they are lognormal rather than normal on the usual arithmetic scale of measurement. The distinction is important because it affects statistical interpretation at many levels: for example, logarithmic transformations commonly used in biology to standardize variances and linearize relationship of arithmetic measurements will skew underlying distributions if these are inherently arithmetic-normal but not if they are geometric-normal. The purpose of this study is to determine theoretically, using Monte Carlo simulation, which of a range of recommended tests of normality has greatest power, and what sample size is required to distinguish geometric-normal from arithmetic-normal distributions when inherent variability is low, as it is in most biological distributions. Lilliefors’ version of the Kolmogorov-Smirnov test, Frosini’s test, and the Anderson-Darling test are three non-parametric goodness-of-fit tests of normality based on an observed empirical distribution function. When inherent variability Vis on the order of 0.10 (standard deviation 10% of mean), Lilliefors’ test requires a minimum sample of about 2200 to correctly distinguish lognormal distributions from normal 95% of the time (with the level of significance or type I error rate α and the type II error rate β both 0.05). In the same situation, Frosini’s test requires a minimum sample of about 1700; the Anderson-Darling test is more powerful, but still requires a minimum sample of about 1500. Power is sensitive to inherent variability: when V= 0.05 the Anderson-Darling test requires a minimum sample much greater than 2500, but when V= 0.15 it requires a minimum sample of only about 650. Sensitivity of the power of all tests to inherent variability means that the normality of body measurements like weight with Vtypically ≈0.15 is more easily tested than the normality of body lengths with Vtypically ≈0.05 in the same sample. Inherent variability must be considered in designing empirical tests of normality, and differences in inherent variability must be considered in interpreting results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Statistical power of EDF tests of normality and the sample size required to distinguish geometric-normal (lognormal) from arithmetic-normal distributions of low variability

Abstract

Talk to us

Similar Papers

More From: Journal of Theoretical Biology

Lead the way for us

Journal: Journal of Theoretical Biology	Publication Date: Mar 1, 1995
Citations: 28

Similar Papers

A Bayes classifier when the class distributions come from a common multivariate normal distribution : Jack Cartinhour. IEEE Trans. Reliab.41, 124 (1992)
-
Microelectronics Reliability | VOL. 32
--
01 Dec 1992
Microelectronics Reliability | VOL. 32

Evaluation of Techniques for Univariate Normality Test Using Monte Carlo Simulation
...
American Journal of Theoretical and Applied Statistics | VOL. 6
, et. al. ...
09 Jun 2017
American Journal of Theoretical and Applied Statistics | VOL. 6

Detection of Non-Normality in Data Sets and Comparison between Different Normality Tests
Nwakuya T Maureen ... Nduka Wonu
Asian Journal of Probability and Statistics | VOL. -
Nwakuya T Maureen, et. al.Nwakuya T Maureen ... Nduka Wonu
04 Jan 2020
Asian Journal of Probability and Statistics | VOL. -

Empirical Power Comparison Of Goodness of Fit Tests for Normality In The Presence of Outliers
Mayette Saculinggan ... Emily Amor Balase
Journal of Physics: Conference Series | VOL. 435
Mayette Saculinggan, et. al.Mayette Saculinggan ... Emily Amor Balase
26 Apr 2013
Journal of Physics: Conference Series | VOL. 435

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Statistical power of EDF tests of normality and the sample size required to distinguish geometric-normal (lognormal) from arithmetic-normal distributions of low variability

Abstract

Talk to us

Similar Papers

More From: Journal of Theoretical Biology