Abstract

A major challenge confronting meta-analysts seeking to synthesize existing empirical research on a given topic is the frequent failure of primary studies to fully report their sample statistics. Because such research cannot be included in a meta-analysis unless the unreported statistics can somehow be recovered, a number of methods have been devised to estimate the sample mean and standard deviation from other quantities. This note compares several recently proposed sets of estimators that rely on extrema and/or quartiles to estimate unreported statistics for any given sample. The simplest method relies on an underlying model of normality, while the more complex methods are explicitly designed to accommodate non-normality. Our empirical comparison uses a previously developed data set containing 58 samples, ranging in size from 48 to 2,528 observations, from a standard depression screening instrument, the nine-item Patient Health Questionnaire (PHQ-9). When only the median and extrema are known, we find that the estimation method based on normality yields the most accurate estimates of both the mean and standard deviation, despite the existence of asymmetry throughout the data set; and when other information is given, the normality-based estimators have accuracy comparable to that of the other estimators reviewed here. Additionally, if the sample size is unknown, the method based on normality is the only feasible approach. The simplicity of the normality-based approach provides an added convenience for practitioners.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call