Abstract

The multilevel model has become a staple of social research. I textually and formally explicate sample design features that, I contend, are required for unbiased estimation of macro-level multilevel model parameters and the use of tools for statistical inference, such as standard errors. After detailing the limited and conflicting guidance on sample design in the multilevel model didactic literature, illustrative nationally-representative datasets and published examples that violate the posited requirements are identified. Because the didactic literature is either silent on sample design requirements or in disagreement with the constraints posited here, two Monte Carlo simulations are conducted to clarify the issues. The results indicate that bias follows use of samples that fail to satisfy the requirements outlined; notably, the bias is poorly-behaved, such that estimates provide neither upper nor lower bounds for the population parameter. Further, hypothesis tests are unjustified. Thus, published multilevel model analyses using many workhorse datasets, including NELS, AdHealth, NLSY, GSS, PSID, and SIPP, often unwittingly convey substantive results and theoretical conclusions that lack foundation. Future research using the multilevel model should be limited to cases that satisfy the sample requirements described.

Highlights

  • The multilevel model (MLM) has become a staple of social research

  • Anyone using the MLM must either use context-representative probability samples (i.e., p j = 1.00) or must explain why they believe Eq 12 holds for the parameters of interest that vary across contexts

  • The good news is that full information maximum likelihood (FIML) MLM estimates of fixed level-1 coefficients are generally unbiased even in datasets that fail to meet the criteria of fully multilevel probability (FMP) sampling

Read more

Summary

Introduction

The multilevel model (MLM) has become a staple of social research. Researchers have used it to explore and explain cross-national fertility differences (Mason et al 1983), effects of track location on student achievement (Gamoran 1992), life circumstance effects on criminal conduct (Horney et al 1995), labor market restructuring and gender (McCall 2000), grandparents’ effects on child mortality (Beise and Voland 2002), childcare availability effects on. Lucas fertility decisions (Hank and Kreyenfeld 2003), schools’ varying racial differences in college preparatory course-taking (Lucas and Berends 2007), the power of community characteristics on community attachment (Flaherty and Brown 2010), effects of race and sex discrimination on earnings (Lucas 2013b), and much more.1 In these and other MLM analyses researchers have estimated statistical models more consistent with their multilevel theories, facilitated more effective partition of variance into multiple levels, and improved the accuracy of standard errors. Estimators are biased and the tools of inferential statistics (e.g., standard errors) are inapplicable, dissolving the foundation for findings from such studies This circumstance may not be rare; the processes transforming probability samples into problematic samples for the MLM may be inconspicuous but widespread. Monte Carlo simulations illustrate costs of violating posited MLM sample requirements, followed by a concluding section

The multilevel model
Probability sampling theory and its implications: textual explication
Representativeness and its implications
Three familiar sample problems
Types of representativeness
Types of MLM parameters
Theorized data requirements for multilevel estimation: formal explication
Context-representative micro-level probability sampling
Macro-level probability sampling
Two important implications
The multilevel model in didactic perspective
Common complex sample designs and the MLM
The dangers of simpler designs
On inconvenient datasets
Monte Carlo simulations
Study 1—design
Study 1—results
Study 2—design
Study 2—results
Findings
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call