Abstract

Non-linear hierarchical models are commonly used in many disciplines. However, inference in the presence of non-nested effects and on large datasets is challenging and computationally burdensome. This paper provides two contributions to scalable and accurate inference. First, I derive a new mean-field variational algorithm for estimating binomial logistic hierarchical models with an arbitrary number of non-nested random effects. Second, I propose “marginally augmented variational Bayes” (MAVB) that further improves the initial approximation through a step of Bayesian post-processing. I prove that MAVB provides a guaranteed improvement in the approximation quality at low computational cost and induces dependencies that were assumed away by the initial factorization assumptions. I apply these techniques to a study of voter behavior using a high-dimensional application of the popular approach of multilevel regression and post-stratification (MRP). Existing estimation took hours whereas the algorithms proposed run in minutes. The posterior means are well-recovered even under strong factorization assumptions. Applying MAVB further improves the approximation by partially correcting the under-estimated variance. The proposed methodology is implemented in an open source software package.

Highlights

  • Introduction and Motivating ExampleHierarchical models, often known as multilevel, mixed, or random effects models, are ubiquitous in the social sciences (Gelman and Hill 2006; Rabe-Hesketh and Skrondal 2008)

  • I compare my variational algorithms against two gold standards (Laplace approximation using blme - Bates et al 2015; Chung et al 2015; Hamiltonian Monte Carlo (HMC) in STAN using brms; Burkner 2017) and Automatic Differentiation Variational Inference (ADVI; Kucukelbir et al 2017)

  • This section re-analyses the results in Ghitza and Gelman (2013) where I compare my results against Hamiltonian Monte Carlo (HMC)

Read more

Summary

Introduction and Motivating Example

Hierarchical models, often known as multilevel, mixed, or random effects models, are ubiquitous in the social sciences (Gelman and Hill 2006; Rabe-Hesketh and Skrondal 2008). This induces dependencies between the parameters that were assumed away in estimating the initial procedure and provides a provable guaranteed improvement upon the original approximation This pushes forward the literature on variational inference for hierarchical models by extending work in the case of a single random effect (Hall et al 2011; Ormerod and Wand 2012; Tan and Nott 2013; Hall et al 2019) or two non-nested random effects (Jeon et al 2017; Menictas et al 2019) to the general case. The latter shows dramatic gains in speed: Even after applying MAVB and drawing 4,000 samples, the fastest variational algorithm is nearly 60 times faster than Laplace approximation and nearly 350 times faster than Hamiltonian Monte Carlo for the most complex models. I use this to draw out some guidance for practitioners of MRP in other substantive domains

Mean-Field Variational Inference for Binomial Hierarchical Models
Variational Inference
Marginally Augmented Variational Bayes
Simulation Study
Application
Brief Explanation of MRP
Estimating Complex Hierarchical Models
Choosing an Optimal Model
Method
Findings
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call