Evaluating Stan’s Variational Bayes Algorithm for Estimating Multidimensional IRT Models

Esther Ulitzsch, Steffen Nestler

doi:10.3390/psych4010007

Open Access

https://doi.org/10.3390/psych4010007

Abstract

Bayesian estimation of multidimensional item response theory (IRT) models in large data sets may come with impractical computational burdens when general-purpose Markov chain Monte Carlo (MCMC) samplers are employed. Variational Bayes (VB)—a method for approximating the posterior distribution—poses a potential remedy. Stan’s general-purpose VB algorithms have drastically improved the accessibility of VB methods for a wide psychometric audience. Using marginal maximum likelihood (MML) and MCMC as benchmarks, the present simulation study investigates the utility of Stan’s built-in VB function for estimating multidimensional IRT models with between-item dimensionality. VB yielded a marked speed-up in comparison to MCMC, but did not generally outperform MML in terms of run time. VB estimates were trustworthy only for item difficulties, while bias in item discriminations depended on the model’s dimensionality. Under realistic conditions of non-zero correlations between dimensions, VB correlation estimates were subject to severe bias. The practical relevance of performance differences is illustrated with data from PISA 2018. We conclude that in its current form, Stan’s built-in VB algorithm does not pose a viable alternative for estimating multidimensional IRT models.
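To make the model class in the abstract concrete, the following is a minimal sketch of how binary responses could be simulated from a two-parameter logistic model with between-item dimensionality, in which every item loads on exactly one correlated latent dimension. It is not the simulation design of the study: the function name `simulate_between_item_2pl`, the parameterization a_j(θ − b_j), the common correlation `rho`, and all generating distributions are assumptions chosen for illustration.

```python
import numpy as np


def simulate_between_item_2pl(n_persons, items_per_dim, n_dims, rho, seed=0):
    """Simulate 0/1 responses from a between-item multidimensional 2PL model.

    All generating distributions below are illustrative assumptions,
    not the conditions used in the study.
    """
    rng = np.random.default_rng(seed)
    n_items = items_per_dim * n_dims

    # Latent correlation matrix: unit variances, common correlation rho.
    sigma = np.full((n_dims, n_dims), float(rho))
    np.fill_diagonal(sigma, 1.0)
    theta = rng.multivariate_normal(np.zeros(n_dims), sigma, size=n_persons)

    # Hypothetical item parameters.
    a = rng.uniform(0.5, 2.0, size=n_items)  # discriminations
    b = rng.normal(0.0, 1.0, size=n_items)   # difficulties

    # Between-item dimensionality: item j measures only dimension dim_of_item[j].
    dim_of_item = np.repeat(np.arange(n_dims), items_per_dim)

    # Logit of a correct response: a_j * (theta_{i, d(j)} - b_j).
    logits = a * (theta[:, dim_of_item] - b)
    prob = 1.0 / (1.0 + np.exp(-logits))
    responses = (rng.uniform(size=prob.shape) < prob).astype(int)
    return responses, theta, a, b, dim_of_item
```

A call such as `simulate_between_item_2pl(1000, 10, 3, rho=0.5)` returns a 1000 × 30 response matrix together with the generating parameters, which could then be passed to an MML, MCMC, or VB estimator to compare recovered and true difficulties, discriminations, and correlations.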

Highlights

Multidimensional item response theory (IRT) models are the method of choice for analyzing data from cognitive tests assessing multiple competencies
We focus on multidimensional IRT models with between-item dimensionality because we believe (1)
Note that these comparisons need to be interpreted with caution, as run times of Markov chain Monte Carlo (MCMC) and marginal maximum likelihood (MML) are heavily dependent on the software as well as the number of iterations and nodes employed, respectively.

Summary

Introduction

Multidimensional item response theory (IRT) models are the method of choice for analyzing data from cognitive tests assessing multiple competencies (e.g., science, mathematical literacy, and reading). Researchers can use Markov chain Monte Carlo (MCMC) samplers that were developed for specific IRT models, such as the Gibbs sampler by [7] for the Rasch model, or they can use general-purpose software for Bayesian estimation such as JAGS [8] or Stan [9]. In our experience, applied researchers typically employ the latter software packages because they offer high flexibility in model specification. This allows them to take the specifics of their data into account without the need to develop their own customized samplers (for which they may not have the time). The last point may be a disadvantage, at least when there is little research on the performance of these new techniques as implemented in the general-purpose software.

Journal: Psych
Publication Date: Feb 5, 2022
Citations: 2
License type: CC BY 4.0
