Abstract
The growth mixture model (GMM) is a flexible statistical technique for analyzing longitudinal data when there are unknown heterogeneous subpopulations with different growth trajectories. When individuals are nested within clusters, a multilevel growth mixture model (MGMM) should be used to account for the clustering effect. A review of recent literature shows that a higher level of nesting was described in 43% of articles using GMM, none of which used MGMM to account for the clustered data. We conjecture that researchers sometimes ignore the higher level to reduce analytical complexity, but in other situations, ignoring the nesting is unavoidable. This Monte Carlo study investigated whether the correct number of classes can still be retrieved when a higher level of nesting in MGMM is ignored. We investigated six commonly used model selection indices: the Akaike information criterion (AIC), consistent AIC (CAIC), Bayesian information criterion (BIC), sample size–adjusted BIC (SABIC), Vuong–Lo–Mendell–Rubin likelihood ratio test (VLMR), and adjusted Lo–Mendell–Rubin likelihood ratio test (ALMR). Results showed that the accuracy of class enumeration decreased for all six indices when the higher level was ignored. BIC, CAIC, and SABIC were the most effective model selection indices under the misspecified model. BIC and CAIC were preferable when the sample size was large and/or the intraclass correlation (ICC) was small, whereas SABIC performed better when the sample size was small and/or the ICC was large. In addition, SABIC and VLMR/ALMR tended to overextract the number of classes when there were more than two subpopulations and the sample size was large.
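The information criteria named above are simple functions of a fitted model's maximized log-likelihood, number of free parameters, and sample size. The Python sketch below is illustrative only (the function name and the log-likelihood values are hypothetical, not taken from this study); it shows how AIC, BIC, CAIC, and SABIC would be computed and compared across candidate class solutions. The VLMR and ALMR tests are omitted because they compare nested k-class and (k-1)-class solutions rather than penalizing a single fit.

```python
import math

def information_criteria(log_lik: float, n_params: int, n_obs: int) -> dict:
    """Compute the four information criteria compared in the study.

    log_lik  : maximized log-likelihood of the fitted mixture model
    n_params : number of freely estimated parameters (k)
    n_obs    : sample size (n); SABIC replaces n with (n + 2) / 24
    """
    neg2ll = -2.0 * log_lik
    return {
        "AIC":   neg2ll + 2.0 * n_params,                           # Akaike
        "BIC":   neg2ll + n_params * math.log(n_obs),               # Schwarz
        "CAIC":  neg2ll + n_params * (math.log(n_obs) + 1.0),       # consistent AIC
        "SABIC": neg2ll + n_params * math.log((n_obs + 2) / 24.0),  # sample size-adjusted BIC
    }

# Class enumeration: fit models with 1, 2, 3, ... classes and prefer the
# solution with the smallest criterion value (log-likelihoods are hypothetical).
fits = {1: (-5210.4, 8), 2: (-5107.9, 12), 3: (-5101.3, 16)}
for n_classes, (ll, k) in fits.items():
    print(n_classes, information_criteria(ll, k, n_obs=1000))
```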
Highlights
Multilevel growth mixture model (MGMM) is a relatively new modeling technique for extracting unknown subpopulations in multilevel longitudinal data
Bayesian information criterion (BIC) had the highest percentage of correct classification (98%), followed by sample size–adjusted BIC (SABIC) and consistent AIC (CAIC) (97%), adjusted Lo–Mendell–Rubin likelihood ratio test (ALMR) (90%), Vuong–Lo–Mendell–Rubin likelihood ratio test (VLMR) (89%), and Akaike information criterion (AIC) (76%)
SABIC had the highest percentage of correct classification (91%), followed by BIC (82%), ALMR (81%), VLMR (80%), CAIC (78%), and AIC (66%)
Summary
Multilevel growth mixture model (MGMM) is a relatively new modeling technique for extracting unknown subpopulations in multilevel longitudinal data. This technique integrates multilevel modeling, finite mixture modeling, and structural equation modeling The multilevel aspect of MGMM is attractive to applied researchers because longitudinal data are often collected through cluster sampling, which creates multilevel data structure with repeated measures nested within individuals and individuals further nested within organizations. It should be noted that other methods, such as item response theory (e.g., Bartolucci, Pennoni, & Vittadini, 2011), can be used to account for longitudinal data structure in the analysis
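To make this nesting structure concrete, the following sketch (Python with NumPy; all sample sizes, class proportions, and growth parameters are illustrative assumptions, not the design used in this study) generates two-level longitudinal data in which repeated measures are nested within individuals, individuals are nested within clusters, and each individual belongs to one of two latent trajectory classes.

```python
import numpy as np

rng = np.random.default_rng(42)

# Illustrative settings only (not the study's actual simulation design)
n_clusters, per_cluster, n_waves = 50, 20, 4
cluster_sd = 0.3                 # SD of the cluster-level random effect (drives the ICC)
class_probs = [0.6, 0.4]         # mixing proportions of the two latent classes
intercepts = [2.0, 4.0]          # class-specific mean intercepts
slopes = [0.5, -0.3]             # class-specific mean slopes

rows = []
for c in range(n_clusters):
    u_c = rng.normal(0.0, cluster_sd)                     # cluster effect
    for i in range(per_cluster):
        k = rng.choice(len(class_probs), p=class_probs)   # latent class membership
        b0 = intercepts[k] + rng.normal(0.0, 0.5)         # individual intercept
        b1 = slopes[k] + rng.normal(0.0, 0.1)             # individual slope
        for t in range(n_waves):                          # repeated measures
            y = b0 + b1 * t + u_c + rng.normal(0.0, 1.0)  # within-person residual
            rows.append((c, i, k, t, y))

data = np.array(rows)   # columns: cluster id, person id, true class, wave, outcome
print(data.shape)       # (4000, 5): 50 clusters x 20 persons x 4 waves
```

Ignoring the higher level in this setup corresponds to fitting a single-level GMM to these data without the cluster identifier, which is the misspecification examined in the article.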