Tackling Challenges in Data Pooling: Missing Data Handling in Latent Variable Models with Continuous and Categorical Indicators

Lihan Chen,Milica Miočević,Carl F Falk

doi:10.1080/10705511.2023.2300079

Abstract

Data pooling is a powerful strategy in empirical research. However, combining multiple datasets often results in a large amount of missing data, as variables that are not present in some datasets effectively contain missing values for all participants in those datasets. Furthermore, data pooling typically leads to a mix of continuous and categorical items with nonnormal multivariate distributions. We investigated two popular approaches to handle missing data in this context: (1) applying direct maximum likelihood by treating data as continuous (con-ML), and (2) applying categorical least squares using a polychoric correlation matrix computed from pairwise deletion (cat-LS). These approaches are available for free and relatively straightforward for empirical researchers to implement. Through simulation studies with confirmatory factor analysis and latent mediation analysis, we found cat-LS to be unsuitable for pooled data analysis, whereas con-ML yielded acceptable performance for the estimation of latent path coefficients barring severe nonnormality.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Tackling Challenges in Data Pooling: Missing Data Handling in Latent Variable Models with Continuous and Categorical Indicators

Abstract

Talk to us

Similar Papers

More From: Structural Equation Modeling: A Multidisciplinary Journal

Lead the way for us

Similar Papers

Estimating Hidden Parameters from a Dataset with Large Amount of Missing Data
Yuzhong Hu
-
Yuzhong HuYuzhong Hu
06 Dec 2022
06 Dec 2022

Handling Missing Data With Multilevel Structural Equation Modeling and Full Information Maximum Likelihood Techniques.
Donna L Schminkey ... Linda Bullock
Research in Nursing & Health | VOL. 39
Donna L Schminkey, et. al.Donna L Schminkey ... Linda Bullock
13 May 2016
Research in Nursing & Health | VOL. 39

Missing Network Data A Comparison of Different Imputation Methods
Robert W Krause ... Tom A.B Snijders
-
Robert W Krause, et. al.Robert W Krause ... Tom A.B Snijders
01 Aug 2018
01 Aug 2018

Phylogeny of Extant and Fossil Juglandaceae Inferred from the Integration of Molecular and Morphological Data Sets
Paul S Manos ... Sang-Hun Oh
Systematic Biology | VOL. 56
Paul S Manos, et. al.Paul S Manos ... Sang-Hun Oh
01 Jun 2007
Systematic Biology | VOL. 56

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Tackling Challenges in Data Pooling: Missing Data Handling in Latent Variable Models with Continuous and Categorical Indicators

Abstract

Talk to us

Similar Papers

More From: Structural Equation Modeling: A Multidisciplinary Journal