Covariate Selection for Multilevel Models with Missing Data.

Miguel Marino,Orfeu M Buxton,Yi Li

doi:10.1002/sta4.133

Abstract

Missing covariate data hampers variable selection in multilevel regression settings. Current variable selection techniques for multiply-imputed data commonly address missingness in the predictors through list-wise deletion and stepwise-selection methods which are problematic. Moreover, most variable selection methods are developed for independent linear regression models and do not accommodate multilevel mixed effects regression models with incomplete covariate data. We develop a novel methodology that is able to perform covariate selection across multiply-imputed data for multilevel random effects models when missing data is present. Specifically, we propose to stack the multiply-imputed data sets from a multiple imputation procedure and to apply a group variable selection procedure through group lasso regularization to assess the overall impact of each predictor on the outcome across the imputed data sets. Simulations confirm the advantageous performance of the proposed method compared with the competing methods. We applied the method to reanalyze the Healthy Directions-Small Business cancer prevention study, which evaluated a behavioral intervention program targeting multiple risk-related behaviors in a working-class, multi-ethnic population.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Covariate Selection for Multilevel Models with Missing Data.

Abstract

Talk to us

Similar Papers

More From: Stat (International Statistical Institute)

Lead the way for us

Journal: Stat (International Statistical Institute)	Publication Date: Jan 1, 2017
Citations: 8

Similar Papers

Application of variable selection and dimension reduction on predictors of MSE\u2019s development
Habtamu Tilaye Wubetie
Journal of Big Data | VOL. 6
Habtamu Tilaye WubetieHabtamu Tilaye Wubetie
18 Feb 2019
Application of variable selection and dimension reduction on predictors of MSE\u2019s development
Habtamu Tilaye Wubetie

Variable selection for multiply‐imputed data with application to dioxin exposure study
Qixuan Chen ... Sijian Wang
Statistics in Medicine | VOL. 32
Qixuan Chen, et. al.Qixuan Chen ... Sijian Wang
25 Mar 2013
Statistics in Medicine | VOL. 32

Variable Selection in the Presence of Missing Data: Imputation-based Methods.
Yize Zhao ... Qi Long
WIREs Computational Statistics | VOL. 9
Yize Zhao, et. al.Yize Zhao ... Qi Long
24 May 2017
WIREs Computational Statistics | VOL. 9

Dynr.mi: An R Program for Multiple Imputation in Dynamic Modeling.
Yanling Li ... Sy-Miin Chow
World academy of science, engineering and technology | VOL. 13
Yanling Li, et. al.Yanling Li ... Sy-Miin Chow
01 Jan 2019
World academy of science, engineering and technology | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Covariate Selection for Multilevel Models with Missing Data.

Abstract

Talk to us

Similar Papers

More From: Stat (International Statistical Institute)