Abstract

BackgroundGeneralized linear mixed models (GLMMs), typically used for analyzing correlated data, can also be used for smoothing by considering the knot coefficients from a regression spline as random effects. The resulting models are called semiparametric mixed models (SPMMs). Allowing the random knot coefficients to follow a normal distribution with mean zero and a constant variance is equivalent to using a penalized spline with a ridge regression type penalty. We introduce the least absolute shrinkage and selection operator (LASSO) type penalty in the SPMM setting by considering the coefficients at the knots to follow a Laplace double exponential distribution with mean zero.MethodsWe adopt a Bayesian approach and use the Markov Chain Monte Carlo (MCMC) algorithm for model fitting. Through simulations, we compare the performance of curve fitting in a SPMM using a LASSO type penalty to that of using ridge penalty for binary data. We apply the proposed method to obtain smooth curves from data on the relationship between the amount of pack years of smoking and the risk of developing chronic obstructive pulmonary disease (COPD).ResultsThe LASSO penalty performs as well as ridge penalty for simple shapes of association and outperforms the ridge penalty when the shape of association is complex or linear.ConclusionWe demonstrated that LASSO penalty captured complex dose-response association better than the Ridge penalty in a SPMM.

Highlights

  • Generalized linear mixed models (GLMMs), typically used for analyzing correlated data, can be used for smoothing by considering the knot coefficients from a regression spline as random effects

  • The smoothing parameter is selected by Generalized Cross Validation (GCV) [47] or Un-Biased Risk Estimator (UBRE)[14] or Akaike Information Criteria (AIC) [48] or Laplace approximation to Restricted Maximum Likelihood (REML) [14] or by regression splines with fixed degrees of freedom, the REML appeared to be most effective choice [46]

  • We report mean average squared distance (MASE), mean average 95% coverage probability (MACP), and mean average coverage length (MACL) measures for full curve and boundaries for each K, penalty and curve of x values

Read more

Summary

Introduction

Generalized linear mixed models (GLMMs), typically used for analyzing correlated data, can be used for smoothing by considering the knot coefficients from a regression spline as random effects. Allowing the random knot coefficients to follow a normal distribution with mean zero and a constant variance is equivalent to using a penalized spline with a ridge regression type penalty. We introduce the least absolute shrinkage and selection operator (LASSO) type penalty in the SPMM setting by considering the coefficients at the knots to follow a Laplace double exponential distribution with mean zero. To reduce bias resulting from mis-specifying the functional form and from the loss of efficiency in testing induced by categorizing continuous variables, the use of nonparametric (flexible) regression models is often recommended to model the effect of variables recorded on a continuous scale [1, 2]. An important issue is to select a suitable value for the smoothing parameter, which is not a trivial task

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call