Abstract

We group approaches to modeling correlated binary data accord ing to data recorded cross-sectionally as opposed to data recorded longi tudinally; according to models that are population-averaged as opposed to subject-specific; and according to data with time-dependent covariates as opposed to time-independent covariates. Standard logistic regression mod els are appropriate for cross-sectional data. However, for longitudinal data, methods such as generalized estimating equations (GEE) and generalized method of moments (GMM) are commonly used to fit population-averaged models, while random-effects models such as generalized linear mixed mod els (GLMM) are used to fit subject-specific models. Some of these methods account for time-dependence in covariates while others do not. This paper addressed these approaches with an illustration using a Medicare dataset as it relates to rehospitalization. In particular, we compared results from standard logistic models, GEE models, GMM models, and random-effects models by analyzing a binary outcome for four successive hospitalizations. We found that these procedures address differently the correlation among responses and the feedback from response to covariate. We found marginal GMM logistic regression models to be more appropriate when covariates are classified as time-dependent in comparison to GEE models. We also found conditional random-intercept models with time-dependent covariates decom posed into components to be more appropriate when time-dependent covari ates are present in comparison to ordinary random-effects models. We used the SAS procedures GLIMMIX, NLMIXED, IML, GENMOD, and LOGIS TIC to analyze the illustrative dataset, as well as unique programs written using the R language.

Highlights

  • There are not many suitable models for binary responses taken over time, when the data include correlation between responses and covariates that are time-dependent

  • We present marginal logistic regression models suitable for population-averaged inferences, including standard generalized estimating equations (GEE) models and the generalized method of moments (GMM) models which we present for analyzing correlated logistic regression models with time-dependent covariates based on Yin et al (2013)

  • They showed through a simulation study that when there are time-dependent covariates, some of the estimating equations applied by using the GEE method with an arbitrary working correlation structure are not valid

Read more

Summary

Introduction

There are not many suitable models for binary responses taken over time, when the data include correlation between responses and covariates that are time-dependent. For a subject-specific logistic regression model, the interpretations of parameter estimates relate to the odds ratio comparing two different covariate values for a single subject, given an individual baseline propensity for the response of interest. In this paper we considered the population-averaged and the subject-specific models, as well as time-dependent versus time-independent covariate models We addressed these modeling decisions in the context of predicting rehospitalization probabilities using a Medicare dataset. We chose to consider the effects on rehospitalization of total number of diagnoses, length of stay, total number of procedures, and the existence of coronary atherosclerosis We found these data attractive in that the responses are correlated and the covariates were time-dependent.

Standard Logistic Regression
Longitudinal Studies
Marginal Correlated Logistic Regression
Generalized Estimating Equations
Generalized Method of Moments
Model Fitting
Random Intercept Models
Random Slopes Models
Decomposition of Time-Dependent Covariates
Modeling Rehospitalization Data
Marginal Logistic Regression Models for Rehospitalization
Conditional Logistic Regression Models for Rehospitalization
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call