Abstract
We group approaches to modeling correlated binary data accord ing to data recorded cross-sectionally as opposed to data recorded longi tudinally; according to models that are population-averaged as opposed to subject-specific; and according to data with time-dependent covariates as opposed to time-independent covariates. Standard logistic regression mod els are appropriate for cross-sectional data. However, for longitudinal data, methods such as generalized estimating equations (GEE) and generalized method of moments (GMM) are commonly used to fit population-averaged models, while random-effects models such as generalized linear mixed mod els (GLMM) are used to fit subject-specific models. Some of these methods account for time-dependence in covariates while others do not. This paper addressed these approaches with an illustration using a Medicare dataset as it relates to rehospitalization. In particular, we compared results from standard logistic models, GEE models, GMM models, and random-effects models by analyzing a binary outcome for four successive hospitalizations. We found that these procedures address differently the correlation among responses and the feedback from response to covariate. We found marginal GMM logistic regression models to be more appropriate when covariates are classified as time-dependent in comparison to GEE models. We also found conditional random-intercept models with time-dependent covariates decom posed into components to be more appropriate when time-dependent covari ates are present in comparison to ordinary random-effects models. We used the SAS procedures GLIMMIX, NLMIXED, IML, GENMOD, and LOGIS TIC to analyze the illustrative dataset, as well as unique programs written using the R language.
Highlights
There are not many suitable models for binary responses taken over time, when the data include correlation between responses and covariates that are time-dependent
We present marginal logistic regression models suitable for population-averaged inferences, including standard generalized estimating equations (GEE) models and the generalized method of moments (GMM) models which we present for analyzing correlated logistic regression models with time-dependent covariates based on Yin et al (2013)
They showed through a simulation study that when there are time-dependent covariates, some of the estimating equations applied by using the GEE method with an arbitrary working correlation structure are not valid
Summary
There are not many suitable models for binary responses taken over time, when the data include correlation between responses and covariates that are time-dependent. For a subject-specific logistic regression model, the interpretations of parameter estimates relate to the odds ratio comparing two different covariate values for a single subject, given an individual baseline propensity for the response of interest. In this paper we considered the population-averaged and the subject-specific models, as well as time-dependent versus time-independent covariate models We addressed these modeling decisions in the context of predicting rehospitalization probabilities using a Medicare dataset. We chose to consider the effects on rehospitalization of total number of diagnoses, length of stay, total number of procedures, and the existence of coronary atherosclerosis We found these data attractive in that the responses are correlated and the covariates were time-dependent.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.