Abstract

Semi-parametric approaches based on generalized estimating equation (GEE) are widely used to analyse correlated outcomes. Most available softwares had been developed for longitudinal settings. In this paper, we present a R package CRTgeeDR for estimating parameters in marginal regression in cluster randomized trials (CRTs). Theory for adjusting for missing at random outcomes by inverse-probability weighting methods (IPW) based on the use of a propensity score had been largely studied and implemented. We exhibit that in CRTs most of the available softwares use an implementation of weights that lead to a bias in estimation if a non-independence working correlation structure is chosen. In CRTgeeDR, we solve this problem by using a different implementation while keeping the consistency properties of the IPW. Moreover, in CRTs using an augmented GEE (AUG) allow to improve efficiency by adjusting for treatment-covariate interactions and imbalance in baseline covariates between treatment groups using an outcome model. In CRTgeeDR, we extend the abilities of existing packages such as geepack and geeM to allow such data augmentation. Finally, one may want to combine IPW and AUG in a Doubly Robust (DR) estimator, which lead to consistent estimation when either the propensity score or the outcome model corresponds to the true data generation process (Prague, Wang, Stephens, Tchetgen Tchetgen, and De gruttola 2015). The DR approach is implemented in CRTgeeDR. Simulations studies demonstrate the consistency of IPW implemented in CRTgeeDR and the gains associated with the use of the DR for analyzing a binary outcome using a logit regression. Finally, we reanalyzed data from a sanitation CRT in developing countries (Guiteras, Levinsohn, and Mobarak 2015a) with the DR approach compared to classical GEE and demonstrated a signiffcant intervention effect.

Highlights

  • We describe the R package CRTgeeDR, for estimating coefficients of regression in a marginal mean model

  • The method is designed to analyze data collected in cluster randomized trials (CRTs) where 1) observations within a cluster may be correlated, 2) observations in separate clusters are independent, 3) a monotone transformation of expectation of the outcome is linearly related to the explanatory variables, and 4) treatment is randomized at a cluster level

  • We demonstrated that the inverse probability weighted generalized estimating equation (IPW) can be biased in cluster randomized trials with missing data (CRTs) if the weights are not implemented as described in Robins et al (1995) and a non-independence working correlation structure is chosen

Read more

Summary

Introduction

We describe the R package CRTgeeDR, for estimating coefficients of regression in a marginal mean model. Tchetgen Tchetgen et al (2012) made a similar comment regarding the analysis of incomplete longitudinal data in which time-varying covariates and previous outcome values are needed to model the missingness process This article clarifies this issue for CRTs and proposes an implementation in R that allows for unbiased IPW (and DR) estimation with non-independence working correlation structure. Recent advances in methods for analysis of data from CRTs have used augmented GEE to improve efficiency of inferences by incorporating baseline covariates (Stephens et al, 2012); we denote this estimator the AUG They have been extended to accommodate missing data using an approach based on the IPW which is doubly robust GEE (DR). If despite our concern about the implementation of weights, one wants to use the same implementation as in packages geepack or proc GENMOD in SAS, one can set typeweights="GENMOD"

Determine group-specific OM
Method
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call