Abstract
Causal inference with observational longitudinal data and time-varying exposures is often complicated by time-dependent confounding and attrition. The G-computation formula is one approach for estimating a causal effect in this setting. The parametric modeling approach typically used in practice relies on strong modeling assumptions for valid inference, and moreover depends on an assumption of missing at random, which is not appropriate when the missingness is missing not at random (MNAR) or due to death. In this work we develop a flexible Bayesian semi-parametric G-computation approach for assessing the causal effect on the subpopulation that would survive irrespective of exposure, in a setting with MNAR dropout. The approach is to specify models for the observed data using Bayesian additive regression trees, and then use assumptions with embedded sensitivity parameters to identify and estimate the causal effect. The proposed approach is motivated by a longitudinal cohort study on cognition, health, and aging, and we apply our approach to study the effect of becoming a widow on memory. We also compare our approach to several standard methods.
Highlights
Causal inference in non-randomised longitudinal studies with time-varying exposures is often complicated by time-dependent confounding and attrition
We propose a Bayesian semi-parametric (BSP) modelling approach based on Bayesian Additive Regression Trees (BART, Chipman, George, McCulloch, et al, 2010) for the observed data distribution
This paper has proposed a BSP framework for estimating the survivors average causal effect’ (SACE) with longitudinal cohort data
Summary
Causal inference in non-randomised longitudinal studies with time-varying exposures is often complicated by time-dependent confounding and attrition. Valid inference with the parametric G-formula requires correct model specification This can be extremely difficult when there is a large set of regressors, the relationship is non-linear and/or includes interaction terms and there are multiple observation times. Non- and semi-parametric estimation techniques that do not require prespecified distributional or functional forms of the data, have become popular in the causal inference literature Häggström, 2018; Hill, 2011; Karim et al, 2017; Kim et al, 2017; Tan & Roy, 2019; Wager & Athey, 2018) One such modelling strategy is Bayesian Additive Regression Trees (BART, Chipman, George, McCulloch, et al, 2010). BART does not rely on strong modelling assumptions and in contrast to other tree-based algorithms BART yields interval estimates for full posterior inference
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Journal of the Royal Statistical Society. Series C, Applied statistics
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.