Rose and van der Laan Respond to "Some Advantages of the Relative Excess Risk due to Interaction"

S Rose, M van der Laan

doi:10.1093/aje/kwt317

Abstract

We appreciate VanderWeele and Vansteelandt's perspective (1) on our article (2). Our commentary largely focused on a discussion of marginal estimators for case-control study designs not mentioned in VanderWeele and Vansteelandt's original article (3). In our presentation (2), we highlighted the case-control-weighted targeted maximum likelihood estimator (TMLE) (4–7) and Robins' “approximately valid” inverse-probability-weighted estimator for case-control data (8). We appreciate VanderWeele and Vansteelandt's continued dialogue on methods for case-control study designs, as well as their inclusion of a new double robust estimator in their commentary (1), since there is a strong need for more work in this area. In this response, we precisely frame the efficiency properties of the case-control-weighted TMLE, which have been discussed elsewhere (2, 4–7) but were not completely presented in VanderWeele and Vansteelandt's commentary and Web Appendix (available at http://aje.oxfordjournals.org/) (1) or in our original commentary (2). We also emphasize the need for flexible nonparametric estimators that incorporate machine learning in the modern “big data” era of epidemiology in large databases.

When defining our research question, we must be explicit about the model we are specifying. We wish to consider either a nonparametric model or a semiparametric model, thereby making fewer restrictive assumptions on our data-generating distribution than when imposing a parametric model. We are not limited to nonparametric statistical models, and we can make additional assumptions based on investigator knowledge in a semiparametric model. The efficiency claims made for the case-control-weighted TMLE are based on this nonparametric or semiparametric model (4–7). Before comparing the efficiency of estimators, it is important to agree on the model. Comparing parametric model efficiency with nonparametric or semiparametric model efficiency is not an apt comparison.

Our case-control weighting effectively maps a function of the full-data sampled observations into a function for the biased case-control sampled observations. It has been demonstrated that case-control weighting of the efficient TMLE for the full-data model leads to an efficient TMLE for the case-control model. The required regularity conditions have been described previously (5). The case-control-weighted TMLE with known prevalence probability is consistent if either the outcome regression or the exposure mechanism is consistently estimated, and it is efficient if both are consistently estimated. Notably, the estimator is not defined as the solution to an estimating equation, although it does solve the efficient influence curve estimating equation.

We also wish to underscore that using a nonparametric or semiparametric model is not a limitation; in fact, we consider it a compelling advantage. Especially when considering the advent of large data sets in epidemiology, researchers are increasingly interested in more flexible procedures that do not rely on restrictive parametric models. Since the goal is to have a statistical model that contains the true data distribution, assuming a nonparametric or semiparametric model may be preferable, as may using an estimator that allows for the incorporation of machine learning or ensembling methods (9, 10). This avoids the problems of 1) having more parameters than observations in a parametric model, 2) committing to a specific functional form of the data, and 3) attempting to represent complex relationships with a parametric regression.

Integrating machine learning methods and causal inference is a burgeoning field in statistical science, one with promising potential for new methodological innovation in epidemiology. Novel robust estimators for case-control studies are an important area of methodological work, and we look forward to future contributions from VanderWeele, Vansteelandt, and other investigators.
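
As an illustration of the case-control weighting idea described above, the following is a minimal sketch and not the TMLE itself. It assumes the weighting convention from the case-control-weighted TMLE literature cited as (4–7), which this response does not spell out: each case receives weight q0 (the known prevalence probability) and each control receives weight (1 - q0)/J, where J is the number of controls per case. The simulated population, its coefficients, and the names `case_control_weights`, `q0`, and `J` are hypothetical choices made only for this example. The sketch applies the weights to a simple mean (the exposure prevalence) to show how a full-data quantity can be recovered from a biased case-control sample.

```python
import math
import random


def case_control_weights(y, q0, J):
    """Assumed convention: weight q0 for cases (y == 1), (1 - q0) / J for controls."""
    return [q0 if yi == 1 else (1.0 - q0) / J for yi in y]


def expit(x):
    return 1.0 / (1.0 + math.exp(-x))


random.seed(0)

# Hypothetical full-data population: covariate W, exposure A, rare outcome Y.
N = 100_000
population = []
for _ in range(N):
    w = 1 if random.random() < 0.5 else 0
    a = 1 if random.random() < expit(-1 + w) else 0
    y = 1 if random.random() < expit(-4 + 1.5 * a + 0.5 * w) else 0
    population.append((w, a, y))

# Prevalence probability, treated as known, and the full-data target:
# the marginal exposure prevalence P(A = 1).
q0 = sum(y for _, _, y in population) / N
truth = sum(a for _, a, _ in population) / N

# Biased case-control sample: n_cases cases and J controls per case.
n_cases, J = 1500, 2
cases = [row for row in population if row[2] == 1]
controls = [row for row in population if row[2] == 0]
sample = random.sample(cases, n_cases) + random.sample(controls, n_cases * J)

exposures = [a for _, a, _ in sample]
weights = case_control_weights([y for _, _, y in sample], q0, J)

# The unweighted mean over-represents cases; the weighted mean does not.
unweighted = sum(exposures) / len(exposures)
weighted = sum(wt * a for wt, a in zip(weights, exposures)) / sum(weights)

print(f"full-data exposure prevalence: {truth:.3f}")
print(f"unweighted case-control mean:  {unweighted:.3f}")
print(f"case-control-weighted mean:    {weighted:.3f}")
```

Under this convention the weights sum to the number of cases, so the weighted mean equals q0 times the case average plus (1 - q0) times the control average, which is the full-data mean by the law of total expectation. A full case-control-weighted TMLE would apply the same weights within the estimation of the outcome regression and exposure mechanism and in the targeting step, none of which is attempted here.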

Journal: American Journal of Epidemiology
Publication Date: Jan 31, 2014
Citations: 3