In observational studies, identifying and adjusting for a sufficient set of confounders is crucial for accurately estimating the causal effect of the exposure on the outcome. Even in studies with large sample sizes, which typically benefit from small variances in estimates, there is a risk of producing estimates that are precisely inaccurate if the study suffers from systematic errors or biases, including confounding bias. To date, several approaches have been developed for selecting confounders. In this article, we first summarize the epidemiological and statistical approaches to identify a sufficient set of confounders. Particularly, we introduce the modified disjunctive cause criterion as one of the most useful approaches, which involves controlling for any pre-exposure covariate that affects the exposure, outcome, or both. It then excludes instrumental variables but includes proxies for the shared common cause of exposure and outcome. Statistical confounder selection is also useful when dealing with a large number of covariates, even in studies with small sample sizes. After introducing several approaches, we discuss some pitfalls and considerations in confounder selection, such as the adjustment for instrumental variables, intermediate variables, and baseline outcome variables. Lastly, as it is often difficult to comprehensively measure key confounders, we introduce two statistics, E-value and Robustness value, for assessing sensitivity to unmeasured confounders. Illustrated examples are provided using the National Health and Nutritional Examination Survey Epidemiologic Follow-up Study. Integrating these principles and approaches will enhance our understanding of confounder selection and facilitate better reporting and interpretation of future epidemiological studies.
Read full abstract