Abstract

SummaryValid estimation of a causal effect using instrumental variables requires that all of the instruments are independent of the outcome conditional on the risk factor of interest and any confounders. In Mendelian randomization studies with large numbers of genetic variants used as instruments, it is unlikely that this condition will be met. Any given genetic variant could be associated with a large number of traits, all of which represent potential pathways to the outcome which bypass the risk factor of interest. Such pleiotropy can be accounted for using standard multivariable Mendelian randomization with all possible pleiotropic traits included as covariates. However, the estimator obtained in this way will be inefficient if some of the covariates do not truly sit on pleiotropic pathways to the outcome. We present a method that uses regularization to identify which out of a set of potential covariates need to be accounted for in a Mendelian randomization analysis in order to produce an efficient and robust estimator of a causal effect. The method can be used in the case where individual-level data are not available and the analysis must rely on summary-level data only. It can be used where there are any number of potential pleiotropic covariates up to the number of genetic variants less one. We show the results of simulation studies that demonstrate the performance of the proposed regularization method in realistic settings. We also illustrate the method in an applied example which looks at the causal effect of urate plasma concentration on coronary heart disease.

Highlights

  • Instrumental variables can be used to estimate the causal effect of an exposure on an outcome from observational data

  • Only some of the traits will sit on pathways to the outcome which bypasses the risk factor. If this is the case, the estimator of the causal effect obtained by a multivariable Mendelian randomization analysis with all potential covariates included will be inefficient

  • It should be noted that the analysis here has been performed in a two-sample framework, implying that the confidence interval from the model chosen by the regularization method could be too narrow

Read more

Summary

INTRODUCTION

Instrumental variables can be used to estimate the causal effect of an exposure ( called a risk factor) on an outcome from observational data. Any given genetic variant could associate with a number of traits other than the risk factor of interest If any of these traits, which we refer to as covariates, associate with the outcome via pathways that bypass the risk factor, the third instrumental variables assumption is violated and estimates of the causal effect will be biased. Only some of the traits will sit on pathways to the outcome which bypasses the risk factor If this is the case, the estimator of the causal effect obtained by a multivariable Mendelian randomization analysis with all potential covariates included will be inefficient. We assess the performance of our approach in simulation studies and demonstrate it with an applied example that looks at the effect of plasma urate concentration on coronary heart disease

THE MODEL
Estimating the causal effect
Two-sample Mendelian randomization
Inference
SIMULATIONS
EXTENDING THE CAUSAL DIAGRAM
DISCUSSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call