Abstract

Mediation analysis with high-dimensional DNA methylation markers is important in identifying epigenetic pathways between environmental exposures and health outcomes. There have been some methodology developments of mediation analysis with high-dimensional mediators. However, high-dimensional mediation analysis methods for time-to-event outcome data are still yet to be developed. To address these challenges, we propose a new high-dimensional mediation analysis procedure for survival models by incorporating sure independent screening and minimax concave penalty techniques for variable selection, with the Sobel and the joint method for significance test of indirect effect. The simulation studies show good performance in identifying correct biomarkers, false discovery rate control, and minimum estimation bias of the proposed procedure. We also apply this approach to study the causal pathway from smoking to overall survival among lung cancer patients potentially mediated by 365,307 DNA methylations in the TCGA lung cancer cohort. Mediation analysis using a Cox proportional hazards model estimates that patients who have serious smoking history increase the risk of lung cancer through methylation markers including cg21926276, cg27042065, and cg26387355 with significant hazard ratios of 1.2497(95%CI: 1.1121, 1.4045), 1.0920(95%CI: 1.0170, 1.1726), and 1.1489(95%CI: 1.0518, 1.2550), respectively. The three methylation sites locate in the three genes which have been showed to be associated with lung cancer event or overall survival. However, the three CpG sites (cg21926276, cg27042065 and cg26387355) have not been reported, which are newly identified as the potential novel epigenetic markers linking smoking and survival of lung cancer patients. Collectively, the proposed high-dimensional mediation analysis procedure has good performance in mediator selection and indirect effect estimation.

Highlights

  • Mediation analysis based on counterfactuals has been widely used in understanding the causal pathways from an exposure to an outcome

  • We demonstrate the performance of the proposed procedure through mediator selection and indirect effect estimation

  • We proposed a high-dimensional mediation survival model utilizing the time-to-event outcome in place of binary outcome to enhance accuracy of variable selection and minimize the estimation bias

Read more

Summary

Introduction

Mediation analysis based on counterfactuals has been widely used in understanding the causal pathways from an exposure to an outcome. The relationships among exposure, mediator, and outcome can be characterized. The graphical model of causal mediation models can be illustrated by using a directed acyclic graph (DAG) [4]. Through mediating effect analysis, the total effect of an exposure on the outcome is decomposed into two parts. One is the natural direct effect, which is the effect of an exposure directly on the outcome that is not through mediators. Another part is the natural indirect effect, which describes the effect of an exposure on the outcome through the mediators

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call