Imputing missing covariates in time-to-event analysis within distributed research networks: A simulation study.

Dongdong Li, Jenna Wong, Sengwee Toh, Rui Wang, Xiaojuan Li

doi:10.1002/pds.5563

Abstract

In distributed research network (DRN) settings, multiple imputation cannot be directly implemented because pooling individual-level data is often not feasible. The performance of multiple imputation in combination with meta-analysis is not well understood within DRNs. To evaluate the performance of imputation for missing baseline covariate data in combination with meta-analysis for time-to-event analysis within DRNs, we compared four imputation algorithms through simulation studies motivated by a real-world data set: two parametric algorithms, an approximated linear imputation model (Approx) and a nonlinear substantive model compatible imputation model (SMC), and two non-parametric machine learning algorithms, random forest (RF) and classification and regression trees (CART). With small effect sizes (i.e., log-hazard ratios [logHRs]) and homogeneous missingness mechanisms across sites, all imputation methods produced unbiased and more efficient estimates, whereas the complete-case analysis could be biased and inefficient. Under heterogeneous missingness mechanisms, estimates from the RF method could have higher efficiency. Estimates from distributed imputation combined by meta-analysis were similar to those from imputation using pooled data. When logHRs were large, the SMC imputation algorithm generally performed better than the others. These findings suggest that imputation within DRNs is valid and feasible in the presence of missing covariate data in time-to-event analysis under various settings. The performance of the four imputation algorithms varied with the effect sizes and the level of missingness.
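The combination of site-level multiple imputation with meta-analysis described in the abstract can be illustrated with a minimal sketch. It assumes that each site fits its time-to-event model on every imputed data set and shares only the logHR estimates and their variances, that these are pooled within each site using Rubin's rules, and that the site-level results are then combined by a fixed-effect inverse-variance meta-analysis. The abstract does not state which meta-analysis model the authors used, so the fixed-effect choice, the function names, and all numbers below are hypothetical and serve only as illustration.

```python
import math


def rubin_pool(estimates, variances):
    """Pool M imputation-specific estimates at one site with Rubin's rules."""
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations with matching variances")
    q_bar = sum(estimates) / m                      # pooled point estimate
    u_bar = sum(variances) / m                      # within-imputation variance
    b = sum((q - q_bar) ** 2 for q in estimates) / (m - 1)  # between-imputation variance
    total_var = u_bar + (1 + 1 / m) * b             # total variance
    return q_bar, total_var


def fixed_effect_meta(site_estimates, site_variances):
    """Inverse-variance fixed-effect meta-analysis across sites (an assumption)."""
    weights = [1 / v for v in site_variances]
    pooled = sum(w * q for w, q in zip(weights, site_estimates)) / sum(weights)
    pooled_var = 1 / sum(weights)
    return pooled, pooled_var


# Hypothetical per-site results: (logHR estimates, variances) over 3 imputations.
site_results = {
    "site_a": ([0.20, 0.24, 0.22], [0.010, 0.011, 0.009]),
    "site_b": ([0.30, 0.26, 0.28], [0.020, 0.019, 0.021]),
}

# Step 1: each site pools its own imputations; no individual-level data leave the site.
pooled_sites = [rubin_pool(est, var) for est, var in site_results.values()]

# Step 2: the coordinating center combines the site-level summaries.
log_hr, var = fixed_effect_meta(
    [p[0] for p in pooled_sites], [p[1] for p in pooled_sites]
)
se = math.sqrt(var)
print(f"pooled logHR = {log_hr:.3f}, SE = {se:.3f}")
print(f"95% CI for HR: {math.exp(log_hr - 1.96 * se):.3f} to {math.exp(log_hr + 1.96 * se):.3f}")
```

Only summary statistics cross site boundaries in this sketch, which is the constraint that makes direct pooled imputation infeasible in a DRN. A random-effects model could replace the second step if between-site heterogeneity is a concern.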

Journal: Pharmacoepidemiology and Drug Safety