Abstract

Non-ignorable dropout is common in studies with long follow-up time, and it can bias study results unless handled carefully in the study design and the statistical analysis. A double-sampling design allocates additional resources to pursue a subsample of the dropouts and find out their outcomes, which can address potential biases due to non-ignorable dropout. It is desirable to construct semiparametric estimators for the double-sampling design because of their robustness properties. However, obtaining such semiparametric estimators remains a challenge due to the requirement of the analytic form of the efficient influence function (EIF), the derivation of which can be ad hoc and difficult for the double-sampling design. Recent work has shown how the derivation of EIF can be made deductive and computerizable using the functional derivative representation of the EIF in nonparametric models. This approach, however, requires deriving the mixture of a continuous distribution and a point mass, which can itself be challenging for complicated problems such as the double-sampling design. We propose semiparametric estimators for the survival probability in double-sampling designs by generalizing the deductive and computerizable estimation approach. In particular, we propose to build the semiparametric estimators based on a discretized support structure, which approximates the possibly continuous observed data distribution and circumvents the derivation of the mixture distribution. Our approach is deductive in the sense that it is expected to produce semiparametric locally efficient estimators within finite steps without knowledge of the EIF. We apply the proposed estimators to estimating the mortality rate in a double-sampling design component of the President’s Emergency Plan for AIDS Relief (PEPFAR) program. We evaluate the impact of double-sampling selection criteria on the mortality rate estimates. Simulation studies are conducted to evaluate the robustness of the proposed estimators.

Highlights

  • Studies with long follow-up often suffer from a high dropout rate

  • Semiparametric estimators (Newey, 1990; Tsiatis, 2007), which focus on modeling the parameter of interest and treat the rest of the model as nuisance parameters, have had great success in various areas (Cox, 1972; Liang and Zeger, 1986; Robins et al, 1994; Zhang et al, 2008), and they are a promising alternative to the parametric estimators for double-sampling designs

  • We proposed a deductive method to produce semiparametric estimators for estimating survival probability in the double-sampling design

Read more

Summary

Introduction

Studies with long follow-up often suffer from a high dropout rate. Dropouts can depend on the outcome of interest, even after adjusting for observed covariates. If a semiparametric estimator can be obtained deductively, human effort can be saved from difficult mathematical derivations and can be transferred, for example, to designing new studies Their approach is not directly applicable to the data analysis of double-sampling designs, because their estimation procedure requires analytically evaluating the parameter of interest at a mixture of a continuous distribution and a point mass.

Double-sampling design
Identification of parameter
Method
Proposed method: deductive estimation with discretized support
Simulation
Generative models
Simulation on consistency
Simulation with incorrect double-sampling model for S
Simulation on the impact of α
Application to PEPFAR
Discussion
Findings
A Robustness of the deductive estimator
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call