Survival data collected from a prevalent cohort are subject to left truncation and the analysis is challenging. Conditional approaches for left-truncated data could be inefficient as they ignore the information in the marginal likelihood of the truncation times. Length-biased sampling methods may improve the estimation efficiency but only when the underlying truncation time is uniform; otherwise, they may generate biased estimates. We propose a semiparametric method for left-truncated data under the Cox model with no parametric distributional assumption about the truncation times. Our approach is to make inference based on the conditional likelihood augmented with a pairwise likelihood, which eliminates the truncation distribution, yet retains the information about the regression coefficients and the baseline hazard function in the marginal likelihood. An iterative algorithm is provided to solve for the regression coefficients and the baseline hazard function simultaneously. By empirical process and U-process theories, it has been shown that the proposed estimator is consistent and asymptotically normal with a closed-form consistent variance estimator. Simulation studies show substantial efficiency gain of our estimator in both the regression coefficients and the cumulative baseline hazard function over the conditional approach estimator. When the uniform truncation assumption holds, our estimator enjoys smaller biases and efficiency comparable to that of the full maximum likelihood estimator. An application to the analysis of a chronic kidney disease cohort study illustrates the utility of the method.
Read full abstract