Length-biased data occur often in many scientific fields, including clinical trials, epidemiology surveys and genome-wide association studies, and many methods have been proposed for their analysis under various situations. In this article, we consider the situation where one faces length-biased and partly interval-censored failure time data under the proportional hazards model, for which it does not seem to exist an established method. For the estimation, we propose an efficient nonparametric maximum likelihood method by incorporating the distribution information of the observed truncation times. For the implementation of the method, a flexible and stable EM algorithm via two-stage data augmentation is developed. By employing the empirical process theory, we establish the asymptotic properties of the resulting estimators. A simulation study conducted to assess the finite-sample performance of the proposed method suggests that it works well and is more efficient than the conditional likelihood approach. An application to an AIDS cohort study is also provided.
Read full abstract