Abstract

BackgroundCompleteness of recording for cancer stage at diagnosis is often historically poor in cancer registries, making it challenging to provide long-term stage-specific survival estimates. Stage-specific survival differences are driven by differences in short-term prognosis, meaning estimated survival metrics using period analysis are unlikely to be sensitive to imputed historical stage data.MethodsWe used data from the Surveillance, Epidemiology, and End Results (SEER) Program for lung, colon and breast cancer. To represent missing data patterns in less complete registry data, we artificially inflated the proportion of missing stage information conditional on stage at diagnosis and calendar year of diagnosis. Period analysis was applied and missing stage at diagnosis information was imputed under four different conditions to emulate extreme imputed stage distributions.ResultsWe fit a flexible parametric model for each cancer stage on the excess hazard scale and the differences in stage-specific marginal relative survival were assessed. Estimates were also obtained from non-parametric approaches for validation. There was little difference between the 10-year stage-specific marginal relative survival estimates, regardless of the assumed historical stage distribution.ConclusionsWhen conducting a period analysis, multiple imputation can be used to obtain stage-specific long-term estimates of relative survival, even when the historical stage information is largely incomplete.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call