Abstract

Studies investigating the population-mixing hypothesis in childhood leukemia principally use two analytical approaches: (1) nonrandom selection of areas according to specific characteristics, followed by comparisons of their incidence of childhood leukemia with that expected based on the national average; and (2) regression analyses of region-wide data to identify characteristics associated with the incidence of childhood leukemia. These approaches have generated contradictory results. We compare these approaches using observed and simulated data. We generated 10,000 simulated regions using the correlation structure and distributions from a United Kingdom dataset. We simulated cases using a Poisson distribution with the incidence rate set to the national average assuming the null hypothesis that only population size drives the number of cases. Selection of areas within each simulated region was based on characteristics considered responsible for elevated infection rates (population density and inward migration) and/or elevated leukemia rates. We calculated effect estimates for 10,000 simulations and compared results to corresponding observed data analyses. When the selection of areas for analysis is based on apparent clusters of childhood leukemia, biased assessments occur; the estimated 5-year incidence of childhood leukemia ranged between zero and eight per 10,000 children in contrast to the simulated two cases per 10,000 children, similar to the observed data. Performing analyses on region-wide data avoids these biases. Studies using nonrandom selection to investigate the association between childhood leukemia and population mixing are likely to have generated biased findings. Future studies can avoid such bias using a region-wide analytical strategy. See video abstract at, http://links.lww.com/EDE/B431.

Highlights

  • Studies investigating the population-mixing hypothesis in childhood leukemia principally use two analytical approaches: (1) nonrandom selection of areas according to specific characteristics, followed by comparisons of their incidence of childhood leukemia with that expected based on the national average; and (2) regression analyses of region-wide data to identify characteristics associated with the incidence of childhood leukemia

  • Where selection was based on either a high incidence of leukemia, either alone or together with one or both exposures (Scenarios 3 and 6–15), the proportions of significant P values were substantially greater than the 5% that would be expected if the null were true

  • For analyses of data simulated under the null hypothesis, type I error rates of 2.8% to 3.7% were observed under Scenarios 1, 2, 4, and 5 (Table 1), consistent with random subregion selection (i.e., 3.5% type I error rate)

Read more

Summary

Introduction

Studies investigating the population-mixing hypothesis in childhood leukemia principally use two analytical approaches: (1) nonrandom selection of areas according to specific characteristics, followed by comparisons of their incidence of childhood leukemia with that expected based on the national average; and (2) regression analyses of region-wide data to identify characteristics associated with the incidence of childhood leukemia.

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call