We provide an overview of the All of Us Research Program, a National Institutes of Health funded research database, and report on the utility of this database for radiation oncology research. The All of Us Research Program aims to create a large and diverse health database with participants from across the US. Participants consent to join the database and agree to share electronic health records, complete surveys, provide physical measurements, and donate at least one biospecimen. In this observational study, we used the public data browser feature of the All of Us dataset to evaluate its utility for future radiation oncology research. Within the database, we report the number of cases of common cancers in the US, identified using ICD10 codes, and the number of patients treated with common radiotherapy procedures, identified using CPT4 codes. We then qualitatively report on additional data of interest. Of note, public patient counts in All of Us are rounded to the nearest multiple of 20. Additional tiers of data access exist, including individual-level data, which were not used in this survey of public data. The database includes 372,380 participants, of whom 46,380 have a diagnosis of cancer. Within the identified top 10 cancers according to the ACS, there were a total of 26,540 primary malignant cancers. The most common cancers were breast (ICD10 - C50 [n = 6,960]), prostate (ICD10 - C61 [n = 4,500]), non-Hodgkin lymphoma (ICD10 - C85 [n = 2,860]), lung and bronchus (ICD10 - C34 [n = 2,000]), colon (ICD10 - C18 [n = 1,940]), melanoma of the skin (ICD10 - C43 [n = 1,740]), thyroid (ICD10 - C73 [n = 1,720]), kidney (ICD10 - C64 [n = 1,300]), and urinary bladder (ICD10 - C67 [n = 1,140]).
The most common radiation therapy procedures were computerized tomography simulation (CPT - 77290 [n = 4,040]), intensity modulated radiation therapy (CPT - 77386 [n = 780]), 3D conformal radiation therapy (CPT - 77412 [n = 1,720]), stereotactic body radiation therapy (CPT - 77373 [n = 400]), brachytherapy (CPT - 77770-77772 [n = 240]), and proton therapy (CPT - 77520-77525 [n = 140]). Qualitatively, other data within All of Us include laboratory values, drug exposures, comorbidities, other procedures, physical measurements, and wearable biometric (Fitbit) data. Genomic data typically include germline testing using whole genome sequencing or genotyping arrays. Patient surveys include data on personal medical history, family health history, lifestyle, social determinants of health, COVID-19 experience, and overall health. Potential outcome variables of interest to oncology research in the database include survival, cause of death, additional procedures or diagnoses after cancer treatment, and quality of life metrics derived from the survey instruments. The All of Us Research Program database includes a large number of cancer cases, of which a substantial number received radiotherapy. Future work will address focused, hypothesis-driven research questions.