Abstract

642 Background: Large genomic databases linked to electronic health records promise to shed light on molecular mechanisms underlying rare diseases, such as pancreatic cancer. However, accurately identifying patients with the desired phenotype can be challenging. This is particularly the case for pancreatic tumors, since ICD codes do not distinguish between pancreatic adenocarcinoma (PDAC) and pancreatic neuroendocrine tumors (pNET). Previous studies have shown that ICD codes aggregated by phenotype, known as “phecodes”, have a higher accuracy in identifying specific phenotypes than ICD codes themselves; however, their performance in identifying cancers of the pancreas has not been studied. Methods: From a large deidentified genomic database, two queries were performed to identify all adults with pancreatic cancer for a GWAS study, one using ICD-9/10 codes and the other using phecodes. The medical records for all patients identified from both queries were then reviewed to confirm the presence and histologic type of pancreatic cancer. Results: Of the 91,985 genotyped adults in the database, ICD-9/10 codes identified 1,247 patients with pancreatic cancer, compared with only 422 patients identified by the phecode query. All patients in the phecode cohort were also found in the ICD cohort. Of the 1,247 patients in the ICD cohort, 760 were confirmed to have pancreatic cancer on review of the health records (594 with PDAC, 166 with pNET) whereas in the phecode cohort, only 251 were confirmed to have pancreatic cancer (159 with PDAC, 92 pNET). The positive predictive value (PPV) for PDAC in the ICD query was 47%, compared with 38% for the phecode cohort. The ICD and phecode cohorts had similarly low numbers of pre-malignant cystic tumors (5% in each cohort) and other periampullary cancers (3%). Conclusions: In this large genomic database, the use of ICD-9/10 codes for pancreatic cancer was able to identify nearly three times as many patients with pancreatic cancer and had a higher PPV compared to using phecodes. Therefore, ICD codes, rather than phecodes, should be used to identify patients with pancreatic cancer for subsequent genotyping analysis, though caution is required because the PPV is still low.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.