Abstract

BackgroundThere are no evidence-based guidelines for data cleaning of electronic health record (EHR) databases in Parkinson's disease (PD). Previous filtering criteria have primarily used the 9th International Statistical Classification of Diseases and Related Health Problems (ICD) with variable accuracy for true PD cases. Prior studies have not excluded atypical or drug-induced parkinsonism, and little is known about differences in accuracy by race. ObjectiveTo determine if excluding parkinsonism diagnoses improves accuracy of ICD-9 and -10 PD diagnosis codes. MethodsWe included ≥2 instances of an ICD-9 and/or −10 code for PD. We removed any records with at least one code indicating atypical or drug-induced parkinsonism first in all races, and then in Non-Hispanic White and Black patients. We manually reviewed 100 randomly selected charts per group before and after filtering, and performed a test of proportion (null hypothesis 0.5) for confirmed PD. Results5633 records had ≥2 instances of a PD code. 2833 remained after filtering. The rate of true PD cases was low before and after filtering to remove parkinsonism codes (0.55 vs. 0.51, p = 0.84). Accuracy was lowest in Black patients before filtering (0.48, p = 0.69), but filtering had a greater (though modest) impact on accuracy (0.68, p < 0.001). ConclusionsThere was inadequate accuracy of PD diagnosis codes in the largest study of ICD-9 and -10 codes. Accuracy was lowest in Black patients but improved the most with removing other parkinsonism codes. This highlights the limitations of using current real-world EHR data in PD research and need for further study.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call