Abstract
ObjectivesAlgorithms have been developed to identify rheumatoid arthritis-interstitial lung disease (RA-ILD) in administrative data with positive predictive values (PPVs) between 70 and 80%. We hypothesized that including ILD-related terms identified within chest computed tomography (CT) reports through text mining would improve the PPV of these algorithms in this cross-sectional study. MethodsWe identified a derivation cohort of possible RA-ILD cases (n = 114) using electronic health record data from a large academic medical center and performed medical record review to validate diagnoses (reference standard). ILD-related terms (e.g., ground glass, honeycomb) were identified in chest CT reports by natural language processing. Administrative algorithms including diagnostic and procedural codes as well as specialty were applied to the cohort both with and without the requirement for ILD-related terms from CT reports. We subsequently analyzed similar algorithms in an external validation cohort of 536 participants with RA. ResultsThe addition of ILD-related terms to RA-ILD administrative algorithms increased the PPV in both the derivation (improvement ranging from 3.6 to 11.7%) and validation cohorts (improvement 6.0 to 21.1%). This increase was greatest for less stringent algorithms. Administrative algorithms including ILD-related terms from CT reports exceeded a PPV of 90% (maximum 94.6% derivation cohort). Increases in PPV were accompanied by a decline in sensitivity (validation cohort -3.9 to -19.5%). ConclusionsThe addition of ILD-related terms identified by text mining from chest CT reports led to improvements in the PPV of RA-ILD algorithms. With high PPVs, use of these algorithms in large data sets could facilitate epidemiologic and comparative effectiveness research in RA-ILD.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.