Building bridges across electronic health record systems through inferred phenotypic topics

You Chen,Joydeep Ghosh,Cosmin Adrian Bejan,Carl A Gunter,Siddharth Gupta,Abel Kho,David Liebovitz,Jimeng Sun,Joshua Denny,Bradley Malin

doi:10.1016/j.jbi.2015.03.011

Abstract

ObjectiveData in electronic health records (EHRs) is being increasingly leveraged for secondary uses, ranging from biomedical association studies to comparative effectiveness. To perform studies at scale and transfer knowledge from one institution to another in a meaningful way, we need to harmonize the phenotypes in such systems. Traditionally, this has been accomplished through expert specification of phenotypes via standardized terminologies, such as billing codes. However, this approach may be biased by the experience and expectations of the experts, as well as the vocabulary used to describe such patients. The goal of this work is to develop a data-driven strategy to (1) infer phenotypic topics within patient populations and (2) assess the degree to which such topics facilitate a mapping across populations in disparate healthcare systems. MethodsWe adapt a generative topic modeling strategy, based on latent Dirichlet allocation, to infer phenotypic topics. We utilize a variance analysis to assess the projection of a patient population from one healthcare system onto the topics learned from another system. The consistency of learned phenotypic topics was evaluated using (1) the similarity of topics, (2) the stability of a patient population across topics, and (3) the transferability of a topic across sites. We evaluated our approaches using four months of inpatient data from two geographically distinct healthcare systems: (1) Northwestern Memorial Hospital (NMH) and (2) Vanderbilt University Medical Center (VUMC). ResultsThe method learned 25 phenotypic topics from each healthcare system. The average cosine similarity between matched topics across the two sites was 0.39, a remarkably high value given the very high dimensionality of the feature space. The average stability of VUMC and NMH patients across the topics of two sites was 0.988 and 0.812, respectively, as measured by the Pearson correlation coefficient. Also the VUMC and NMH topics have smaller variance of characterizing patient population of two sites than standard clinical terminologies (e.g., ICD9), suggesting they may be more reliably transferred across hospital systems. ConclusionsPhenotypic topics learned from EHR data can be more stable and transferable than billing codes for characterizing the general status of a patient population. This suggests that EHR-based research may be able to leverage such phenotypic topics as variables when pooling patient populations in predictive models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Building bridges across electronic health record systems through inferred phenotypic topics

Abstract

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics

Lead the way for us

Journal: Journal of Biomedical Informatics	Publication Date: Apr 1, 2015
Citations: 55

Similar Papers

REDCap on FHIR: Clinical Data Interoperability Services
A.C Cheng ... P.A Harris
Journal of Biomedical Informatics | VOL. 121
A.C Cheng, et. al.A.C Cheng ... P.A Harris
21 Jul 2021
Journal of Biomedical Informatics | VOL. 121

Rapid identification of chronic kidney disease in electronic health record database using computable phenotype combining a common data model.
Huai-Yu Wang ... Chao Yang
Chinese medical journal | VOL. 136
Huai-Yu Wang, et. al.Huai-Yu Wang ... Chao Yang
05 Apr 2023
Chinese medical journal | VOL. 136

Medical Student Use of Electronic and Paper Health Records During Inpatient Clinical Clerkships: Results of a National Longitudinal Study.
Lauren M Foster ... Maya M Hammoud
Academic medicine : journal of the Association of American Medical Colleges | VOL. 93
Lauren M Foster, et. al.Lauren M Foster ... Maya M Hammoud
01 Nov 2018
Academic medicine : journal of the Association of American Medical Colleges | VOL. 93

HIR Collaborating with the CODATA Conference
Hyejung Chang ... William T F Goossen
Healthcare Informatics Research | VOL. 19
Hyejung Chang, et. al.Hyejung Chang ... William T F Goossen
01 Jan 2013
Healthcare Informatics Research | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Building bridges across electronic health record systems through inferred phenotypic topics

Abstract

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics