Identify Patients with Congestive Heart Failure through Analyzing Free-Text Clinical Notes

Margot Yann, Therese Stukel, Karen Tu, Liisa Jaakkimainen

doi:10.23889/ijpds.v3i4.1032

Abstract

Introduction

A number of challenges exist in analyzing unstructured free-text data in electronic medical records (EMRs). EMR text is difficult to represent and model due to its high dimensionality, heterogeneity, sparsity, incompleteness, random errors and the presence of noise.

Objectives and Approach

Standard Natural Language Processing (NLP) tools make errors when applied to clinical notes because physicians use unconventional language involving polysemy, abbreviations, ambiguity, misspellings, variations, and negation. This paper presents a novel NLP framework, “Clinical Learning On Natural Expression” (CLONE), which learns automatically from a large primary care EMR database by analyzing free-text clinical notes from primary care practices. CLONE builds predictive clinical models using text mining and a neural network approach to extract features and identify patterns. To demonstrate its effectiveness, we evaluate CLONE in a case study on its ability to identify patients with a specific chronic condition: congestive heart failure (CHF).

Results

A randomly selected sample of 7500 patients from the Electronic Medical Record Administrative data Linked Database (EMRALD) is used. In this dataset, each patient’s medical chart includes a reference standard, manually reviewed by medical practitioners. The prevalence of CHF is approximately 2%. This low prevalence leads to another challenging problem in machine learning: imbalanced datasets. After pre-processing, we build deep learning models to represent and extract important medical information from free text and to identify CHF patients by analyzing patient charts. We evaluated the effectiveness of CLONE by comparing its predicted labels with the reference standard on a holdout test dataset. Compared with a number of alternative algorithms, CLONE improves the overall accuracy to over 90% on the test dataset.

Conclusion/Implications

As the role of NLP in EMR data expands, the CLONE natural language processing framework can lead to a substantial reduction in manual processing while improving predictive accuracy.
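The evaluation step described in the abstract (comparing predicted labels with the reference standard on a holdout set, at roughly 2% CHF prevalence) can be illustrated with a minimal Python sketch. This is not the CLONE implementation: the `Confusion` class, the `confusion` helper, and the 1000-chart holdout set are hypothetical and chosen only to show the arithmetic. It shows why imbalance matters: a classifier that never predicts CHF reaches 98% accuracy at 2% prevalence while detecting no cases, so sensitivity and positive predictive value are worth reporting alongside accuracy.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    """Counts of a binary confusion matrix (1 = CHF, 0 = no CHF)."""
    tp: int
    fp: int
    fn: int
    tn: int

    @property
    def accuracy(self) -> float:
        total = self.tp + self.fp + self.fn + self.tn
        return (self.tp + self.tn) / total if total else 0.0

    @property
    def sensitivity(self) -> float:
        # Share of true CHF patients that were flagged.
        positives = self.tp + self.fn
        return self.tp / positives if positives else 0.0

    @property
    def ppv(self) -> float:
        # Share of flagged patients who truly have CHF.
        flagged = self.tp + self.fp
        return self.tp / flagged if flagged else 0.0


def confusion(reference: list[int], predicted: list[int]) -> Confusion:
    """Compare predicted labels with reference-standard labels."""
    if len(reference) != len(predicted):
        raise ValueError("label lists must have the same length")
    pairs = list(zip(reference, predicted))
    return Confusion(
        tp=sum(1 for r, p in pairs if r == 1 and p == 1),
        fp=sum(1 for r, p in pairs if r == 0 and p == 1),
        fn=sum(1 for r, p in pairs if r == 1 and p == 0),
        tn=sum(1 for r, p in pairs if r == 0 and p == 0),
    )


# Hypothetical holdout set (illustrative only, not study data):
# 1000 charts, 20 of them with CHF, i.e. 2% prevalence.
reference = [1] * 20 + [0] * 980

# Trivial baseline that labels every chart as "no CHF".
always_negative = [0] * 1000
baseline = confusion(reference, always_negative)

print(f"accuracy={baseline.accuracy:.2f} "
      f"sensitivity={baseline.sensitivity:.2f} ppv={baseline.ppv:.2f}")
```

The figures in this sketch are assumptions for illustration and say nothing about the results reported in the study.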

Journal: International Journal of Population Data Science
Publication Date: Sep 11, 2018
License type: CC BY-NC-ND 4.0

