Coronary artery disease risk assessment from unstructured electronic health records using text mining

Jitendra Jonnagaddala,Siaw-Teng Liaw,Pradeep Ray,Manish Kumar,Nai-Wen Chang,Hong-Jie Dai

doi:10.1016/j.jbi.2015.08.003

Abstract

Coronary artery disease (CAD) often leads to myocardial infarction, which may be fatal. Risk factors can be used to predict CAD, which may subsequently lead to prevention or early intervention. Patient data such as co-morbidities, medication history, social history and family history are required to determine the risk factors for a disease. However, risk factor data are usually embedded in unstructured clinical narratives if the data is not collected specifically for risk assessment purposes. Clinical text mining can be used to extract data related to risk factors from unstructured clinical notes. This study presents methods to extract Framingham risk factors from unstructured electronic health records using clinical text mining and to calculate 10-year coronary artery disease risk scores in a cohort of diabetic patients. We developed a rule-based system to extract risk factors: age, gender, total cholesterol, HDL-C, blood pressure, diabetes history and smoking history. The results showed that the output from the text mining system was reliable, but there was a significant amount of missing data to calculate the Framingham risk score. A systematic approach for understanding missing data was followed by implementation of imputation strategies. An analysis of the 10-year Framingham risk scores for coronary artery disease in this cohort has shown that the majority of the diabetic patients are at moderate risk of CAD.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Biomedical Informatics	Publication Date: Aug 28, 2015
Citations: 80	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Coronary artery disease risk assessment from unstructured electronic health records using text mining

Abstract

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics

Lead the way for us

Similar Papers

Identifying the Vulnerable Patient with Rupture-Prone Plaque
Howard S Weintraub
The American Journal of Cardiology | VOL. 101
Howard S WeintraubHoward S Weintraub
01 Jun 2008
The American Journal of Cardiology | VOL. 101

Ascending Aorta 4D Time to Peak Distention Sexual Dimorphism and Association with Coronary Plaque Burden Severity in Women.
Ahmed H Hamimi ... Reham M Elgarf
Journal of cardiovascular translational research | VOL. 17
Ahmed H Hamimi, et. al.Ahmed H Hamimi ... Reham M Elgarf
09 Aug 2023
Journal of cardiovascular translational research | VOL. 17

The Role of the Framingham Risk Score to Predict the Presence of Subclinical Coronary Atherosclerosis in Patients with HIV Infection
Rosario Rossi ... Gabriella Orlando
JAIDS Journal of Acquired Immune Deficiency Syndromes | VOL. 52
Rosario Rossi, et. al.Rosario Rossi ... Gabriella Orlando
01 Oct 2009
JAIDS Journal of Acquired Immune Deficiency Syndromes | VOL. 52

Prevalence of Subclinical Coronary Artery Disease in Masters Endurance Athletes With a Low Atherosclerotic Risk Profile.
Ahmed Merghani ... Rachel Bastiaenan
Circulation | VOL. 136
Ahmed Merghani, et. al.Ahmed Merghani ... Rachel Bastiaenan
02 May 2017
Circulation | VOL. 136

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Coronary artery disease risk assessment from unstructured electronic health records using text mining

Abstract

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics