Developing a Classification Algorithm for Prediabetes Risk Detection From Home Care Nursing Notes: Using Natural Language Processing.

Eunjoo Jeon,Jisoo Lee,Hyunsook Heo,Hana Lee,Aeri Kim,Kyungmi Woo

doi:10.1097/cin.0000000000001000

Abstract

This study developed and validated a rule-based classification algorithm for prediabetes risk detection using natural language processing from home care nursing notes. First, we developed prediabetes-related symptomatic terms in English and Korean. Second, we used natural language processing to preprocess the notes. Third, we created a rule-based classification algorithm with 31 484 notes, excluding 315 instances of missing data. The final algorithm was validated by measuring accuracy, precision, recall, and the F1 score against a gold standard testing set (400 notes). The developed terms comprised 11 categories and 1639 words in Korean and 1181 words in English. Using the rule-based classification algorithm, 42.2% of the notes comprised one or more prediabetic symptoms. The algorithm achieved high performance when applied to the gold standard testing set. We proposed a rule-based natural language processing algorithm to optimize the classification of the prediabetes risk group, depending on whether the home care nursing notes contain prediabetes-related symptomatic terms. Tokenization based on white space and the rule-based algorithm were brought into effect to detect the prediabetes symptomatic terms. Applying this algorithm to electronic health records systems will increase the possibility of preventing diabetes onset through early detection of risk groups and provision of tailored intervention.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Developing a Classification Algorithm for Prediabetes Risk Detection From Home Care Nursing Notes: Using Natural Language Processing.

Abstract

Talk to us

Similar Papers

More From: CIN: Computers, Informatics, Nursing

Lead the way for us

Journal: CIN: Computers, Informatics, Nursing	Publication Date: Jul 1, 2023
Citations: 1

Similar Papers

Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records.
Wenbo Wu ... Elham Mahmoudi
Health services research | VOL. 58
Wenbo Wu, et. al.Wenbo Wu ... Elham Mahmoudi
03 Aug 2023
Health services research | VOL. 58

A rule-based classification algorithm: A rough set approach
Chia-Chi Liao ... Kuo-Wei Hsu
-
Chia-Chi Liao, et. al. Chia-Chi Liao ... Kuo-Wei Hsu
01 Jul 2012
01 Jul 2012

Identifying lupus patients in electronic health records: Development and validation of machine learning algorithms and application of rule-based algorithms
April Jorge ... Candace H Feldman
Seminars in Arthritis and Rheumatism | VOL. 49
April Jorge, et. al.April Jorge ... Candace H Feldman
04 Jan 2019
Seminars in Arthritis and Rheumatism | VOL. 49

Development and application of pharmacological statin-associated muscle symptoms phenotyping algorithms using structured and unstructured electronic health records data.
Boguang Sun ... Meijia Song
JAMIA Open | VOL. 6
Boguang Sun, et. al.Boguang Sun ... Meijia Song
04 Oct 2023
JAMIA Open | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Developing a Classification Algorithm for Prediabetes Risk Detection From Home Care Nursing Notes: Using Natural Language Processing.

Abstract

Talk to us

Similar Papers

More From: CIN: Computers, Informatics, Nursing