Using a natural language processing toolkit to classify electronic health records by psychiatric diagnosis.

Alissa Hutto,Tarek M Zikry,Buck Bohac,Terra Rose,Jasmine Staebler,Janet Slay,C Ray Cheever,Michael R Kosorok,Rebekah P Nash

doi:10.1177/14604582241296411

Abstract

Objective: We analyzed a natural language processing (NLP) toolkit's ability to classify unstructured EHR data by psychiatric diagnosis. Expertise can be a barrier to using NLP. We employed an NLP toolkit (CLARK) created to support studies led by investigators with a range of informatics knowledge. Methods: The EHR of 652 patients were manually reviewed to establish Depression and Substance Use Disorder (SUD) labeled datasets, which were split into training and evaluation datasets. We used CLARK to train depression and SUD classification models using training datasets; model performance was analyzed against evaluation datasets. Results: The depression model accurately classified 69% of records (sensitivity = 0.68, specificity = 0.70, F1 = 0.68). The SUD model accurately classified 84% of records (sensitivity = 0.56, specificity = 0.92, F1 = 0.57). Conclusion: The depression model performed a more balanced job, while the SUD model's high specificity was paired with a low sensitivity. NLP applications may be especially helpful when combined with a confidence threshold for manual review.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Using a natural language processing toolkit to classify electronic health records by psychiatric diagnosis.

Abstract

Talk to us

Similar Papers

More From: Health informatics journal

Lead the way for us

Journal: Health informatics journal	Publication Date: Oct 1, 2024
License type: CC BY-NC 4.0

Similar Papers

Grammar rule-based sentiment categorisation model for classification of Tamil tweets
Nadana Ravishankar ... R Shriram
International Journal of Intelligent Systems Technologies and Applications | VOL. 17
Nadana Ravishankar, et. al.Nadana Ravishankar ... R Shriram
01 Jan 2018
International Journal of Intelligent Systems Technologies and Applications | VOL. 17

Grammar rule-based sentiment categorisation model for classification of Tamil tweets
Nadana Ravishankar ... R Shriram
International Journal of Intelligent Systems Technologies and Applications | VOL. 17
Nadana Ravishankar, et. al.Nadana Ravishankar ... R Shriram
01 Jan 2018
International Journal of Intelligent Systems Technologies and Applications | VOL. 17

Hospital Admission Rate, Cumulative Hospitalized Days, and Time to Admission Among Older Persons With Substance Use and Psychiatric Conditions.
Wossenseged Birhane Jemberie ... Dennis Mccarty
Frontiers in psychiatry | VOL. 13
Wossenseged Birhane Jemberie, et. al.Wossenseged Birhane Jemberie ... Dennis Mccarty
22 Apr 2022
Frontiers in psychiatry | VOL. 13

UNLT: Urdu Natural Language Toolkit
Jawad Shafi ... Hafiz Rizwan Iqbal
Natural Language Engineering | VOL. 29
Jawad Shafi, et. al.Jawad Shafi ... Hafiz Rizwan Iqbal
19 Jan 2022
Natural Language Engineering | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Using a natural language processing toolkit to classify electronic health records by psychiatric diagnosis.

Abstract

Talk to us

Similar Papers

More From: Health informatics journal