Abstract

The assessment of written medical examinations is a tedious and expensive process, requiring significant amounts of time from medical experts. Our objective was to develop a natural language processing (NLP) system that can expedite the assessment of unstructured answers in medical examinations by automatically identifying relevant concepts in examinee responses. Our NLP system, the Intelligent Clinical Text Evaluator (INCITE), is semi-supervised in nature. Learning from a limited set of fully annotated examples, it sequentially applies a series of customized text comparison and similarity functions to determine whether a text span represents an entry in a given reference standard. Combinations of fuzzy matching and set intersection-based methods capture both inexact matches and fragmented concepts. Customizable, dynamic similarity-based matching thresholds allow the system to be tailored for examinee responses of different lengths. INCITE achieved an average F1-score of 0.89 (precision = 0.87, recall = 0.91) against human annotations on held-out evaluation data. Fuzzy text matching, dynamic thresholding, and the incorporation of supervision using annotated data produced the largest gains in performance. Long and non-standard expressions are difficult for INCITE to detect, but the problem is mitigated by dynamic thresholding (i.e., varying the similarity threshold a text span must reach to be considered a match). Annotation variations within exams and disagreements between annotators were the primary causes of false positives. Small amounts of annotated data can significantly improve system performance. The high performance and interpretability of INCITE should substantially aid the assessment process and help mitigate the impact of manual assessment inconsistencies.
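To make the matching strategy concrete, the sketch below illustrates the general idea of combining fuzzy (character-level) matching with set intersection over tokens, gated by a length-dependent ("dynamic") threshold. This is not the authors' implementation: the function names, the Jaccard formulation, and the threshold values are all hypothetical, chosen only to show how such a pipeline could behave.

```python
# Illustrative sketch (not INCITE's actual code): fuzzy matching plus
# token-set intersection with a length-dependent threshold, in the
# spirit of the approach the abstract describes. All names and
# numeric thresholds here are assumptions.
from difflib import SequenceMatcher


def fuzzy_ratio(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] via difflib."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def token_overlap(a: str, b: str) -> float:
    """Set-intersection (Jaccard) similarity over word tokens;
    helps catch fragmented or reordered concept mentions."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def dynamic_threshold(span: str, base: float = 0.85, floor: float = 0.70) -> float:
    """Hypothetical dynamic threshold: relax the required similarity
    for longer spans, where exact character agreement is less likely."""
    n_tokens = len(span.split())
    return max(floor, base - 0.02 * max(0, n_tokens - 3))


def matches_reference(span: str, reference: str) -> bool:
    """A span counts as a match if either similarity function
    clears the length-adjusted threshold."""
    threshold = dynamic_threshold(span)
    score = max(fuzzy_ratio(span, reference), token_overlap(span, reference))
    return score >= threshold


# Example: an inexact examinee phrase matched against a reference entry.
print(matches_reference("shortness of breathe on exertion",
                        "shortness of breath on exertion"))  # True
```

Taking the maximum of the two similarity functions mirrors the intuition that either a near-verbatim (fuzzy) match or a strong bag-of-words overlap can indicate the same clinical concept, while the length-adjusted threshold keeps short spans from matching too loosely.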
