Supervised Extraction of Diagnosis Codes from EMRs: Role of Feature Selection, Data Selection, and Probabilistic Thresholding.

Anthony Rios,Ramakanth Kavuluru

doi:10.1109/ichi.2013.15

Abstract

Extracting diagnosis codes from medical records is a complex task carried out by trained coders by reading all the documents associated with a patient's visit. With the popularity of electronic medical records (EMRs), computational approaches to code extraction have been proposed in the recent years. Machine learning approaches to multi-label text classification provide an important methodology in this task given each EMR can be associated with multiple codes. In this paper, we study the the role of feature selection, training data selection, and probabilistic threshold optimization in improving different multi-label classification approaches. We conduct experiments based on two different datasets: a recent gold standard dataset used for this task and a second larger and more complex EMR dataset we curated from the University of Kentucky Medical Center. While conventional approaches achieve results comparable to the state-of-the-art on the gold standard dataset, on our complex in-house dataset, we show that feature selection, training data selection, and probabilistic thresholding provide significant gains in performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Supervised Extraction of Diagnosis Codes from EMRs: Role of Feature Selection, Data Selection, and Probabilistic Thresholding.

Abstract

Talk to us

Similar Papers

More From: IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics

Lead the way for us

Journal: IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics	Publication Date: Sep 1, 2013
Citations: 43

Similar Papers

An empirical evaluation of supervised learning approaches in assigning diagnosis codes to electronic medical records.
Ramakanth Kavuluru ... Anthony Rios
Artificial Intelligence in Medicine | VOL. 65
Ramakanth Kavuluru, et. al.Ramakanth Kavuluru ... Anthony Rios
15 May 2015
Artificial Intelligence in Medicine | VOL. 65

The Electronic Health Record Objective Structured Clinical Examination Station: Assessing Student Competency in Patient Notes and Patient Interaction.
E Shen ... Joseph Anthony Cristiano
MedEdPORTAL | VOL. 16
E Shen, et. al.E Shen ... Joseph Anthony Cristiano
28 Oct 2020
MedEdPORTAL | VOL. 16

Diagnosis codes overestimate the burden of prostate cancer cases.
Tori Anglin-Foote ... Patrick Alba
Journal of Clinical Oncology | VOL. 40
Tori Anglin-Foote, et. al.Tori Anglin-Foote ... Patrick Alba
20 Feb 2022
Journal of Clinical Oncology | VOL. 40

Multi-layer Representation Learning and Its Application to Electronic Health Records.
Shan Yang ... Xiangwei Zheng
Neural processing letters | VOL. 53
Shan Yang, et. al.Shan Yang ... Xiangwei Zheng
18 Feb 2021
Neural processing letters | VOL. 53

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Supervised Extraction of Diagnosis Codes from EMRs: Role of Feature Selection, Data Selection, and Probabilistic Thresholding.

Abstract

Talk to us

Similar Papers

More From: IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics