UMLS mapping and Word embeddings for ICD code assignment using the MIMIC-III intensive care database.

Christoph M Friedrich,Henning Schafer

doi:10.1109/embc.2019.8856442

Abstract

Diagnosis codes are used as a billing mechanism in the Electronic Health Record and have the capability to benefit decision support systems, which aim to assist coders by suggesting a relevant subset of potential codes to choose from. Due to the large set of possible labels and length of patient records, automatic ICD code assignment is considered to be a challenging task within the field of multi-label classification. This paper introduces a baseline for automatic ICD code assignment using Support Vector Machines (SVM) and FastText with Unified Medical Language System (UMLS) metathesaurus mappings into word embedding models. Training data is obtained from the Medical Information Mart for Intensive Care (MIMIC-III) database and extended with 'is-a' relationships from ICD-9 hierarchy. FastText is evaluated with different label count estimations, of which an approach based on label cardinality yields a F1-Score of 62.2%. FastText achieves high recall results and mentionable performance improvements over previous models. Reported values are obtained through 10-fold cross-validation.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

UMLS mapping and Word embeddings for ICD code assignment using the MIMIC-III intensive care database.

Abstract

Talk to us

Similar Papers

More From: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference

Lead the way for us

Journal: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference	Publication Date: Jul 1, 2019
Citations: 25

Similar Papers

Evaluating Biomedical Word Embeddings for Vocabulary Alignment at Scale in the UMLS Metathesaurus Using Siamese Networks.
Goonmeet Bajaj ... Hong Yung Yip
Proceedings of the conference. Association for Computational Linguistics. Meeting | VOL. 2022
Goonmeet Bajaj, et. al.Goonmeet Bajaj ... Hong Yung Yip
01 Jan 2021
Proceedings of the conference. Association for Computational Linguistics. Meeting | VOL. 2022

The impact of learning Unified Medical Language System knowledge embeddings in relation extraction from biomedical texts
Sanda M Harabagiu ... Ramon Maldonado
Journal of the American Medical Informatics Association | VOL. 27
Sanda M Harabagiu, et. al.Sanda M Harabagiu ... Ramon Maldonado
01 Oct 2020
Journal of the American Medical Informatics Association | VOL. 27

ONCO-i2b2: improve patients selection through CBR techniques with heterogeneous distance functions
Valentina Tibollo ... Alberto Zambelli
EMBnet.journal | VOL. 18
Valentina Tibollo, et. al.Valentina Tibollo ... Alberto Zambelli
09 Nov 2012
EMBnet.journal | VOL. 18

Context-Enriched Learning Models for Aligning Biomedical Vocabularies at Scale in the UMLS Metathesaurus.
Thilini Wijesiriwardene ... Hong Yung Yip
Proceedings of the ... International World-Wide Web Conference. International WWW Conference | VOL. 2022
Thilini Wijesiriwardene, et. al.Thilini Wijesiriwardene ... Hong Yung Yip
25 Apr 2022
Proceedings of the ... International World-Wide Web Conference. International WWW Conference | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

UMLS mapping and Word embeddings for ICD code assignment using the MIMIC-III intensive care database.

Abstract

Talk to us

Similar Papers

More From: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference