Abstract

BackgroundClinical terms mentioned in clinical text are often not in their standardized forms as listed in clinical terminologies because of linguistic and stylistic variations. However, many automated downstream applications require clinical terms mapped to their corresponding concepts in clinical terminologies, thus necessitating the task of clinical term normalization.ObjectiveIn this paper, a system for clinical term normalization is presented that utilizes edit patterns to convert clinical terms into their normalized forms.MethodsThe edit patterns are automatically learned from the Unified Medical Language System (UMLS) Metathesaurus as well as from the given training data. The edit patterns are generalized sequences of edits that are derived from edit distance computations. The edit patterns are both character based as well as word based and are learned separately for different semantic types. In addition to these edit patterns, the system also normalizes clinical terms through the subconcepts mentioned within them.ResultsThe system was evaluated as part of the 2019 n2c2 Track 3 shared task of clinical term normalization. It obtained 80.79% accuracy on the standard test data. This paper includes ablation studies to evaluate the contributions of different components of the system. A challenging part of the task was disambiguation when a clinical term could be normalized to multiple concepts.ConclusionsThe learned edit patterns led the system to perform well on the normalization task. Given that the system is based on patterns, it is human interpretable and is also capable of giving insights about common variations of clinical terms mentioned in clinical text that are different from their standardized forms.

Highlights

  • Clinical terms mentioned in clinical notes are not always in their standard forms as listed in standardized terminologies or ontologies

  • We presented a system for the clinical term normalization task

  • It uses edit patterns of both characters and words that are automatically learned from Unified Medical Language System (UMLS) and the training data

Read more

Summary

Introduction

Clinical terms mentioned in clinical notes are not always in their standard forms as listed in standardized terminologies or ontologies. A clinical note may mention “diffuse inflammatory reaction”, but a standard terminology resource such as Unified Medical Language System (UMLS) [2] may list the same clinical concept as “diffuse inflammation” or “inflammation diffuse”. As another example, a clinical note may mention “allergy to ferrous sulphate”, but the terminology may mention “allergy to ferrous sulfate”. A clinical note may mention “allergy to ferrous sulphate”, but the terminology may mention “allergy to ferrous sulfate” A resource such as UMLS includes many synonyms for clinical terms, it does not exhaustively cover them. Many automated downstream applications require clinical terms mapped to their corresponding concepts in clinical terminologies, necessitating the task of clinical term normalization

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call