Enhancing Cross-lingual Biomedical Concept Normalization Using Deep Neural Network Pretrained Language Models

Ying-Chi Lin, Phillip Hoffmann, Erhard Rahm

doi:10.1007/s42979-022-01295-7

Abstract

In this study, we propose a new approach for cross-lingual biomedical concept normalization, the process of mapping text in non-English documents to English concepts of a knowledge base. The resulting mappings, called semantic annotations, enhance data integration and the interoperability of documents in different languages. For this reason, the US Food and Drug Administration (FDA) requires all submitted medical forms to be semantically annotated. These standardized medical forms are used in health care practice and biomedical research and are translated or adapted into various languages. Mapping them to the same concepts (normally in English) makes it easier to compare multiple medical studies, even across languages. However, translation and adaptation can cause the forms to deviate from their original text in both syntax and wording. As a result, conventional string matching methods produce low-quality annotations. Our new approach therefore incorporates semantics into the cross-lingual concept normalization process by using sentence embeddings generated by BERT-based pretrained language models. We evaluate the approach by annotating entire questions of German medical forms with concepts in English, as required by the FDA. Compared to conventional string matching methods, the new approach improves recall by 136%, precision by 52% and F-measure by 66%.
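
The matching step can be illustrated with a minimal sketch. The abstract does not state how embeddings are compared or which model is used, so ranking by cosine similarity, the `rank_concepts` helper, the threshold and the concept identifiers are all assumptions made for illustration. The three-dimensional vectors are hand-written placeholders for what a multilingual BERT-based sentence encoder might return for a German question and for English concept names.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_concepts(question_vec, concept_vecs, top_k=2, threshold=0.5):
    """Return up to top_k (concept_id, score) pairs, best first.

    Hypothetical helper: candidates scoring below the threshold are dropped,
    so a question may receive fewer than top_k annotations, or none.
    """
    scored = [(cid, cosine(question_vec, vec)) for cid, vec in concept_vecs.items()]
    kept = [(cid, score) for cid, score in scored if score >= threshold]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return kept[:top_k]


# Placeholder embeddings (assumed values, not produced by any real model).
concept_vecs = {
    "CONCEPT_A": [0.9, 0.1, 0.0],
    "CONCEPT_B": [0.1, 0.9, 0.1],
    "CONCEPT_C": [0.0, 0.2, 0.9],
}
question_vec = [0.8, 0.2, 0.1]  # stands in for an embedded German question

annotations = rank_concepts(question_vec, concept_vecs)
for concept_id, score in annotations:
    print(concept_id, round(score, 3))
```

Because the question and the concepts are compared in a shared embedding space rather than character by character, a translated question can still be matched to an English concept even when the two share no surface strings, which is the situation where string matching falls short.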


Journal: SN Computer Science	Publication Date: Jul 21, 2022
Citations: 5	License type: open-access


