A classification approach for detecting cross-lingual biomedical term translations

H Hakami,D Bollegala

doi:10.1017/s1351324915000431

Abstract

AbstractFinding translations for technical terms is an important problem in machine translation. In particular, in highly specialized domains such as biology or medicine, it is difficult to find bilingual experts to annotate sufficient cross-lingual texts in order to train machine translation systems. Moreover, new terms are constantly being generated in the biomedical community, which makes it difficult to keep the translation dictionaries up to date for all language pairs of interest. Given a biomedical term in one language (source language), we propose a method for detecting its translations in a different language (target language). Specifically, we train a binary classifier to determine whether two biomedical terms written in two languages are translations. Training such a classifier is often complicated due to the lack of common features between the source and target languages. We propose several feature space concatenation methods to successfully overcome this problem. Moreover, we study the effectiveness of contextual and character n-gram features for detecting term translations. Experiments conducted using a standard dataset for biomedical term translation show that the proposed method outperforms several competitive baseline methods in terms of mean average precision and top-k translation accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A classification approach for detecting cross-lingual biomedical term translations

Abstract

Talk to us

Similar Papers

More From: Natural Language Engineering

Lead the way for us

Journal: Natural Language Engineering	Publication Date: Dec 14, 2015
Citations: 6

Similar Papers

A cross-lingual similarity measure for detecting biomedical term translations.
Danushka Bollegala ... Neil R Smalheiser
PLOS ONE | VOL. 10
Danushka Bollegala, et. al.Danushka Bollegala ... Neil R Smalheiser
01 Jun 2015
PLOS ONE | VOL. 10

Cross-Lingual Named Entity Recognition for Heterogenous Languages
Yingwen Fu ... Boyu Chen
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 31
Yingwen Fu, et. al.Yingwen Fu ... Boyu Chen
01 Jan 2023
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 31

Improving Machine Translation Quality with Denoising Autoencoder and Pre-Ordering
Tran Hong-Viet ... Nguyen Hoang-Quan
Journal of Computing and Information Technology | VOL. -
Tran Hong-Viet, et. al.Tran Hong-Viet ... Nguyen Hoang-Quan
21 Mar 2022
Journal of Computing and Information Technology | VOL. -

DrugShot: querying biomedical search terms to retrieve prioritized lists of small molecules
Eryk Kropiwnicki ... Daniel J B Clarke
BMC Bioinformatics | VOL. 23
Eryk Kropiwnicki, et. al.Eryk Kropiwnicki ... Daniel J B Clarke
19 Feb 2022
BMC Bioinformatics | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A classification approach for detecting cross-lingual biomedical term translations

Abstract

Talk to us

Similar Papers

More From: Natural Language Engineering