Biomedical Term Extraction: NLP Techniques in Computational Medicine

Antonio Moreno Sandoval, Teófilo Redondo, Julia Díaz, Leonardo Campillos Llanos

doi:10.9781/ijimai.2018.04.001

Open Access

https://doi.org/10.9781/ijimai.2018.04.001

Abstract

Artificial Intelligence (AI), and in particular its branch Natural Language Processing (NLP), is a main contributor to recent advances in classifying documentation and extracting information in assorted fields. Medicine is one field that has attracted a lot of attention because of the amount of information generated in public professional journals and other means of communication within the medical profession. Information extraction from technical texts is typically performed with an automatic term recognition extractor. Automatic Term Recognition (ATR) from technical texts is applied to identify key concepts for information retrieval and, secondarily, for machine translation. Term recognition depends on the subject domain and on the lexical patterns of a given language, in our case Spanish, Arabic and Japanese. In this article, we present the methods and techniques for creating a biomedical corpus of validated terms, along with several tools for making the best use of the information the corpus contains. We also show how these techniques and tools have been used in a prototype.

Full Text