Keyword Extraction from Biomedical Documents Using Deep Contextualized Embeddings

Azer Celikten,Aybars Ugur,Hasan Bulut

doi:10.1109/inista52262.2021.9548470

Abstract

Due to the rapidly increasing amount of biomedical publications, it has become challenging to follow scientific articles and new developments. Keywords in scientific articles provide a quick understanding and summarize the important points of the context. When keywords are not used in some biomedical articles or are not sufficient to express the content of the text, automatic keyword extraction systems are needed. This paper addresses the keyword extraction problem as a sequence labeling task where words are represented as deep contextual embeddings. We predict the keyword tags identified in sequence labeling by fine-tuning XLNET and BERT-based models such as BERT, BioBERT, SCIBERT, and RoBERTa. Our proposed method does not need extra dictionaries required by rule-based methods and feature extraction as in traditional machine learning methods. Performance evaluation on the benchmark dataset for biomedical keyword extraction shows that domain-specific contextualized embeddings (BioBERT, SciBERT) achieve state-of-the-art results compared to the general domain embeddings (BERT, RoBERTa, XLNET) and unsupervised methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Keyword Extraction from Biomedical Documents Using Deep Contextualized Embeddings

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Non-Contextual vs Contextual Word Embeddings in Multiword Expressions Detection
Maciej Piasecki ... Kamil Kanclerz
-
Maciej Piasecki, et. al.Maciej Piasecki ... Kamil Kanclerz
01 Jan 2021
01 Jan 2021

An Ensemble Keyword Extraction Model for News Texts with Statistical and Graphical Features
Aiman Abibullayeva ... Hüma Kılıç
International Journal of Software Engineering and Knowledge Engineering | VOL. -
Aiman Abibullayeva, et. al.Aiman Abibullayeva ... Hüma Kılıç
22 May 2024
International Journal of Software Engineering and Knowledge Engineering | VOL. -

An approach for medical event detection in Chinese clinical notes of electronic health records
Xuesi Zhou ... Haoqi Xiong
BMC Medical Informatics and Decision Making | VOL. 19
Xuesi Zhou, et. al.Xuesi Zhou ... Haoqi Xiong
01 Apr 2019
BMC Medical Informatics and Decision Making | VOL. 19

Keyword Extraction from Scientific Articles in Bahasa Indonesia using TextRank Algorithm
Dani Gunawan ... Ranti Ramadhiana
-
Dani Gunawan, et. al.Dani Gunawan ... Ranti Ramadhiana
03 Sep 2020
03 Sep 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Keyword Extraction from Biomedical Documents Using Deep Contextualized Embeddings

Abstract

Talk to us

Similar Papers