Uncertainty-based Self-training for Biomedical Keyphrase Extraction.

Zelalem Gero,Joyce C Ho

doi:10.1109/bhi50953.2021.9508592

Abstract

To keep pace with the increased generation and digitization of documents, automated methods that can improve search, discovery and mining of the vast body of literature are essential. Keyphrases provide a concise representation by identifying salient concepts in a document. Various supervised approaches model keyphrase extraction using local context to predict the label for each token and perform much better than the unsupervised counterparts. However, existing supervised datasets have limited annotated examples to train better deep learning models. In contrast, many domains have large amount of un-annotated data that can be leveraged to improve model performance in keyphrase extraction. We introduce a self-learning based model that incorporates uncertainty estimates to select instances from large-scale unlabeled data to augment the small labeled training set. Performance evaluation on a publicly available biomedical dataset demonstrates that our method improves performance of keyphrase extraction over state of the art models.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Uncertainty-based Self-training for Biomedical Keyphrase Extraction.

Abstract

Talk to us

Similar Papers

More From: ... IEEE-EMBS International Conference on Biomedical and Health Informatics. IEEE-EMBS International Conference on Biomedical and Health Informatics

Lead the way for us

Journal: ... IEEE-EMBS International Conference on Biomedical and Health Informatics. IEEE-EMBS International Conference on Biomedical and Health Informatics	Publication Date: Jul 27, 2021
Citations: 2

Similar Papers

Deep Neural Models for Key-Phrase Indexing
Saurabh Sharma ... Mamta Juneja
-
Saurabh Sharma, et. al.Saurabh Sharma ... Mamta Juneja
01 Jan 2021
01 Jan 2021

Performance Analysis of Graph based Keyphrase Extraction metrics for uncertain User-generated data
Muskan Garg ... Mukesh Kumar
Procedia computer science | VOL. 143
Muskan Garg, et. al.Muskan Garg ... Mukesh Kumar
01 Jan 2018
Procedia computer science | VOL. 143

Importance Estimation from Multiple Perspectives for Keyphrase Extraction
Mingyang Song ... Lin Xiao
-
Mingyang Song, et. al.Mingyang Song ... Lin Xiao
01 Jan 2020
01 Jan 2020

NamedKeys
Zelalem Gero ... Joyce C Ho
-
Zelalem Gero, et. al.Zelalem Gero ... Joyce C Ho
04 Sep 2019
04 Sep 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Uncertainty-based Self-training for Biomedical Keyphrase Extraction.

Abstract

Talk to us

Similar Papers

More From: ... IEEE-EMBS International Conference on Biomedical and Health Informatics. IEEE-EMBS International Conference on Biomedical and Health Informatics