Few-shot biomedical named entity recognition via knowledge-guided instance generation and prompt contrastive learning.

Peng Chen,Zhihao Yang,Di Zhao,Hongfei Lin,Jian Wang

doi:10.1093/bioinformatics/btad496

Peng Chen, Zhihao Yang + Show 3 more

Open Access

https://doi.org/10.1093/bioinformatics/btad496

Copy DOI

Abstract

Few-shot learning (FSL) that can effectively perform named entity recognition in low-resource scenarios has raised growing attention, but it has not been widely studied yet in the biomedical field. In contrast to high-resource domains, biomedical named entity recognition (BioNER) often encounters limited human-labeled data in real-world scenarios, leading to poor generalization performance when training only a few labeled instances. Recent approaches either leverage cross-domain high-resource data or fine-tune the pre-trained masked language model using limited labeled samples to generate new synthetic data, which is easily stuck in domain shift problems or yields low-quality synthetic data. Therefore, in this paper, we study a more realistic scenario, i.e., few-shot learning for BioNER. Leveraging the domain knowledge graph, we propose knowledge-guided instance generation for few-shot BioNER, which generates diverse and novel entities based on similar semantic relations of neighbor nodes. In addition, by introducing question prompt, we cast BioNER as question answering (QA) task and propose prompt contrastive learning to improve the robustness of the model by measuring the mutual information (MI) between query-answer pairs. Extensive experiments conducted on various few-shot settings show that the proposed framework achieves superior performance. Particularly, in a low-resource scenario with only 20 samples, our approach substantially outperforms recent state-of-the-art (SoTA) models on four benchmark datasets, achieving an average improvement of up to 7.1% F1. Our source code and data are available at https://github.com/cpmss521/KGPC. Supplementary data are available at Bioinformatics online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Bioinformatics	Publication Date: Aug 1, 2023
Citations: 7	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Few-shot biomedical named entity recognition via knowledge-guided instance generation and prompt contrastive learning.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Similar Papers

D3NER: biomedical named entity recognition using CRF-biLSTM improved with fine-tuned embeddings of various linguistic information.
Thanh Hai Dang ... Hoang-Quynh Le
Bioinformatics | VOL. 34
Thanh Hai Dang, et. al.Thanh Hai Dang ... Hoang-Quynh Le
30 Apr 2018
Bioinformatics | VOL. 34

BioALBERT: A Simple and Effective Pre-trained Language Model for Biomedical Named Entity Recognition
Usman Naseem ... Sakthivel Rajendran
-
Usman Naseem, et. al.Usman Naseem ... Sakthivel Rajendran
18 Jul 2021
18 Jul 2021

Improving deep learning method for biomedical named entity recognition by using entity definition information
Ying Xiong ... Yi Zhou
BMC Bioinformatics | VOL. 22
Ying Xiong, et. al.Ying Xiong ... Yi Zhou
01 Dec 2021
BMC Bioinformatics | VOL. 22

Biomedical named entity recognition using BERT in the machine reading comprehension framework
Cong Sun ... Jian Wang
Journal of Biomedical Informatics | VOL. 118
Cong Sun, et. al.Cong Sun ... Jian Wang
06 May 2021
Journal of Biomedical Informatics | VOL. 118

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Few-shot biomedical named entity recognition via knowledge-guided instance generation and prompt contrastive learning.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics