Biomedical Information Retrieval with Positive-Unlabeled Learning and Knowledge Graphs

Yuqi Wang,Qiuyi Chen,Haiyang Zhang,Wei Wang,Qiufeng Wang,Yushan Pan,Liangru Xie,Kaizhu Huang,Anh Nguyen

doi:10.1145/3702647

Abstract

The rapid growth of biomedical publications has presented significant challenges in the field of information retrieval. Most existing work focuses on document retrieval given explicit queries. However, in real applications such as curated biomedical database maintenance, explicit queries are missing. In this paper, we propose a two-step model for biomedical information retrieval in the case that only a small set of example documents is available without explicit queries. Initially, we extract keywords from the observed documents using large pre-trained language models and biomedical knowledge graphs. These keywords are then enriched with domain-specific entities. Information retrieval techniques can subsequently use the collected entities to rank the documents. Following this, we introduce an iterative Positive-Unlabeled learning method to classify all unlabeled documents. Experiments conducted on the PubMed dataset demonstrate that the proposed technique outperforms the state-of-the-art positive-unlabeled learning methods. The results underscore the effectiveness of integrating large language models and biomedical knowledge graphs in improving zero-shot information retrieval performance in the biomedical domain.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Biomedical Information Retrieval with Positive-Unlabeled Learning and Knowledge Graphs

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Intelligent Systems and Technology

Lead the way for us

Similar Papers

Knowledge graphs in psychiatric research: Potential applications and future perspectives.
Sebastian Freidel ... Emanuel Schwarz
Acta psychiatrica Scandinavica | VOL. -
Sebastian Freidel, et. al.Sebastian Freidel ... Emanuel Schwarz
17 Jun 2024
Acta psychiatrica Scandinavica | VOL. -

Towards an Enhanced Understanding of Bias in Pre-trained Neural Language Models: A Survey with Special Emphasis on Affective Bias
Anoop K ... Lajish V L
-
Anoop K, et. al. Anoop K ... Lajish V L
01 Jan 2021
01 Jan 2021

FuseLinker: Leveraging LLM’s pre-trained text embeddings and domain knowledge to enhance GNN-based link prediction on biomedical knowledge graphs
Yongkang Xiao ... Rui Zhang
Journal of Biomedical Informatics | VOL. 158
Yongkang Xiao, et. al.Yongkang Xiao ... Rui Zhang
24 Sep 2024
Journal of Biomedical Informatics | VOL. 158

Jigsaw
Naman Jain ... Arun Iyer
-
Naman Jain, et. al.Naman Jain ... Arun Iyer
21 May 2022
21 May 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Biomedical Information Retrieval with Positive-Unlabeled Learning and Knowledge Graphs

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Intelligent Systems and Technology