Document Retrieval System for Biomedical Question Answering

Harun Bolat,Baha Şen

doi:10.3390/app14062613

Abstract

In this paper, we describe our biomedical document retrieval system and answers extraction module, which is part of the biomedical question answering system. Approximately 26.5 million PubMed articles are indexed as a corpus with the Apache Lucene text search engine. Our proposed system consists of three parts. The first part is the question analysis module, which analyzes the question and enriches it with biomedical concepts related to its wording. The second part of the system is the document retrieval module. In this step, the proposed system is tested using different information retrieval models, like the Vector Space Model, Okapi BM25, and Query Likelihood. The third part is the document re-ranking module, which is responsible for re-arranging the documents retrieved in the previous step. For this study, we tested our proposed system with 6B training questions from the BioASQ challenge task. We obtained the best MAP score on the document retrieval phase when we used Query Likelihood with the Dirichlet Smoothing model. We used the sequential dependence model at the re-rank phase, but this model produced a worse MAP score than the previous phase. In similarity calculation, we included the Named Entity Recognition (NER), UMLS Concept Unique Identifiers (CUI), and UMLS Semantic Types of the words in the question to find the sentences containing the answer. Using this approach, we observed a performance enhancement of roughly 25% for the top 20 outcomes, surpassing another method employed in this study, which relies solely on textual similarity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Mar 20, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Document Retrieval System for Biomedical Question Answering

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Comparative Analysis of Machine Learning Algorithms for Author Age and Gender Identification
Zarah Zainab ... Feras Al-Obeidat
-
Zarah Zainab, et. al.Zarah Zainab ... Feras Al-Obeidat
01 Jan 2023
01 Jan 2023

A Neural Pipeline Approach for the PharmaCoNER Shared Task using Contextual Exhaustive Models
Mohammad Golam Sohrab ... Minh Thang Pham
-
Mohammad Golam Sohrab, et. al.Mohammad Golam Sohrab ... Minh Thang Pham
01 Jan 2019
01 Jan 2019

Passage-Based Text Summarization for Legal Information Retrieval
Ambedkar Kanapala ... Srikanth Jannu
Arabian Journal for Science and Engineering | VOL. 44
Ambedkar Kanapala, et. al.Ambedkar Kanapala ... Srikanth Jannu
18 Jul 2019
Arabian Journal for Science and Engineering | VOL. 44

Information Retrieval: Concepts, Models, and Systems
Venkat N Gudivada ... Dhana L Rao
-
Venkat N Gudivada, et. al.Venkat N Gudivada ... Dhana L Rao
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Document Retrieval System for Biomedical Question Answering

Abstract

Talk to us

Similar Papers

More From: Applied Sciences