Biomedical Question Answering Research Articles

BackgroundMining the vast pool of biomedical literature to extract accurate responses and relevant references is challenging due to the domain's interdisciplinary nature, specialized jargon, and continuous evolution. Early natural language processing (NLP) approaches often led to incorrect answers as they failed to comprehend the nuances of natural language. However, transformer models have significantly advanced the field by enabling the creation of large language models (LLMs), enhancing question-answering (QA) tasks. Despite these advances, current LLM-based solutions for specialized domains like biology and biomedicine still struggle to generate up-to-date responses while avoiding “hallucination” or generating plausible but factually incorrect responses.ResultsOur work focuses on enhancing prompts using a retrieval-augmented architecture to guide LLMs in generating meaningful responses for biomedical QA tasks. We evaluated two approaches: one relying on text embedding and vector similarity in a high-dimensional space, and our proposed method, which uses explicit signals in user queries to extract meaningful contexts. For robust evaluation, we tested these methods on 50 specific and challenging questions from diverse biomedical topics, comparing their performance against a baseline model, BM25. Retrieval performance of our method was significantly better than others, achieving a median Precision@10 of 0.95, which indicates the fraction of the top 10 retrieved chunks that are relevant. We used GPT-4, OpenAI's most advanced LLM to maximize the answer quality and manually accessed LLM-generated responses. Our method achieved a median answer quality score of 2.5, surpassing both the baseline model and the text embedding-based approach. We developed a QA bot, WeiseEule (https://github.com/wasimaftab/WeiseEule-LocalHost), which utilizes these methods for comparative analysis and also offers advanced features for review writing and identifying relevant articles for citation.ConclusionsOur findings highlight the importance of prompt enhancement methods that utilize explicit signals in user queries over traditional text embedding-based approaches to improve LLM-generated responses for specialized queries in specialized domains such as biology and biomedicine. By providing users complete control over the information fed into the LLM, our approach addresses some of the major drawbacks of existing web-based chatbots and LLM-based QA systems, including hallucinations and the generation of irrelevant or outdated responses.

Read full abstract

Extractive methods for machine reading comprehension (MRC) tasks have achieved comparable or better accuracy than human performance on benchmark data sets. However, such models are not as successful when adapted to complex domains such as health care. One of the main reasons is that the context that the MRC model needs to process when operating in a complex domain can be much larger compared with an average open-domain context. This causes the MRC model to make less accurate and slower predictions. A potential solution to this problem is to reduce the input context of the MRC model by extracting only the necessary parts from the original context. This study aims to develop a method for extracting useful contexts from long articles as an additional component to the question answering task, enabling the MRC model to work more efficiently and accurately. Existing approaches to context extraction in MRC are based on sentence selection strategies, in which the models are trained to find the sentences containing the answer. We found that using only the sentences containing the answer was insufficient for the MRC model to predict correctly. We conducted a series of empirical studies and observed a strong relationship between the usefulness of the context and the confidence score output of the MRC model. Our investigation showed that a precise input context can boost the prediction correctness of the MRC and greatly reduce inference time. We proposed a method to estimate the utility of each sentence in a context in answering the question and then extract a new, shorter context according to these estimations. We generated a data set to train 2 models for estimating sentence utility, based on which we selected more precise contexts that improved the MRC model's performance. We demonstrated our approach on the Question Answering Data Set for COVID-19 and Biomedical Semantic Indexing and Question Answering data sets and showed that the approach benefits the downstream MRC model. First, the method substantially reduced the inference time of the entire question answering system by 6 to 7 times. Second, our approach helped the MRC model predict the answer more correctly compared with using the original context (F1-score increased from 0.724 to 0.744 for the Question Answering Data Set for COVID-19 and from 0.651 to 0.704 for the Biomedical Semantic Indexing and Question Answering). We also found a potential problem where extractive transformer MRC models predict poorly despite being given a more precise context in some cases. The proposed context extraction method allows the MRC model to achieve improved prediction correctness and a significantly reduced MRC inference time. This approach works technically with any MRC model and has potential in tasks involving processing long texts.

Read full abstract

Biomedical Question Answering Research Articles

Related Topics

Articles published on Biomedical Question Answering

Optimizing biomedical information retrieval with a keyword frequency-driven prompt enhancement strategy

Enhancing Biomedical Question Answering with Large Language Models

Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction Approach.

Document Retrieval System for Biomedical Question Answering

Taiyi: a bilingual fine-tuned large language model for diverse biomedical tasks.

Evaluating the ChatGPT family of models for biomedical reasoning and classification.

Benchmarking Large Language Models in Evidence-Based Medicine.

Developing ChatGPT for biology and medicine: a complete review of biomedical question answering.

A self-supervised language model selection strategy for biomedical question answering

Enhancing Biomedical ReQA With Adversarial Hard In-Batch Negative Samples.

A novel self-attention enriching mechanism for biomedical question answering

Selective UMLS knowledge infusion for biomedical question answering

BioASQ-QA: A manually curated corpus for Biomedical Question Answering

Improving Biomedical Question Answering by Data Augmentation and Model Weighting.

Survey on the Biomedical Text Summarization Techniques with an Emphasis on Databases, Techniques, Semantic Approaches, Classification Techniques, and Similarity Measures

Adversarial Knowledge Distillation Based Biomedical Factoid Question Answering.

Sequence tagging for biomedical extractive question answering.

Ensemble-based Methods for Multi-label Classification on Biomedical Question-Answer Data

An Efficient Method for Biomedical Entity Linking Based on Inter- and Intra-Entity Attention

SentiMedQAer: A Transfer Learning-Based Sentiment-Aware Model for Biomedical Question Answering.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Biomedical Question Answering Research Articles

Related Topics

Articles published on Biomedical Question Answering

Optimizing biomedical information retrieval with a keyword frequency-driven prompt enhancement strategy

Enhancing Biomedical Question Answering with Large Language Models

Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction Approach.

Document Retrieval System for Biomedical Question Answering

Taiyi: a bilingual fine-tuned large language model for diverse biomedical tasks.

Evaluating the ChatGPT family of models for biomedical reasoning and classification.

Benchmarking Large Language Models in Evidence-Based Medicine.

Developing ChatGPT for biology and medicine: a complete review of biomedical question answering.

A self-supervised language model selection strategy for biomedical question answering

Enhancing Biomedical ReQA With Adversarial Hard In-Batch Negative Samples.

A novel self-attention enriching mechanism for biomedical question answering

Selective UMLS knowledge infusion for biomedical question answering

BioASQ-QA: A manually curated corpus for Biomedical Question Answering

Improving Biomedical Question Answering by Data Augmentation and Model Weighting.

Survey on the Biomedical Text Summarization Techniques with an Emphasis on Databases, Techniques, Semantic Approaches, Classification Techniques, and Similarity Measures

Adversarial Knowledge Distillation Based Biomedical Factoid Question Answering.

Sequence tagging for biomedical extractive question answering.

Ensemble-based Methods for Multi-label Classification on Biomedical Question-Answer Data

An Efficient Method for Biomedical Entity Linking Based on Inter- and Intra-Entity Attention

SentiMedQAer: A Transfer Learning-Based Sentiment-Aware Model for Biomedical Question Answering.