Retrieval Capabilities Research Articles

Introduction Large language models (LLMs) have gained popularity due to their natural language generation and interpretation capabilities. Integrating these models in medicine enables multiple tasks like summarizing medical histories, synthesizing literature, and suggesting diagnoses. Models like ChatGPT, GPT-4, and Med-PaLM2 (Singhal et al., 2023) have demonstrated their proficiency by achieving high scores in medical tests like the United States Medical Licensing Examination (USMLE) (Kung et al., 2023). However, LLMs may sometimes be inaccurate, providing unverified and erroneous information. In this study, we investigate the potential uses of LLMs in hematology, assessing their knowledge through hematology questions from the USMLE. Additionally, we propose augmenting LLMs with retrieval capabilities for medical guidelines in order to eliminate incorrect information. By extracting relevant information from specified medical documents, this approach holds the potential to streamline decision-making processes. Methods For comparative purposes, all experiments were conducted using both GPT 3.5-turbo and GPT-4 models. In a first step, we evaluated the general knowledge and performance of LLM in the field of hematology by testing it in a collected dataset of 127 question-answer pairs from the hematology section (covering various aspects of the field) of the USMLE. In a second step, we evaluated the proposed information retrieval framework using a set of 120 multiple-choice questions. These questions were specifically focused on the 4th revision of the World Health Organization classification of myeloid neoplasms and acute leukemia guidelines (subsequently called WHO 2017). By testing the framework on this domain-specific dataset, we aimed to assess its ability to extract specific clinical context and relevant information from complex clinical guidelines. Each question from the WHO 2017 guideline dataset was subjected to a comprehensive evaluation using two techniques. First, the questions were assessed using a zero-shot approach (the question together with the different options are directly posed to the model) to assess the LLM's capability to respond based on its own knowledge. Second, we employed our proposed retrieval information approach, enabling the system to conduct in-depth searches throughout the external documents (WHO 2017 guideline) to identify relevant (and similar) extracts about each question. Subsequently, the system provided answers based on the retrieved contexts from the document, thus facilitating more accurate and contextually informed responses. To achieve this, we created an embedding space containing the document's content and conducted a cosine-similarity search between a given question and all the content extracts from the document. The top three relevant extracts, based on similarity to the given question, were used as context for the LLM. Results In the evaluation of 127 hematology questions from the USMLE, GPT-3.5 in zero-shot mode achieved 63% accuracy, while GPT-4 demonstrated a higher accuracy rate of 82%. The evaluation of the WHO 2017 questions dataset revealed that the zero-shot approach achieved accuracy rates of 51% for GPT-3.5 and 71% for GPT-4. Incorporating information retrieval, retrieving the three most relevant extracts from the guidelines, substantially improved performance, with GPT-3.5 achieving 86% accuracy and GPT-4 demonstrating 97% accuracy. Conclusions LLMs have great potential, with current models showcasing substantial knowledge in hematology. However, ensuring their consistency and safety in responses is critical for their reliable application in medical settings (Thirunavukarasu et al., 2023). To address this, we demonstrated the benefits of information retrieval for question-answering in the field of hematology, significantly improving response reliability and accuracy by empowering LLMs to deliver more informed and contextually appropriate answers. The concept was effectively validated using the WHO 2017 guideline, and it can be effortlessly adapted to answer questions based on any set of hematology-related documents. Leveraging LLMs has the potential to significantly enhance the efficiency and effectiveness of clinical, educational, and research work in hematology.

Read full abstract

Fengyun-3E (FY-3E)/Hyperspectral Infrared Atmospheric Sounder-II (HIRAS-II) is an extension Fengyun-3D (FY-3D)/HIRAS-I. It is crucial to fully explore and analyze the detection capabilities of these two instruments for atmospheric gas composition. Based on the observed spectral data from the infrared hyperspectral detection instruments FY-3D/HIRAS-I and FY-3E/HIRAS-II, simulated radiance data and Jacobian matrices are obtained using the Rapid Radiative Transfer Model RTTOV (Radiative Transfer for TOVS (TIROS Operational Vertical Sounder)). By perturbing temperature (T), surface temperature (Tsurf), water vapor (H2O), ozone (O3), carbon dioxide (CO2), methane (CH4), carbon monoxide (CO), and nitrous oxide (N2O), the brightness temperature differences before and after the perturbations are calculated to analyze the sensitivity of temperature and various atmospheric gas components. The Improved Optimal Sensitivity Profile (OSP) algorithm is used to select the channels for atmospheric gas retrieval. The observation error covariance and background error covariance matrices are calculated, and then the information capacity is calculated, specifically the degrees of freedom for signal(DFS) and the entropy reduction (ER). Based on this, a comparative analysis is conducted on the information capacity of atmospheric water vapor and ozone components contained in the hyperspectral detection data from HIRAS-I and HIRAS-II instruments, respectively, to explore the retrieval capabilities of the two instruments for atmospheric gas components. We selected clear-sky data from the African oceanic region and the Chinese Yangtze River Delta terrestrial region for quantitative analysis of the information capacity of HIRAS-I and HIRAS-II. The results show that FY-3D/HIRAS-I and FY-3E/HIRAS-II exhibit different sensitivities to atmospheric gas components. In different experimental regions, temperature and water vapor show the most dramatic sensitivity changes, followed by ozone, methane, and nitrous oxide, while carbon monoxide and carbon dioxide exhibit the lowest variability. Regarding channel selection, HIRAS-II identifies more gas channels compared to HIRAS-I. The experiments concluded that HIRAS-II has a significantly higher information capacity than HIRAS-I, and the information capacity of atmospheric gas components varies across different experimental regions. Water vapor and ozone exhibit the highest information capacity, followed by nitrous oxide and methane, while carbon monoxide and carbon dioxide demonstrate the lowest capacity. The H2O ER (DFS) contained in FY-3E/HIRAS-II is 1.51 (0.35) higher than that in FY-3D/HIRAS-I, the O3 ER (DFS) in FY-3E/HIRAS-II is 1.51 (0.36) higher than that in FY-3D/HIRAS-I, while the N2O ER (DFS) in FY-3E/HIRAS-II is 0.17 (0.19) higher and the CH4 ER (DFS) is 0.07 (0.04) higher than that in FY-3D/HIRAS-I.

Read full abstract

Retrieval Capabilities Research Articles

Related Topics

Articles published on Retrieval Capabilities

Factors influencing proxy online health information seeking among the elderly: A study from the perspective of the elderly with chronic illnesses.

Creating a computer assisted ICD coding system: Performance metric choice and use of the ICD hierarchy

A Convolutional Neural Network and Attention-Based Retrieval of Temperature Profile for a Satellite Hyperspectral Microwave Sensor

Digital engineering audit management strategy based on machine learning combined with wireless communication network

Resolution-enhanced multi-core fiber imaging learned on a digital twin for cancer diagnosis.

Synergizing intelligence and knowledge discovery: Hybrid black hole algorithm for optimizing discrete Hopfield neural network with negative based systematic satisfiability

Deep Learning Algorithms for Personalized Services and Enhanced User Experience in Libraries

Combined Retrieval of Oil Film Thickness Using Hyperspectral and Thermal Infrared Remote Sensing

Evaluation of Five Satellite-Based Precipitation Products for Extreme Rainfall Estimations over the Qinghai-Tibet Plateau

Assessment of Artificial Intelligence Language Models and Information Retrieval Strategies for QA in Hematology

Reconstruction of Femtosecond Laser Pulses from FROG Traces by Convolutional Neural Networks

Structural optical design of multi-group, high zoom ratio optical systems

Enhancing Cloud Communication Security: A Blockchain-Powered Framework with Attribute-Aware Encryption

Evaluating the performance of ChatGPT in clinical pharmacy: A comparative study of ChatGPT and clinical pharmacists.

Bathymetry Inversion Using Attention-Based Band Optimization Model for Hyperspectral or Multispectral Satellite Imagery

Assessing the Chlorophyll-a Retrieval Capabilities of Sentinel 3A OLCI Images for the Monitoring of Coastal Waters in Algoa and Francis Bays, South Africa

Comparative Study of the Atmospheric Gas Composition Detection Capabilities of FY-3D/HIRAS-I and FY-3E/HIRAS-II Based on Information Capacity

ChainDash: An Ad-Hoc Blockchain Data Analytics System

Database System for Medical Record Keeping and Retrieval

Retrieval of atmospheric temperature and humidity profiles over a tropical coastal station from ground-based Microwave Radiometer using deep learning technique

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Retrieval Capabilities Research Articles

Related Topics

Articles published on Retrieval Capabilities

Factors influencing proxy online health information seeking among the elderly: A study from the perspective of the elderly with chronic illnesses.

Creating a computer assisted ICD coding system: Performance metric choice and use of the ICD hierarchy

A Convolutional Neural Network and Attention-Based Retrieval of Temperature Profile for a Satellite Hyperspectral Microwave Sensor

Digital engineering audit management strategy based on machine learning combined with wireless communication network

Resolution-enhanced multi-core fiber imaging learned on a digital twin for cancer diagnosis.

Synergizing intelligence and knowledge discovery: Hybrid black hole algorithm for optimizing discrete Hopfield neural network with negative based systematic satisfiability

Deep Learning Algorithms for Personalized Services and Enhanced User Experience in Libraries

Combined Retrieval of Oil Film Thickness Using Hyperspectral and Thermal Infrared Remote Sensing

Evaluation of Five Satellite-Based Precipitation Products for Extreme Rainfall Estimations over the Qinghai-Tibet Plateau

Assessment of Artificial Intelligence Language Models and Information Retrieval Strategies for QA in Hematology

Reconstruction of Femtosecond Laser Pulses from FROG Traces by Convolutional Neural Networks

Structural optical design of multi-group, high zoom ratio optical systems

Enhancing Cloud Communication Security: A Blockchain-Powered Framework with Attribute-Aware Encryption

Evaluating the performance of ChatGPT in clinical pharmacy: A comparative study of ChatGPT and clinical pharmacists.

Bathymetry Inversion Using Attention-Based Band Optimization Model for Hyperspectral or Multispectral Satellite Imagery

Assessing the Chlorophyll-a Retrieval Capabilities of Sentinel 3A OLCI Images for the Monitoring of Coastal Waters in Algoa and Francis Bays, South Africa

Comparative Study of the Atmospheric Gas Composition Detection Capabilities of FY-3D/HIRAS-I and FY-3E/HIRAS-II Based on Information Capacity

ChainDash: An Ad-Hoc Blockchain Data Analytics System

Database System for Medical Record Keeping and Retrieval

Retrieval of atmospheric temperature and humidity profiles over a tropical coastal station from ground-based Microwave Radiometer using deep learning technique