Assessing the utility of large language models for phenotype-driven gene prioritization in the diagnosis of rare genetic disease

Junyoung Kim,Kai Wang,Chunhua Weng,Cong Liu

doi:10.1016/j.ajhg.2024.08.010

Abstract

Phenotype-driven gene prioritization is fundamental to diagnosing rare genetic disorders. While traditional approaches rely on curated knowledge graphs with phenotype-gene relations, recent advancements in large language models (LLMs) promise a streamlined text-to-gene solution. In this study, we evaluated five LLMs, including two generative pre-trained transformers (GPT) series and three Llama2 series, assessing their performance across task completeness, gene prediction accuracy, and adherence to required output structures. We conducted experiments, exploring various combinations of models, prompts, phenotypic input types, and task difficulty levels. Our findings revealed that the best-performed LLM, GPT-4, achieved an average accuracy of 17.0% in identifying diagnosed genes within the top 50 predictions, which still falls behind traditional tools. However, accuracy increased with the model size. Consistent results were observed over time, as shown in the dataset curated after 2023. Advanced techniques such as retrieval-augmented generation (RAG) and few-shot learning did not improve the accuracy. Sophisticated prompts were more likely to enhance task completeness, especially in smaller models. Conversely, complicated prompts tended to decrease output structure compliance rate. LLMs also achieved better-than-random prediction accuracy with free-text input, though performance was slightly lower than with standardized concept input. Bias analysis showed that highly cited genes, such as BRCA1, TP53, and PTEN, are more likely to be predicted. Our study provides valuable insights into integrating LLMs with genomic analysis, contributing to the ongoing discussion on their utilization in clinical workflows.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Assessing the utility of large language models for phenotype-driven gene prioritization in the diagnosis of rare genetic disease

Abstract

Talk to us

Similar Papers

More From: The American Journal of Human Genetics

Lead the way for us

Journal: The American Journal of Human Genetics	Publication Date: Sep 1, 2024
License type: cc-by

Similar Papers

A systematic review on machine learning approaches in the diagnosis and prognosis of rare genetic diseases
P Roman-Naranjo ... J.A Lopez-Escamez
Journal of Biomedical Informatics | VOL. 143
P Roman-Naranjo, et. al.P Roman-Naranjo ... J.A Lopez-Escamez
22 Jun 2023
Journal of Biomedical Informatics | VOL. 143

CancerGPT for few shot drug pair synergy prediction using large pretrained language models
Tianhao Li ... Yejin Kim
npj Digital Medicine | VOL. 7
Tianhao Li, et. al.Tianhao Li ... Yejin Kim
19 Feb 2024
npj Digital Medicine | VOL. 7

Improving the use of LLMs in radiology through prompt engineering: from precision prompts to zero-shot learning.
Fabian Bamberg ... Maximilian Frederik Russe
RöFo - Fortschritte auf dem Gebiet der Röntgenstrahlen und der bildgebenden Verfahren | VOL. -
Fabian Bamberg, et. al.Fabian Bamberg ... Maximilian Frederik Russe
26 Feb 2024
RöFo - Fortschritte auf dem Gebiet der Röntgenstrahlen und der bildgebenden Verfahren | VOL. -

Model tuning or prompt Tuning? a study of large language models for clinical concept and relation extraction
Cheng Peng ... Yonghui Wu
Journal of Biomedical Informatics | VOL. 153
Cheng Peng, et. al.Cheng Peng ... Yonghui Wu
26 Mar 2024
Journal of Biomedical Informatics | VOL. 153

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Assessing the utility of large language models for phenotype-driven gene prioritization in the diagnosis of rare genetic disease

Abstract

Talk to us

Similar Papers

More From: The American Journal of Human Genetics