An Accurate and Efficient Approach to Knowledge Extraction from Scientific Publications Using Structured Ontology Models, Graph Neural Networks, and Large Language Models.

Timofey V Ivanisenko,Pavel S Demenkov,Vladimir A Ivanisenko

doi:10.3390/ijms252111811

Abstract

The rapid growth of biomedical literature makes it challenging for researchers to stay current. Integrating knowledge from various sources is crucial for studying complex biological systems. Traditional text-mining methods often have limited accuracy because they don't capture semantic and contextual nuances. Deep-learning models can be computationally expensive and typically have low interpretability, though efforts in explainable AI aim to mitigate this. Furthermore, transformer-based models have a tendency to produce false or made-up information-a problem known as hallucination-which is especially prevalent in large language models (LLMs). This study proposes a hybrid approach combining text-mining techniques with graph neural networks (GNNs) and fine-tuned large language models (LLMs) to extend biomedical knowledge graphs and interpret predicted edges based on published literature. An LLM is used to validate predictions and provide explanations. Evaluated on a corpus of experimentally confirmed protein interactions, the approach achieved a Matthews correlation coefficient (MCC) of 0.772. Applied to insomnia, the approach identified 25 interactions between 32 human proteins absent in known knowledge bases, including regulatory interactions between MAOA and 5-HT2C, binding between ADAM22 and 14-3-3 proteins, which is implicated in neurological diseases, and a circadian regulatory loop involving RORB and NR1D1. The hybrid GNN-LLM method analyzes biomedical literature efficiency to uncover potential molecular interactions for complex disorders. It can accelerate therapeutic target discovery by focusing expert verification on the most relevant automatically extracted information.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Accurate and Efficient Approach to Knowledge Extraction from Scientific Publications Using Structured Ontology Models, Graph Neural Networks, and Large Language Models.

Abstract

Talk to us

Similar Papers

More From: International journal of molecular sciences

Lead the way for us

Journal: International journal of molecular sciences	Publication Date: Nov 3, 2024
License type: CC BY 4.0

Similar Papers

How Can IJDS Authors, Reviewers, and Editors Use (and Misuse) Generative AI?
Galit Shmueli ... Bianca Maria Colosimo
INFORMS Journal on Data Science | VOL. 2
Galit Shmueli, et. al.Galit Shmueli ... Bianca Maria Colosimo
01 Apr 2023
INFORMS Journal on Data Science | VOL. 2

DTT: An Example-Driven Tabular Transformer for Joinability by Leveraging Large Language Models
Arash Dargahi Nobari ... Davood Rafiei
Proceedings of the ACM on Management of Data | VOL. 2
Arash Dargahi Nobari, et. al.Arash Dargahi Nobari ... Davood Rafiei
12 Mar 2024
Proceedings of the ACM on Management of Data | VOL. 2

Exploring the Potential of Large Language Models (LLMs)in Learning on Graphs
Zhikai Chen ... Hongzhi Wen
ACM SIGKDD Explorations Newsletter | VOL. 25
Zhikai Chen, et. al.Zhikai Chen ... Hongzhi Wen
26 Mar 2024
ACM SIGKDD Explorations Newsletter | VOL. 25

Automatic structuring of radiology reports with on-premise open-source large language models.
Piotr Woźnicki ... Fabian Christopher Laqua
European radiology | VOL. -
Piotr Woźnicki, et. al.Piotr Woźnicki ... Fabian Christopher Laqua
10 Oct 2024
European radiology | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Accurate and Efficient Approach to Knowledge Extraction from Scientific Publications Using Structured Ontology Models, Graph Neural Networks, and Large Language Models.

Abstract

Talk to us

Similar Papers

More From: International journal of molecular sciences