Discovering Photoswitchable Molecules for Drug Delivery with Large Language Models and Chemist Instruction Training.

Junjie Hu,Shiyi Wang,Qi Li,Kun Qian,Yang Liu,Peng Wu,Guang Yang,Yulin Li

doi:10.3390/ph17101300

Abstract

Background: As large language models continue to expand in size and diversity, their substantial potential and the relevance of their applications are increasingly being acknowledged. The rapid advancement of these models also holds profound implications for the long-term design of stimulus-responsive materials used in drug delivery. Methods: The large model used Hugging Face's Transformers package with BigBird, Gemma, and GPT NeoX architectures. Pre-training used the PubChem dataset, and fine-tuning used QM7b. Chemist instruction training was based on Direct Preference Optimization. Drug Likeness, Synthetic Accessibility, and PageRank Scores were used to filter molecules. All computational chemistry simulations were performed using ORCA and Time-Dependent Density-Functional Theory. Results: To optimize large models for extensive dataset processing and comprehensive learning akin to a chemist's intuition, the integration of deeper chemical insights is imperative. Our study initially compared the performance of BigBird, Gemma, GPT NeoX, and others, specifically focusing on the design of photoresponsive drug delivery molecules. We gathered excitation energy data through computational chemistry tools and further investigated light-driven isomerization reactions as a critical mechanism in drug delivery. Additionally, we explored the effectiveness of incorporating human feedback into reinforcement learning to imbue large models with chemical intuition, enhancing their understanding of relationships involving -N=N- groups in the photoisomerization transitions of photoresponsive molecules. Conclusions: We implemented an efficient design process based on structural knowledge and data, driven by large language model technology, to obtain a candidate dataset of specific photoswitchable molecules. However, the lack of specialized domain datasets remains a challenge for maximizing model performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discovering Photoswitchable Molecules for Drug Delivery with Large Language Models and Chemist Instruction Training.

Abstract

Talk to us

Similar Papers

More From: Pharmaceuticals (Basel, Switzerland)

Lead the way for us

Journal: Pharmaceuticals (Basel, Switzerland)	Publication Date: Sep 30, 2024
License type: CC BY 4.0

Similar Papers

UNDERSTANDING LARGE LANGUAGE MODELS: THE FUTURE OF ARTIFICIAL INTELLIGENCE
Iryna Yurchak ... Vira Oksentyuk
Computer Design Systems. Theory and Practice | VOL. 6
Iryna Yurchak, et. al.Iryna Yurchak ... Vira Oksentyuk
01 Jan 2024
Computer Design Systems. Theory and Practice | VOL. 6

MLatom 3: A Platform for Machine Learning-Enhanced Computational Chemistry Simulations and Workflows.
Arif Ullah ... Yuxinxin Chen
Journal of Chemical Theory and Computation | VOL. 20
Arif Ullah, et. al.Arif Ullah ... Yuxinxin Chen
25 Jan 2024
Journal of Chemical Theory and Computation | VOL. 20

Generalizable clinical note section identification with large language models.
Weipeng Zhou ... Timothy A Miller
JAMIA open | VOL. 7
Weipeng Zhou, et. al.Weipeng Zhou ... Timothy A Miller
01 Jul 2024
JAMIA open | VOL. 7

Generating Novel Leads for Drug Discovery Using LLMs with Logical Feedback
Shreyas Bhat Brahmavar ... Ashwin Srinivasan
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
Shreyas Bhat Brahmavar, et. al.Shreyas Bhat Brahmavar ... Ashwin Srinivasan
24 Mar 2024
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discovering Photoswitchable Molecules for Drug Delivery with Large Language Models and Chemist Instruction Training.

Abstract

Talk to us

Similar Papers

More From: Pharmaceuticals (Basel, Switzerland)