Data-augmented machine learning scoring functions for virtual screening of YTHDF1 m6A reader protein

Muhammad Junaid,Bo Wang,Wenjin Li

doi:10.1016/j.compbiomed.2024.109268

Abstract

Machine learning is rapidly advancing the drug discovery process, significantly enhancing speed and efficiency. Innovation in computer-aided drug design is primarily driven by structure- and ligand-based approaches. When the number of known inhibitors for a target is limited, data augmentation strategies are often preferred to enhance model performance. In this study, we developed predictive machine learning models for structure-based drug discovery leveraging multiple traditional machine learning algorithms trained with target and ligand dynamics-aware datasets.To illustrate our approach, we present a composite model that combines classification and regression to predict YTHDF1 inhibitors, utilizing PLEC features. YTHDF1, a key m6A reader protein involved in mRNA translation, is implicated in various cancers, making it a promising therapeutic target. Traditional structure-based virtual screening (SBVS) using generic scoring functions has struggled to identify potent YTHDF1 inhibitors due to the protein's unique binding characteristics. To overcome this, we developed YTHDF1-specific machine learning scoring functions (MLSFs) to enhance SBVS efficacy.We employed various data augmentation techniques to generate a comprehensive dataset, incorporating multiple conformations of ligands and the YTHDF1 protein. We have trained 64 YTHDF1-specific MLSFs using four machine learning algorithms and evaluated them on ten test sets, focusing on their predictive and ranking power. Our results demonstrate that the artificial neural network with protein-ligand extended connectivity fingerprints (ANN-PLEC) outperforms other MLSFs, consistently achieving high area under the precision-recall curve (PR-AUC) of 0.87. This method shows promise for targets with limited quantities of active molecules, providing a viable path forward for drug discovery research. The ANN-PLEC scoring function is made freely available on GitHub for other researchers to access and utilize https://github.com/JuniML/SBVS-YTHDF1/.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data-augmented machine learning scoring functions for virtual screening of YTHDF1 m6A reader protein

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine

Lead the way for us

Similar Papers

Comparative assessment of machine-learning scoring functions on PDBbind 2013
Mohamed A Khamis ... Walid Gomaa
Engineering Applications of Artificial Intelligence | VOL. 45
Mohamed A Khamis, et. al.Mohamed A Khamis ... Walid Gomaa
16 Jul 2015
Comparative assessment of machine-learning scoring functions on PDBbind 2013
Mohamed A Khamis ... Walid Gomaa

SCORCH: Improving structure-based virtual screening with machine learning classifiers, data augmentation, and uncertainty estimation
Miles Mcgibbon ... Douglas R Houston
Journal of Advanced Research | VOL. 46
Miles Mcgibbon, et. al.Miles Mcgibbon ... Douglas R Houston
25 Jul 2022
Journal of Advanced Research | VOL. 46

Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment.
Hossam M Ashtawy ... Nihar R Mahapatra
Journal of Chemical Information and Modeling | VOL. 58
Hossam M Ashtawy, et. al.Hossam M Ashtawy ... Nihar R Mahapatra
20 Dec 2017
Journal of Chemical Information and Modeling | VOL. 58

Artificial intelligence to deep learning: machine intelligence approach for drug discovery.
Rohan Gupta ... Pravir Kumar
Molecular diversity | VOL. 25
Rohan Gupta, et. al.Rohan Gupta ... Pravir Kumar
12 Apr 2021
Molecular diversity | VOL. 25

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data-augmented machine learning scoring functions for virtual screening of YTHDF1 m6A reader protein

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine