Abstract

Active learning (AL) has become a powerful tool in computational drug discovery, enabling the identification of top binders from vast molecular libraries. To design a robust AL protocol, it is important to understand how AL parameters and data set features influence the outcomes. We use four affinity data sets for different targets (TYK2, USP7, D2R, Mpro) to systematically evaluate the performance of machine learning models [Gaussian process (GP) model and Chemprop model], sample selection protocols, and the batch size, based on metrics describing the overall predictive power of the model (R², Spearman rank, root-mean-square error) as well as the accurate identification of top 2%/5% binders (Recall, F1 score). Both models have a comparable Recall of top binders on large data sets, but the GP model surpasses the Chemprop model when training data are sparse. A larger initial batch size, especially on diverse data sets, increased the Recall of both models as well as the overall correlation metrics. However, for subsequent cycles, smaller batch sizes of 20 or 30 compounds proved to be desirable. Furthermore, adding artificial Gaussian noise to the data up to a certain threshold still allowed the model to identify clusters with top-scoring compounds. However, excessive noise (>1σ) did impact the model's predictive and exploitative capabilities.
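To make the evaluated protocol concrete, the sketch below outlines one possible AL loop of the kind described above: an initial labeled batch, a GP surrogate retrained each cycle, greedy acquisition of a small per-cycle batch, and per-cycle reporting of R², Spearman rank, RMSE, and Recall of the top 2% binders. It is a minimal illustration only; the feature matrix, labels, kernel, acquisition rule, and batch sizes are assumptions and do not reproduce the paper's data sets or settings.

import numpy as np
from scipy.stats import spearmanr
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel
from sklearn.metrics import mean_squared_error, r2_score

rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 128))                                  # stand-in molecular features
y = X[:, 0] - 0.5 * X[:, 1] + rng.normal(scale=0.3, size=2000)    # stand-in affinities (lower = better)

labeled = list(rng.choice(len(X), size=60, replace=False))        # initial batch (illustrative size)
pool = [i for i in range(len(X)) if i not in set(labeled)]
batch_size = 30                                                   # per-cycle acquisition size

for cycle in range(5):
    # Retrain the GP surrogate on everything labeled so far.
    gp = GaussianProcessRegressor(kernel=RBF() + WhiteKernel(), normalize_y=True)
    gp.fit(X[labeled], y[labeled])

    # Greedy (exploitative) selection: acquire the compounds predicted to bind best.
    pred_pool = gp.predict(X[pool])
    picks = [pool[i] for i in np.argsort(pred_pool)[:batch_size]]
    labeled += picks
    pool = [i for i in pool if i not in set(picks)]

    # Evaluate on the remaining pool: overall fit plus Recall of the top 2% binders.
    pred = gp.predict(X[pool])
    k = max(1, len(pool) // 50)
    true_top = set(np.array(pool)[np.argsort(y[pool])[:k]])
    pred_top = set(np.array(pool)[np.argsort(pred)[:k]])
    recall = len(true_top & pred_top) / len(true_top)
    print(f"cycle {cycle}: R2={r2_score(y[pool], pred):.2f}, "
          f"rho={spearmanr(y[pool], pred)[0]:.2f}, "
          f"RMSE={np.sqrt(mean_squared_error(y[pool], pred)):.2f}, "
          f"top-2% recall={recall:.2f}")

The noise experiment mentioned in the abstract could be mimicked in such a sketch by perturbing y with rng.normal(scale=...) before training, with the scale expressed in units of the label standard deviation.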
