Exploring isofunctional molecules: Design of a benchmark and evaluation of prediction performance.

Philippe Pinel,Stéphanie Labouille,Nicolas Drizard,Yann Gaston‐Mathé,Brice Hoffmann,Véronique Stoven,Gwenn Guichaoua,Matthieu Najm

doi:10.1002/minf.202200216

Philippe Pinel, Stéphanie Labouille + Show 6 more

Open Access

https://doi.org/10.1002/minf.202200216

Copy DOI

Journal: Molecular informatics	Publication Date: Feb 17, 2023
Citations: 2	License type: CC BY-NC-ND 4.0

Affiliation: Université Paris Sciences et Lettres

Abstract

Identification of novel chemotypes with biological activity similar to a known active molecule is an important challenge in drug discovery called 'scaffold hopping'. Small-, medium-, and large-step scaffold hopping efforts may lead to increasing degrees of chemical structure novelty with respect to the parent compound. In the present paper, we focus on the problem of large-step scaffold hopping. We assembled a high quality and well characterized dataset of scaffold hopping examples comprising pairs of active molecules and including a variety of protein targets. This dataset was used to build a benchmark corresponding to the setting of real-life applications: one active molecule is known, and the second active is searched among a set of decoys chosen in a way to avoid statistical bias. This allowed us to evaluate the performance of computational methods for solving large-step scaffold hopping problems. In particular, we assessed how difficult these problems are, particularly for classical 2D and 3D ligand-based methods. We also showed that a machine-learning chemogenomic algorithm outperforms classical methods and we provided some useful hints for future improvements.

Full Text