Abstract

Activity cliffs (ACs) are formed by pairs of structurally similar compounds with large differences in potency. Predicting ACs is of high interest in lead optimization for drug discovery. Previous AC prediction models that focused on matched molecular pair (MMP) cliffs produced adequate performances. However, the extrapolation ability of these models is unclear because the main scaffold for MMPs, the core structure, could exist in both training and test data sets. Also, representation of MMPs did not consider the attachment points where the core and R-group substituents are connected. In this study, we aimed to improve a ligand-based AC prediction method using molecular fingerprints. We incorporated applicability domain, which was defined using R-path fingerprints to consider the local environment around an attachment point. Rigorous evaluation of the extrapolation ability of AC prediction models showed that MMP-cliffs were accurately predicted for nine biological targets. Furthermore, incorporation of training MMPs with cores distinct from those of test MMPs improved the predictability compared with using training MMPs with only similar cores.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call