CASTELO: clustered atom subtypes aided lead optimization\u2014a combined machine learning and molecular modeling method

Leili Zhang,Chih-Chieh Yang,Guojing Cong,Ruhong Zhou,Seung-Gu Kang,Giacomo Domeniconi

doi:10.1186/s12859-021-04214-4

Abstract

BackgroundDrug discovery is a multi-stage process that comprises two costly major steps: pre-clinical research and clinical trials. Among its stages, lead optimization easily consumes more than half of the pre-clinical budget. We propose a combined machine learning and molecular modeling approach that partially automates lead optimization workflow in silico, providing suggestions for modification hot spots.ResultsThe initial data collection is achieved with physics-based molecular dynamics simulation. Contact matrices are calculated as the preliminary features extracted from the simulations. To take advantage of the temporal information from the simulations, we enhanced contact matrices data with temporal dynamism representation, which are then modeled with unsupervised convolutional variational autoencoder (CVAE). Finally, conventional and CVAE-based clustering methods are compared with metrics to rank the submolecular structures and propose potential candidates for lead optimization.ConclusionWith no need for extensive structure-activity data, our method provides new hints for drug modification hotspots which can be used to improve drug potency and reduce the lead optimization time. It can potentially become a valuable tool for medicinal chemists.

Highlights

At a time of global health crisis, drug discovery is of utter importance to bring the society back to its order
We propose a computational method, coined Clustered Atom Subtypes aidEd Lead Optimization (CASTELO), that identifies modifiable submolecular moieties in a lead molecule to narrow down the substitution sites to a few possibilities
convolutional variational autoencoder (CVAE) method was adopted to compress the dynamism tensors into latent space before the data clustering with HDBSCAN

Summary

Introduction

At a time of global health crisis, drug discovery is of utter importance to bring the society back to its order. Despite the growing research and development expenditure every year [1, 2], the yearly FDA-approval of drugs has mostly stalled since 1993 [3]. There were a total of 3437 FDA approved small-molecule and large-molecule drugs or therapeutics in 2018 [4], with a yearly addition of only ∼ 1.2% (2014–2018 average). Drug discovery is a multi-stage process that comprises two costly major steps: pre-clinical research and clinical trials. Lead optimization consumes more than half of the pre-clinical budget. We propose a combined machine learning and molecular modeling approach that partially automates lead optimization workflow in silico, providing suggestions for modification hot spots

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jun 22, 2021
Citations: 4	License type: open-access

R Discovery Prime

R Discovery Prime

CASTELO: clustered atom subtypes aided lead optimization\u2014a combined machine learning and molecular modeling method

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

A Combination of Machine Learning and PBPK Modeling Approach for Pharmacokinetics Prediction of Small Molecules in Humans
Yuelin Li ... Lipeng Lai
Pharmaceutical Research | VOL. 41
Yuelin Li, et. al.Yuelin Li ... Lipeng Lai
25 Jun 2024
Pharmaceutical Research | VOL. 41

Toward Reducing HERG Affinities for Dat Inhibitors with a Combined Machine Learning and Molecular Modeling Approach
Andrew D Fant ... Lei Shi
Biophysical Journal | VOL. 116
Andrew D Fant, et. al.Andrew D Fant ... Lei Shi
01 Feb 2019
Biophysical Journal | VOL. 116

Toward Reducing hERG Affinities for DAT Inhibitors with a Combined Machine Learning and Molecular Modeling Approach.
Kuo Hao Lee ... Jianjing Cao
Journal of chemical information and modeling | VOL. 61
Kuo Hao Lee, et. al.Kuo Hao Lee ... Jianjing Cao
21 Aug 2021
Journal of chemical information and modeling | VOL. 61

Designing and Implementing Real-Time Bus Time Predictions using Artificial Intelligence
Benny Wai ... Winston Zhou
Transportation Research Record: Journal of the Transportation Research Board | VOL. 2674
Benny Wai, et. al.Benny Wai ... Winston Zhou
10 Sep 2020
Transportation Research Record: Journal of the Transportation Research Board | VOL. 2674

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CASTELO: clustered atom subtypes aided lead optimization\u2014a combined machine learning and molecular modeling method

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics