Abstract

Predicting antitumor activity of compounds using regression models trained on a small number of compounds with measured biological activity is an ill-posed inverse problem. Yet, it occurs very often within the academic community. To counteract, up to some extent, overfitting problems caused by a small training data, we propose to use consensus of six regression models for prediction of biological activity of virtual library of compounds. The QSAR descriptors of 22 compounds related to the opioid growth factor (OGF, Tyr-Gly-Gly-Phe-Met) with known antitumor activity were used to train regression models: the feed-forward artificial neural network, the k-nearest neighbor, sparseness constrained linear regression, the linear and nonlinear (with polynomial and Gaussian kernel) support vector machine. Regression models were applied on a virtual library of 429 compounds that resulted in six lists with candidate compounds ranked by predicted antitumor activity. The highly ranked candidate compounds were synthesized, characterized and tested for an antiproliferative activity. Some of prepared peptides showed more pronounced activity compared with the native OGF; however, they were less active than highly ranked compounds selected previously by the radial basis function support vector machine (RBF SVM) regression model. The ill-posedness of the related inverse problem causes unstable behavior of trained regression models on test data. These results point to high complexity of prediction based on the regression models trained on a small data sample.

Highlights

  • Peptides are attracting increasing attention and have growing significance as therapeutics

  • Six regression models were trained on a dataset of 22 opioid growth factor (OGF)-related peptides with measured antitumor activity (1647 QSAR descriptors) and applied on a 429 virtual library members

  • Some of them showed more pronounced activity compared with the native OGF, they were less active than highly ranked compounds selected previously from the same virtual library by the radial basis function support vector machine (RBF support vector machines (SVM)) regression model

Read more

Summary

Introduction

Peptides are attracting increasing attention and have growing significance as therapeutics. They are Nature’s toolkit known to control and direct various cellular functions and intercellular communication events. Peptide-based therapeutics were only considered for hormonal disorders and hormone-dependent cancers. About half of the peptides in clinical trials address oncology, metabolic, infectious and cardiovascular diseases-related targets. It is expected in the future that peptide drugs will address other medical disorders as well. Some of the most applied peptide-based drugs today are glatiramer acetate for the treatment of multiple sclerosis [3], leuprolide acetate, a GnRH receptor agonist for the treatment of breast and prostate cancers [4] and exenatide, approved for the treatment of diabetes mellitus type 2 [5]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call