Abstract

BackgroundIn past, numerous quantitative structure-activity relationship (QSAR) based models have been developed for predicting anticancer activity for a specific class of molecules against different cancer drug targets. In contrast, limited attempt have been made to predict the anticancer activity of a diverse class of chemicals against a wide variety of cancer cell lines. In this study, we described a hybrid method developed on thousands of anticancer and non-anticancer molecules tested against National Cancer Institute (NCI) 60 cancer cell lines.ResultsOur analysis of anticancer molecules revealed that majority of anticancer molecules contains 18–24 carbon atoms and are dominated by functional groups like R2NH, R3N, ROH, RCOR, and ROR. It was also observed that certain substructures (e.g., 1-methoxy-4-methylbenzene, 1-methoxy benzene, Nitrobenzene, Indole, Propenyl benzene) are more abundant in anticancer molecules. Next, we developed anticancer molecule prediction models using various machine-learning techniques and achieved maximum matthews correlation coefficient (MCC) of 0.81 with 90.40 % accuracy using support vector machine (SVM) based models. In another approach, a novel similarity or potency score based method has been developed using selected fragments/fingerprints and achieved maximum MCC of 0.82 with 90.65 % accuracy. Finally, we combined the strength of above methods and developed a hybrid method with maximum MCC of 0.85 with 92.47 % accuracy.ConclusionsWe developed a hybrid method utilizing the best of machine learning and potency score based method. The highly accurate hybrid method can be used for classification of anticancer and non-anticancer molecules. In order to facilitate scientific community working in the field of anticancer drug discovery, we integrate hybrid and potency method in a web server CancerIN. This server provides various facilities that includes; virtual screening of anticancer molecules, analog based drug design, and similarity with known anticancer molecules (http://crdd.osdd.net/oscadd/cancerin).Electronic supplementary materialThe online version of this article (doi:10.1186/s12885-016-2082-y) contains supplementary material, which is available to authorized users.

Highlights

  • Introduction to methodology and encoding rulesJ Chem Inf Comput Sci. 1988;28(1):31–6.43

  • For improving the overall performance, we developed the hybrid method by doing an average of the normalized potency score and support vector machine (SVM) score

  • We tried to find out the pharmacophore of most active molecules, which could be responsible for the anticancer activity

Read more

Summary

Introduction

Introduction to methodology and encoding rulesJ Chem Inf Comput Sci. 1988;28(1):31–6.43. Numerous quantitative structure-activity relationship (QSAR) based models have been developed for predicting anticancer activity for a specific class of molecules against different cancer drug targets. Limited attempt have been made to predict the anticancer activity of a diverse class of chemicals against a wide variety of cancer cell lines. One of the major challenges in the field of drug discovery is to design effective drugs against cancer. Existing drugs have their limitations that includes, side effects of drugs, high toxicity, drug resistance towards current anticancer drugs [1]. Attempts have been made to develop computational methods to design/predict anticancer molecules. Various studies modelled the drug behaviour against multiple cancer cell lines using different genomics features. In spite of advances in genomics, modelling the behaviour of thousands of drug is still a

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call