Potential drugs and nondrugs: prediction and identification of important structural features

Markus Wagener,Vincent J Van Geerestein

doi:10.1021/ci990266t

Abstract

Using decision trees, a model to discriminate between potential drugs and nondrugs has been developed. Compounds from the Available Chemical Directory and the World Drug Index databases were used as training set; the molecular structures were represented using extended atom types. The error rate on an independent validation data set is 17.4%. The number of false negatives can be reduced by penalizing the misclassification of drugs so that 92 out of 100 potential drugs are correctly recognized. At the same time, 34 out of 100 nondrugs are classified as potential drugs. The predictions of the model can be used to guide the purchase or selection of compounds for biological screening or the design of combinatorial libraries. The visualization of the generated models in the form of colored trees allowed us to identify a few, surprisingly simple features that explain the most significant differences between drugs and nondrugs in the training set: Just by testing the presence of hydroxyl, tertiary or secondary amino, carboxyl, phenol, or enol groups, already three quarters of all drugs could be correctly recognized. The nondrugs, on the other hand, are characterized by their aromatic nature with a low content of functional groups besides halogens. The general applicability of the model is shown by the predictions made for several Organon databases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Potential drugs and nondrugs: prediction and identification of important structural features

Abstract

Talk to us

Similar Papers

More From: Journal of chemical information and computer sciences

Lead the way for us

Journal: Journal of chemical information and computer sciences	Publication Date: Feb 25, 2000
Citations: 123

Similar Papers

Estimation of aqueous solubility of organic compounds with QSPR approach.
Hua Gao ... Veerabahu Shanmugasundaram
Pharmaceutical research | VOL. 19
Hua Gao, et. al.Hua Gao ... Veerabahu Shanmugasundaram
01 Jan 2002
Pharmaceutical research | VOL. 19

Combinatorial library designs for quantifying thin film adhesion via the edge delamination test
Jae Hyun Kim ... Martin Y M Chiang
Journal of Physics D: Applied Physics | VOL. 44
Jae Hyun Kim, et. al.Jae Hyun Kim ... Martin Y M Chiang
22 Dec 2010
Journal of Physics D: Applied Physics | VOL. 44

Predicting the Authenticity of Banknotes Using Supervised Learning
Priyam Guha ... Abhishek Verma
American Journal of Advanced Computing | VOL. 1
Priyam Guha, et. al.Priyam Guha ... Abhishek Verma
01 Apr 2020
American Journal of Advanced Computing | VOL. 1

Application of Self-Organizing Maps in Compounds Pattern Recognition and Combinatorial Library Design
Aixia Yan
Combinatorial Chemistry & High Throughput Screening | VOL. 9
Aixia YanAixia Yan
01 Jul 2006
Combinatorial Chemistry & High Throughput Screening | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Potential drugs and nondrugs: prediction and identification of important structural features

Abstract

Talk to us

Similar Papers

More From: Journal of chemical information and computer sciences