Structure-based classification of active and inactive estrogenic compounds by decision tree, LVQ and kNN methods

Arja Asikainen, Mikko Kolehmainen, Juhani Ruuskanen, Kari Tuppurainen

doi:10.1016/j.chemosphere.2005.04.115

Abstract

The performance of decision tree (DT), learning vector quantization (LVQ), and k-nearest neighbour (kNN) methods in classifying active and inactive estrogenic compounds on the basis of their structure-activity relationships (SAR) was evaluated. A set of 311 compounds was used to construct the models, whose predictive power was verified with separate training and test sets. Principal components derived from molecular descriptors calculated with DRAGON software were used as the variables representing the structures of the compounds. In general, kNN had the best classification ability and DT the weakest, although the performance of each method depended on the group of compounds used for modelling. The best performance was obtained with kNN for the calf estrogen receptor data, where an average of 98.3% of compounds were correctly classified in the external tests. Overall, the results indicate that all the methods tested are suitable for the SAR classification of estrogenic compounds, producing models with predictive power ranging from adequate to excellent.
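As a rough illustration of the kNN step described in the abstract, the sketch below labels each held-out compound by majority vote among its nearest training compounds and reports the percentage classified correctly. It is a minimal sketch, not the authors' implementation: the two-component score vectors, the labels, the choice of k = 3, the Euclidean distance, and the function names `knn_predict` and `percent_correct` are all assumptions made for illustration, and the numbers are invented rather than taken from the paper's data.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Label `query` by majority vote among its k nearest training points."""
    # Euclidean distance in principal-component space (an assumed metric).
    dists = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def percent_correct(train_X, train_y, test_X, test_y, k=3):
    """Percentage of test compounds whose predicted label matches the true one."""
    hits = sum(
        knn_predict(train_X, train_y, x, k) == y for x, y in zip(test_X, test_y)
    )
    return 100.0 * hits / len(test_y)


# Hypothetical principal-component scores (two components per compound).
# These values are invented for illustration only.
train_X = [(-2.0, -1.5), (-1.8, -2.2), (-2.5, -1.0),
           (2.1, 1.7), (1.6, 2.4), (2.8, 1.2)]
train_y = ["inactive", "inactive", "inactive", "active", "active", "active"]

# Separate test set, mirroring the training/test split the abstract mentions.
test_X = [(-2.2, -1.8), (2.0, 2.0), (-1.5, -1.2), (2.4, 0.9)]
test_y = ["inactive", "active", "inactive", "active"]

accuracy = percent_correct(train_X, train_y, test_X, test_y, k=3)
print(f"Correctly classified: {accuracy:.1f}%")
```

In a real workflow the score vectors would come from a principal component analysis of the computed molecular descriptors, and k would be tuned on the training data rather than fixed in advance.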


Journal: Chemosphere
Publication Date: Jun 29, 2005
Citations: 32

Similar Papers

Fault Detection and Isolation of Non-Gaussian and Nonlinear Processes Based on Statistics Pattern Analysis and the k-Nearest Neighbor Method
Zhe Zhou ... Jian Wang
ACS omega | VOL. 7
26 May 2022

Classification of High‐Activity Tiagabine Analogs by Binary QSAR Modeling
Andreas Jurik ... Gerhard F Ecker
Molecular Informatics | VOL. 32
15 May 2013

Estimation of stand volumes using the k-nearest neighbors method in Kyushu, Japan
Tsuyoshi Kajisa ... Shigejiro Yoshida
Journal of Forest Research | VOL. 13
04 Jun 2008

Implementation of the K-Nearest Neighbor (kNN) Method to Determine Outstanding Student Classes
Nanda Fahrezi Munazhif ... Mila Nirmala Sari Hasibuan
SinkrOn | VOL. 8
04 Apr 2023
