KScore: a novel machine learning approach that is not dependent on the data structure of the training set

Scott Oloff, Ingo Muegge

doi:10.1007/s10822-007-9108-0

Abstract

Machine learning approaches currently used in Quantitative Structure-Activity Relationship (QSAR) model generation impose restrictions and/or make assumptions on how the training set descriptors correlate with a target activity. kScore has been developed as the first machine learning approach that does not require the training data to conform to a defined kernel, accommodates uneven data point distributions in the descriptor space, and optimizes the weight of each dimension in the descriptor space in order to identify the descriptors most relevant to the target property. Because kScore can adapt to virtually any correlation, generalization terms must be included to inhibit overtraining. To this end, the Structural Risk Minimization principle and the linear epsilon-insensitive loss terms have been added to the kScore optimization function. The resulting kScore algorithm has proven to be quite universal across several datasets: it either produces results similar to or outperforms the most predictive machine learning algorithms tested, such as SVM, kNN, Recursive Partitioning, Neural Networks, Gaussian Process, and the Bayesian Classifier.
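The abstract names two ingredients that are easy to illustrate in isolation: a linear epsilon-insensitive loss and a per-dimension weighting of the descriptor space. The Python sketch below is not the kScore algorithm, whose optimization function is not given here. The function names, the weighted Euclidean distance, and the SVR-style objective (a complexity penalty plus a weighted sum of losses, with trade-off constant `c`) are all assumptions chosen for illustration only.

```python
import math


def epsilon_insensitive_loss(y_true, y_pred, epsilon=0.1):
    """Linear epsilon-insensitive loss: errors within +/- epsilon cost nothing,
    larger errors are penalized linearly by the amount they exceed epsilon."""
    return [max(0.0, abs(t - p) - epsilon) for t, p in zip(y_true, y_pred)]


def weighted_distance(a, b, weights):
    """Euclidean distance with one non-negative weight per descriptor dimension.
    (Assumed form; the abstract does not specify how kScore applies its weights.)"""
    return math.sqrt(sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b)))


def regularized_objective(y_true, y_pred, weights, epsilon=0.1, c=1.0):
    """Hypothetical SVR-style objective: a complexity penalty on the weights
    plus c times the summed epsilon-insensitive loss. Not kScore's own function."""
    penalty = 0.5 * sum(w * w for w in weights)
    return penalty + c * sum(epsilon_insensitive_loss(y_true, y_pred, epsilon))
```

In this sketch, setting a dimension's weight to zero removes that descriptor from the distance entirely, which is one simple way a weight optimization could single out the descriptors most relevant to the target property.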

Journal: Journal of Computer-Aided Molecular Design
Publication Date: Feb 28, 2007
Citations: 9
