Linear and Nonlinear Methods in Modeling the Aqueous Solubility of Organic Compounds

Cornel Catana,Christian Orrenius,Pieter F W Stouten,Hua Gao

doi:10.1021/ci049797u

Abstract

Solubility data for 930 diverse compounds have been analyzed using linear Partial Least Square (PLS) and nonlinear PLS methods, Continuum Regression (CR), and Neural Networks (NN). 1D and 2D descriptors from MOE package in combination with E-state or ISIS keys have been used. The best model was obtained using linear PLS for a combination between 22 MOE descriptors and 65 ISIS keys. It has a correlation coefficient (r2) of 0.935 and a root-mean-square error (RMSE) of 0.468 log molar solubility (log S(w)). The model validated on a test set of 177 compounds not included in the training set has r2 0.911 and RMSE 0.475 log S(w). The descriptors were ranked according to their importance, and at the top of the list have been found the 22 MOE descriptors. The CR model produced results as good as PLS, and because of the way in which cross-validation has been done it is expected to be a valuable tool in prediction besides PLS model. The statistics obtained using nonlinear methods did not surpass those got with linear ones. The good statistic obtained for linear PLS and CR recommends these models to be used in prediction when it is difficult or impossible to make experimental measurements, for virtual screening, combinatorial library design, and efficient leads optimization.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Linear and Nonlinear Methods in Modeling the Aqueous Solubility of Organic Compounds

Abstract

Talk to us

Similar Papers

More From: Journal of Chemical Information and Modeling

Lead the way for us

Journal: Journal of Chemical Information and Modeling	Publication Date: Nov 24, 2004
Citations: 43

Similar Papers

Nonlinear Monitoring and Prediction Model in an Industrial Environmental Process
Chang Kyoo Yoo
JOURNAL OF CHEMICAL ENGINEERING OF JAPAN | VOL. 41
Chang Kyoo YooChang Kyoo Yoo
01 Jan 2008
JOURNAL OF CHEMICAL ENGINEERING OF JAPAN | VOL. 41

Nonlinear PLS modeling using neural networks
S.J Qin ... T.J Mcavoy
Computers & Chemical Engineering | VOL. 16
S.J Qin, et. al.S.J Qin ... T.J Mcavoy
01 Apr 1992
Computers & Chemical Engineering | VOL. 16

Iterative Error-based Nonlinear PLS Method for Nonlinear Chemical Process Modeling.
Kwang Gi Min ... In-Su Han
JOURNAL OF CHEMICAL ENGINEERING OF JAPAN | VOL. 35
Kwang Gi Min, et. al.Kwang Gi Min ... In-Su Han
01 Jan 2002
JOURNAL OF CHEMICAL ENGINEERING OF JAPAN | VOL. 35

A Novel Nonlinear Partial Least Square Integrated With Error-Based Extreme Learning Machine
Ze Dong ... Ning Ma
IEEE Access | VOL. 7
Ze Dong, et. al.Ze Dong ... Ning Ma
01 Jan 2019
IEEE Access | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Linear and Nonlinear Methods in Modeling the Aqueous Solubility of Organic Compounds

Abstract

Talk to us

Similar Papers

More From: Journal of Chemical Information and Modeling