Abstract

We describe robust methods for estimating the aqueous solubility of a set of 734 organic compounds from different structural classes based on multiple linear regression (MLR) and artificial neural networks (ANN) model. The structures were represented by atom-type electrotopological state (E-state) indices. The squared correlation coefficient and standard deviation for the MLR with 34 structural parameters were r 2=0.94 and s=0.58 for the training set of 675 compounds. For the test set of 21 compounds, the equivalent statistics were r 2 pred=0.80 and s=0.87, respectively. Neural networks gave a significant improvement using the same set of parameters, and the standard deviations were s=0.52 for the training set and s=0.75 for the test set when an artificial neural network with five neurons in the hidden layer was used. The results clearly show that accurate models can be rapidly calculated for the estimation of aqueous solubility for a large and diverse set of organic compounds using easily calculated structural parameters.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.