Abstract

Three regression methods, namely ridge regression (RR), partial least squares (PLS), and principal components regression (PCR), were used to develop models for the prediction of rat blood:air partition coefficient for increasingly diverse data sets. Initially, modeling was performed for a set of 13 chlorocarbons. To this set, 10 additional hydrophobic compounds were added, including aromatic and non-aromatic hydrocarbons. A set of 16 hydrophilic compounds was also modeled separately. Finally, all 39 compounds were combined into one data set for which comprehensive models were developed. A large set of diverse, theoretical molecular descriptors was calculated for use in the current study. The topostructural (TS), topochemical (TC), and geometrical or 3-dimensional (3D) indices were used hierarchically in model development. In addition, single-class models were developed using the TS, TC, and 3D descriptors. In most cases, RR outperformed PLS and PCR, and the models developed using TC indices were superior to those developed using other classes of descriptors.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call