Abstract

The use of the internet has evolved in quantitative structure–activity relationship (QSAR) over the past decade with the development of web based activities like the availability of numerous public domain software tools for descriptor calculation and chemometric toolboxes. The importance of chemometrics in QSAR has accelerated in recent years for processing the enormous amount of information in form of predictive mathematical models for large datasets of molecules. With the availability of huge numbers of physicochemical and structural parameters, variable selection became crucial in deriving interpretable and predictive QSAR models. Among several approaches to address this problem, the principle component regression (PCR) and partial least squares (PLS) analyses provide highly predictive QSAR models but being more abstract, they are difficult to understand and interpret. Genetic algorithm (GA) is a stochastic method well suited to the problem of variable selection and to solve optimization problems. Consequently the hybrid approach (GA-MLR) combining GA with multiple linear regression (MLR) may be useful in derivation of highly predictive and interpretable QSAR models. In view of the above, a comparative study of stepwise-MLR, PLS and GA-MLR in deriving QSAR models for datasets of α1-adrenoreceptor antagonists and β3-adrenoreceptor agonists has been carried out using the public domain software Dragon for computing descriptors and free Matlab codes for data modeling.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call