Abstract

The selection of an optimal set of molecular descriptors from a much larger collection of such regression variables is a vital step in the elaboration of most QSAR and QSPR models. The aim of this work is to continue advancing this important selection process by combining the enhanced replacement method (ERM) and the well-known genetic algorithms (GA). These approaches had previously proven to yield near-optimal results with a much smaller number of linear regressions than a full search. The newly proposed algorithms were tested on four different experimental datasets, formed by collections of 116, 200, 78, and 100 experimental records from different compounds and 1268, 1338, 1187, and 1306 molecular descriptors, respectively. The comparisons showed that the new alternative ERMp (combination of ERM with a GA population) further improves ERM, it has previously been shown that the latter is superior to GA for the selection of an optimal set of molecular descriptors from a much greater pool.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call