Abstract

Quantitative structure-activity relationship modeling techniques are widely employed in diverse areas of chemistry, including environmental risk assessment of compounds and virtual screening of drugs. The modeling process entails optimizing the hyper parameters of the model and selecting appropriate molecular descriptors. It is a typical optimization problem with high-dimensional, nonlinear, continuous and discrete variables coexisting. To address this challenge, this paper proposes an enhanced snake optimizer with multiple strategies. First, new food emergence and temperature change mechanisms are introduced to balance the exploration and exploitation. Second, a death and regeneration mechanism is employed to enhance the algorithm's ability to escape from local optima. Third, new formulas for food searching, fighting, mating and eating are designed to simplify the complexity of the algorithm and improve its search efficiency and accuracy. Fourth, the performance of the proposed algorithm is evaluated using the CEC2017 benchmark functions, eight engineering cases with continuous variables, and six feature selection cases with discrete variables. Experimental results demonstrate that the algorithm significantly outperforms comparison algorithms. Finally, the algorithm was applied to quantitative structure-activity relationship modeling for environmental toxicity prediction. Compared with other algorithms, the algorithm can better configure hyper-parameters and select fewer and more effective features for environmental toxicity prediction model. It will be an effective tool to improve the performance of quantitative structure-activity relationship models.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call