Quantitative structure-activity relationships (QSARs) built using machine learning methods, such as artificial neural networks (ANNs) are powerful in prediction of (antioxidant) activity from quantum mechanical (QM) parameters describing the molecular structure, but are usually not interpretable. This obvious difficulty is one of the most common obstacles in application of ANN-based QSAR models for design of potent antioxidants or elucidating the underlying mechanism. Interpreting the resulting models is often omitted or performed erroneously altogether. In this work, a comprehensive comparative study of six methods (PaD, PaD2 , weights, stepwise, perturbation and profile) for exploration and interpretation of ANN models built for prediction of Trolox-equivalent antioxidant capacity (TEAC) QM descriptors, is presented. Sum of ranking differences (SRD) was used for ranking of the six methods with respect to the contributions of the calculated QM molecular descriptors toward TEAC. The results show that the PaD, PaD2 and profile methods are the most stable and give rise to realistic interpretation of the observed correlations. Therefore, they are safely applicable for future interpretations without the opinion of an experienced chemist or bio-analyst. © 2018 Wiley Periodicals, Inc.
Read full abstract