Quantitative structure-activity relationship (QSAR) modelling methodologies need to incorporate relevant mechanistic information to achieve high predictive performance and validity. Electrophilic reactivity is a common mechanistic feature of skin sensitization endpoints and can be concisely characterized with electronic descriptors, which is key to enabling the modelling of small datasets in this domain. However, quantum mechanical methodologies have previously carried high computational costs that precluded their use on large datasets. Consequently, we investigate the use of electronic descriptors calculated with the Hartree-Fock with three corrections (HF-3c) method, a low-cost ab initio methodology with higher chemical accuracy than previous semiempirical methodologies, for modelling in vitro skin sensitization assay outcomes. We also model the Ames assay as a surrogate for determining skin sensitization outcomes. Quantum chemical descriptors calculated using the HF-3c method with conductor-like polarizable continuum model (CPCM) implicit solvation improved QSAR model performance for the in vitro Ames (n = 6049, AUC = 0.770), KeratinoSens (n = 164, AUC = 0.763), and Direct Peptide Reactivity Assay (n = 122, AUC = 0.750) datasets, and their combination produced high predictive performance for unseen in vivo Local Lymph Node Assay (n = 86, AUC = 0.789) and Human Repeated Insult Patch Test (n = 86, AUC = 0.791) toxicant outcomes.