Assessing the aquatic toxicity originating from air pollutants is essential in sustaining water resources and maintaining the ecosystem's safety. Quantitative structure-activity relationship (QSAR) models provide a computational tool for predicting pollutant toxicity, facilitating the identification/evaluation of the contaminants and identifying responsible structural fragments. One-vs-all (OvA) QSAR is a tailored approach to address multi-class QSAR problems. The study aims to determine five distinct levels of aquatic hazard categories for airborne pollutants using OvA-QSAR modeling containing 254 air contaminants. This QSAR analysis reveals the critical descriptors of air pollutants to target for molecular modification. Various factors, including the selection of relevant mechanistic descriptors, data quality, and outliers, determine the reliability of QSAR models. By employing feature selection and outlier identification approaches, the robustness and accuracy of our QSAR models were significantly increased, leading to more reliable predictions in chemical hazard assessment. The results revealed that models using the Random Forest algorithm performed the best based on the selected descriptors, with internal and external validation accuracy ranging from 71.90% to 97.53% and 76.47%–98.03%, respectively. This study indicated that the aquatic risk of air contaminants might be attributed predominantly to their sp3/sp2 carbon ratio, hydrogen-bond acceptor capability, hydrophilicity/lipophilicity, and van der Waals volumes. These structures can be critical in developing innovative strategies to mitigate or avoid the chemicals' harmful effects. Supporting air quality improvement, this study contributes to the rapid implementation of measures to protect aquatic ecosystems affected by air pollution.
Read full abstract