Abstract
PM2.5 is a critical air pollutant, and understanding its drivers is essential for regional air quality control. This study employed meteorological and pollutant variables to predict PM2.5 concentrations in Shanghai using interpretable tree-based models. The random forest (RF) model performed best, achieving MAE, RMSE, MBE, and R2 values of 3.279, 4.609, 1.254, and 0.971, respectively, improving accuracy by 42.1%-85.5% compared to AdaBoost. Shapley additive explanations (SHAP) analysis identified CO, SO2, and O3 as the most influential factors. Partial dependence plots (PDPs) showed SO2 had the strongest impact below 40 μg/m³, while NO2 exhibited a linear positive correlation with PM2.5 up to 60 μg/m³. Atmospheric pressure and rainfall were negatively correlated with PM2.5, with notable reductions in concentrations under high-pressure conditions and rainfall levels between 0 and 20 mm. Temperature and relative humidity showed complex relationships, with sharp increases in PM2.5 at temperatures between -5°C and 15°C and SHAP values declining for humidity above 90%. Wind speed exhibited a non-linear effect, with minimal influence at higher velocities. The combined effects of different pollutants can be intensified significantly at higher levels. These findings offer valuable guidance for urban air quality management and pollution mitigation strategies.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have