Low-cost sensor networks can reduce monitoring costs while providing high-resolution spatiotemporal data on pollutant levels. However, these sensors have limitations, and many aspects of their field performance remain underexplored. From October to December 2023, this study deployed two identical low-cost sensor systems near an urban standard monitoring station to record PM2.5 and PM10 concentrations, along with ambient temperature and humidity. Evaluation of the sensors' monitoring performance revealed a broad data distribution and a systematic overestimation that was more pronounced in PM10 readings. The sensors showed good consistency (R² > 0.9, NRMSE < 5%). Normalization residuals were tracked to assess stability, and the sensors remained generally stable despite occasional environmental influences. A side-by-side comparison of four calibration models (MLR, SVR, RF, XGBoost) showed that RF and XGBoost outperformed the other two, with RF performing best on the test set. SHAP analysis identified the raw sensor readings as the most important input variable, underscoring their central role in the calibration models. Relative humidity (RH) was consistently more important than dew point and temperature, and higher RH levels typically had a positive impact on model outputs. The study indicates that, with appropriate calibration, low-cost sensors can supplement the sparse networks of regulatory-grade instruments, enabling dense neighborhood-scale monitoring and a better understanding of temporal air quality trends.
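The consistency figures quoted above (R² and NRMSE) can be illustrated with a minimal sketch. It assumes that consistency is measured between the two co-located units, that R² is the squared Pearson correlation, and that NRMSE is the RMSE divided by the mean of the first series; the abstract does not state which normalization the study uses. The function names and readings below are hypothetical and are not data from the study.

```python
import math


def r_squared(x, y):
    """Squared Pearson correlation between two equal-length series."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    return (s_xy * s_xy) / (s_xx * s_yy)


def nrmse_percent(x, y):
    """RMSE between x and y, normalized by the mean of x, in percent.

    Assumption: mean normalization. Range normalization is another
    common convention and would give a different value.
    """
    n = len(x)
    rmse = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / n)
    return 100.0 * rmse / (sum(x) / n)


# Hypothetical hourly PM2.5 readings (µg/m³) from two co-located units.
sensor_a = [12.0, 18.5, 25.1, 31.0, 40.2, 55.3]
sensor_b = [12.4, 18.1, 25.9, 30.2, 41.0, 54.6]

print(f"R^2   = {r_squared(sensor_a, sensor_b):.4f}")
print(f"NRMSE = {nrmse_percent(sensor_a, sensor_b):.2f} %")
```

With mean normalization, NRMSE reads as the typical disagreement between units relative to the average concentration, which makes a threshold such as 5% straightforward to interpret.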