Abstract

Low-Cost Air Quality Monitoring Systems (LCAQMS) and Machine Learning techniques are enabling a new paradigm in Air Quality Monitoring Networks. Nevertheless, compliance with Data Quality Objective (DQO) is still an open point. The assessment of various calibration models proposed in literature has ever neglected the Concept Drift, i.e. the differences in data distributions associated with input and target variables of the streaming data coming from dynamic nonstationary environments. The influence of the concept drift is investigated on the maintenance of the (calibrated) low-cost instrumentation. The data from mid-term co-location campaigns are firstly used to train a Multiple Linear Regression as calibration model. Then, an original methodology based on the Two-Sample Kolmogorov-Smirnov Test is proposed for automatically detecting the presence of the concept drift. The time evolution of the Relative Expanded Uncertainty is considered as well, to highlight the negative influence of the concept shift on the metrological performance of the LCAQMS, proving the need for the estimation of a new calibration model during the maintenance of the instrumentation in order to match the DQOs. A quantitative analysis is carried out on the distances among the training and test distributions about the input and outputs of the calibration model by correlating them with the time evolution of uncertainty. The scheme of an add-on block based on the proposed approach is designed for the continuous monitoring of the metrological performance exhibited by the calibration model. This management of the concept drift is expected to allow the long-awaited achievement of DQOs.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call