Abstract

Predicting the levels of a pollutant in a given area is an open problem, mainly because historical data are typically available at certain locations, where monitoring stations are located, but not at all locations in the area. This work presents an approach based on developing predictions at each of the points where an immission station is available; in this case, based on shallow Artificial Neural Networks, ANNs, and then using a simple geostatistical interpolation algorithm (Inverse Distance Weighted, IDW), a pollutant map is constructed over the entire study area, thus providing predictions at each point in the plane. The ANN models are designed to make 1 h ahead and 4 h ahead predictions, using an autoregressive scheme as inputs (in the case of 4 h ahead as a jumping strategy). The results are then compared using the Friedman and Bonferroni tests to select the best model at each location, and predictions are made with all the best models. In general, to the 1 h ahead prediction models, the optimal models typically have fewer neurons and require minimal historical data. For instance, the best model in Algeciras has an R of almost 0.89 and consists of 1 hidden neuron and 3 to 5 lags, similar to Colegio Los Barrios. In the case of 4h ahead prediction, Colegio Carteya station shows the best model, with an R of almost 0.89 and a MSE of less than 240, including 5 hidden neurons and different lags from the past. The results are sufficiently adequate, especially in the case of predictions 4 h into the future. The aim is to integrate the models into a tool for citizens and administrations to make decisions.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call