Abstract

This paper presents an original approach combining Artificial Neural Networks (ANNs) and clustering in order to detect pollutant peaks. We developed air quality forecasting models using machine learning methods applied to hourly concentrations of ozone (O 3), nitrogen dioxide (NO 2) and particulate matter (PM 10) 24 hours ahead. MultiLayer Perceptron (MLP) was used alone, then hybridized successively with hierarchical clustering and with a combination of self-organizing map and k-means clustering. Clustering methods were used to subdivide the dataset, and then an MLP was trained on each subset. Two urban sites of Corsica Island in the western Mediterranean Sea were investigated. These models showed a good global precision (Index of Agreement reaching 0.87 for O 3 , 0.80 for NO 2 and 0.74 for PM 10). Considering it is particularly important than forecasting model used on an operational basis correctly predict pollution peaks, a sensitivity analysis was performed using Receiver Operating Characteristic curves (ROC curves). It allowed to evaluate the behaviour and the robustness of the models for high concentration situations. The results show that for PM 10 and O 3 , hybrid models made of a combination of clustering and MLP outperform classical MLP most of the time for high concentration prediction. An operational tool has been built with the models presented in this paper, and is used for air quality forecasting in Corsica.

Highlights

  • IntroductionAir quality forecasting is an important part of Air Quality Monitoring Agreed Associations (AQMAA)’s missions, allowing the anticipation of pollution peak formation

  • Air quality is a major concern, both for public health and environment preservation

  • The hybrid models formed of various MultiLayer Perceptron (MLP) each trained with data subsets after a hierarchical clustering are referred to as hMLP

Read more

Summary

Introduction

Air quality forecasting is an important part of AQMAA’s missions, allowing the anticipation of pollution peak formation. Different air quality forecasting techniques have been developed in recent years (Zhang et al, 2012a) and two families of models can be distinguished. Deterministic models operate by modelizing all the physicochemical mechanisms responsible of the evolution of air quality. Statistical models must learn the underlying relationships between the different variables related to air quality to make their predictions. The first family of models, frequently called Chemical Transport Models (CTM), use similar principles to Numerical Weather Prediction (NWP) models.

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.