Forecasting PM2.5 concentration levels using shallow machine learning models on the Monterrey Metropolitan Area in Mexico

César Alejandro Pozo-Luyo,Jorge M Cruz-Duarte,Ivan Amaya,José Carlos Ortiz-Bayliss

doi:10.1016/j.apr.2023.101898

César Alejandro Pozo-Luyo, Jorge M Cruz-Duarte + Show 2 more

https://doi.org/10.1016/j.apr.2023.101898

Copy DOI

Abstract

The Monterrey Metropolitan Area is one of the most densely populated and polluted regions in Latin America. Hence, providing early warnings to the population when pollutant concentrations reach high levels is critical. This allows people at higher health risk to make informed decisions about when to go out, mitigating future health complications. Using forecasting models, we can produce timely warnings for future concentration levels. In this work, we implement a set of short-term shallow machine learning models that would serve as a baseline for future forecasting analyses of PM2.5 concentration levels in the Monterrey Metropolitan Area. The proposed approach starts with multiple imputation through chained equations for missing value imputation, the incorporation of time metadata, and target winsorization. Then, we rely on the well-known random search for parameter optimization of the machine learning models and k-fold cross-validation, obtaining favorable results. We devise these models for a single-step and single-station analysis on an hourly multivariate air quality dataset (containing 77203 rows and 16 columns from the first hour of January 1, 2015 00:00:00 to April 17, 2022 23:00:00) and compare them using standard regression metrics. Therefore, we identify the forecasting model with the best performance, which was an Extra Trees Regressor with a Root Mean Squared Error of 0.013, a Mean Absolute Error of 0.006 (equivalent to a Mean Absolute Percentage Error of 0.294% and a Symmetric Mean Absolute Percentage Error of 0.078%), and a Maximum Error of 0.187μg/m3.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Forecasting PM2.5 concentration levels using shallow machine learning models on the Monterrey Metropolitan Area in Mexico

Abstract

Talk to us

Similar Papers

More From: Atmospheric Pollution Research

Lead the way for us

Journal: Atmospheric Pollution Research	Publication Date: Aug 22, 2023
Citations: 2

Similar Papers

Spatial modelling of soil salinity: deep or shallow learning models?
Aliakbar Mohammadifar ... Adrian L Collins
Environmental Science and Pollution Research | VOL. 28
Aliakbar Mohammadifar, et. al.Aliakbar Mohammadifar ... Adrian L Collins
23 Mar 2021
Environmental Science and Pollution Research | VOL. 28

Comparing multi-step ahead building cooling load prediction using shallow machine learning and deep learning models
Raghavendra Chalapathy ... Nguyen Lu Dang Khoa
Sustainable Energy, Grids and Networks | VOL. 28
Raghavendra Chalapathy, et. al.Raghavendra Chalapathy ... Nguyen Lu Dang Khoa
01 Dec 2021
Sustainable Energy, Grids and Networks | VOL. 28

Classification of reflective writing: A comparative analysis with shallow machine learning and pre-trained language models
Chengming Zhang ... Florian Hofmann
Education and Information Technologies | VOL. -
Chengming Zhang, et. al.Chengming Zhang ... Florian Hofmann
02 May 2024
Education and Information Technologies | VOL. -

Gap Filling Cloudy Sentinel-2 NDVI and NDWI Pixels with Multi-Frequency Denoised C-Band and L-Band Synthetic Aperture Radar (SAR), Texture, and Shallow Learning Techniques
Kristofer Lasko
Remote Sensing | VOL. 14
Kristofer LaskoKristofer Lasko
27 Aug 2022
Remote Sensing | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Forecasting PM2.5 concentration levels using shallow machine learning models on the Monterrey Metropolitan Area in Mexico

Abstract

Talk to us

Similar Papers

More From: Atmospheric Pollution Research