Abstract
This paper compares the performance of Nonlinear Autoregressive Exogenous (NARX) Neural Network and Support Vector Machine (SVM) regression model to predict the Air Pollutant Index (API) in Malaysia. Two models namely the NARX and SVM regression were developed using the API and air quality time series data from three monitoring stations: Pasir Gudang, TTDI Jaya and Larkin. Hourly data of API and air quality parameters collected in year 2016 and 2018 were utilized to produce one step ahead API prediction. The air quality parameters consist of the NO2, SO2, CO, O3, PM2.5, PM10 concentration as well as three meteorological parameters which are wind speed, wind direction and ambient temperature. The NARX model was realized using a series-parallel feed-forward network. For the SVM regression model, different kernel functions: Linear, Quadratic, Cubic, Fine Gaussian, Medium Gaussian and Coarse Gaussian were evaluated. The performance of NARX and SVM regression was measured using the Root Mean Square Error (RMSE) and Coefficient of Determination (R2) values. Results show that the NARX model outperformed the SVM regression model in both 2016 and 2018 data respectively.
Highlights
Air pollution has been a major concern amongst the developing and developed country across the globe for decades
The results show that the Nonlinear Autoregressive Exogenous (NARX) model was superior than the Support Vector Machine (SVM) regression model for both 2016 and 2018 data
This research compared the performance of two air quality prediction models namely the NARX and SVM regression models
Summary
Air pollution has been a major concern amongst the developing and developed country across the globe for decades. The common air pollutants found in Malaysia are Nitrogen Dioxide (NO2), Sulphur Dioxide (SO2), Carbon Monoxide (CO), Ozone (O3) and Particulate Matter 10 (PM10). These air pollutants are the air quality parameters used to calculate the Air Pollutant Index (API) in the country. Starting 2017, the Particulate Matter 2.5 (PM2.5) was added in the API calculation. The API reading indicates the air pollution level and is calculated based on the concentration of the air quality parameters. The API for each pollutant is calculated individually and the highest API will be selected as the API for the particular hour. The pollutant with the highest API will be the responsible pollutant for the published API value
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have