Abstract

This paper uses machine learning to refine a Land-use Regression (LUR) model and to estimate the spatial–temporal variation in BTEX concentrations in Kaohsiung, Taiwan. Using the Taiwanese Environmental Protection Agency (EPA) data of BTEX (benzene, toluene, ethylbenzene, and xylenes) concentrations from 2015 to 2018, which includes local emission sources as a result of Asian cultural characteristics, a new LUR model is developed. The 2019 data was then used as external data to verify the reliability of the model. We used hybrid Kriging-land-use regression (Hybrid Kriging-LUR) models, geographically weighted regression (GWR), and two machine learning algorithms—random forest (RF) and extreme gradient boosting (XGBoost)—for model development. Initially, the proposed Hybrid Kriging-LUR models explained each variation in BTEX from 37% to 52%. Using machine learning algorithms (XGBoost) increased the explanatory power of the models for each BTEX, between 61% and 79%. This study compared each combination of the Hybrid Kriging-LUR model and (i) GWR, (ii) RF, and (iii) XGBoost algorithm to estimate the spatiotemporal variation in BTEX concentration. It is shown that a combination of Hybrid Kriging-LUR and the XGBoost algorithm gives better performance than other integrated methods.

Highlights

  • Chemical and petroleum facilities are major emitters of volatile organic compounds (VOCs) into the environment [1,2]

  • The study districts contributed respective ~50% of sulfur oxides (SOx), ~60% of nitrogen oxides (NOx), and ~60% of VOCs to ambient pollutants in Kaohsiung in 2018

  • To address the problem of overfitting, 80% of data was used to train the XGBoost-Hybrid Land-use Regression (LUR) model and 20% to test it

Read more

Summary

Introduction

Chemical and petroleum facilities are major emitters of volatile organic compounds (VOCs) into the environment [1,2]. These industrial emissions include benzene, toluene, ethylbenzene, and xylenes, which are known as BTEX [3,4]. Ambient BTEX might result from various substances and sources, including traffic, gas stations, combustion processes, and households [5,6,7]. BTEX emissions have a significant effect on human health. The International Agency for Research on Cancer classifies benzene as carcinogenic for humans [8].

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call