Abstract

In this paper, we develop a land-use regression model for sulfur dioxide air pollution concentrations. We make use of mobile monitoring data collected in Hamilton, Ontario, Canada, between 2005 and 2010 inclusive. The observed SO2 concentrations are regressed against a comprehensive set of land use and transportation variables. Land use and transportation variables are assessed as the amount of each characteristic within buffers of 50, 100, 200, 400, 800, and 1600 m around pollution observation locations. In the first instance of regression modeling, we apply ordinary least-squares regression. The OLS model R2 for training data was 0.38, and an R2 of 0.3 for a 50% held out cross-validation data set. The residuals are spatially correlated with the OLS model as determined with Moran's I. We thus applied a simultaneous autoregressive model, specifically the spatial error model. The resulting model slightly improved fit as determined by a pseudo R2 = 0.4, improved log-likelihood, and reduced MSE, RMSE, and MAE. The spatial error model residuals were not spatially auto-correlated, resulting in a valid model. SAR modeling is a natural extension to OLS regression models and solves the issue of spatial autocorrelation in model residuals with a one-stage model.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call