In densely populated urban areas, PM2.5 has a direct impact on the health and quality of residents' life. Thus, understanding the disparities of PM2.5 is crucial for ensuring urban sustainability and public health. Traditional prediction models often overlook the spillover effects within urban areas and the complexity of the data, leading to inaccurate spatial predictions of PM2.5. We propose Deep Support Vector Regression (DSVR) that models the urban areas as a graph, with grid center points as the nodes and the connections between grids as the edges. Nature and human activity features of each grid are initialized as the representation of each node. Based on the graph, DSVR uses random diffusion-based deep learning to quantify the spillover effects of PM2.5. It leverages random walk to uncover more extensive spillover relationships between nodes, thereby capturing both the local and nonlocal spillover effects of PM2.5. And then it engages in predictive learning using the feature vectors that encapsulate spillover effects, enhancing the understanding of PM2.5 disparities and connections across different regions. By applying our proposed model in the northern region of New York for predictive performance analysis, we found that DSVR consistently outperforms other models. During periods of PM2.5 surges, the R-square of DSVR reaches as high as 0.729, outperforming non-spillover models by 2.5 to 5.7 times and traditional spatial metric models by 2.2 to 4.6 times. Therefore, our proposed model holds significant importance for understanding disparities of PM2.5 air pollution in urban areas, taking the first steps toward a new method that considers both the spillover effects and nonlinear feature of data for prediction.
Read full abstract