Given the significant health and environmental risks posed by atmospheric PM2.5 pollution, accurately predicting its concentration changes is especially important. Current models fall short in researching time-series feature extraction from pollutants and spatial correlations among monitoring stations. In this study, a spatiotemporal prediction model is introduced to address these issues. The model combines spatial weighting, empirical mode decomposition (EMD), and a long short-term memory (LSTM) network. First, weights are allocated to sites using Pearson correlation analysis and distance weighting methods. Next, the pollutant time series is decomposed using the EMD method. The highly correlated intrinsic mode function (IMF) component is selected for signal reconstruction, enhancing denoising. Finally, the model uses an LSTM network to capture nonlinear and dynamic time series traits, which significantly improves the PM2.5 prediction accuracy. The model utilizes data collected from 10 monitoring stations across Hefei city during 2018-2019, employing the previous 24 h of observations to forecast PM2.5 concentrations for the subsequent hour. By comparing with RNN, HPO-RNN, GRU, LSTM, and CBAM-CNN-Bi LSTM, the results show that our model surpasses five benchmark models in terms of prediction accuracy. Relative to the best-performing CBAM-CNN-Bi LSTM model, our model reduces RMSE and MAE by 73.91% and 72.99%, respectively, and improves R2 by 8.15%. In summary, the proposed spatial weighting EMD-LSTM model offers an efficient new approach for predicting atmospheric PM2.5 pollution. It integrates spatial and time series analysis, significantly enhancing the prediction accuracy.
Read full abstract