Forecasting potential water pollution areas (PWPA) is essential for effective watershed management. However, there remains a limited understanding of the spatial-temporal features that influence water quality (WQ), and advanced technical methods for WQ forecasting. This study developed an integrated framework utilizing spatial-temporal graph convolution networks (STGCN) to enhance comprehension of the spatial-temporal features influencing WQ and to develop a practical module for integrating features into the WQ prediction. The Pearson correlation method and seasonal decomposition analysis described the WQ features. Subsequently, the spatial-temporal distribution of PWPA was assessed using both the comprehensive pollution index method and Cressman space interpolation technique. Data from 403 monitoring stations was collected from the Yangtze River basin (YZR), encompassing pollutants such as COD, TP and NH₄⁺-N. The finding revealed that maximum concentrations of COD (36.4 mg/L), TP (4.078 mg/L) and NH₄⁺-N (13.58 mg/L) exceeded the standard thresholds necessitating early warnings. A significant correlation among pollutants was observed with coefficients ranging from 0.356 to 0.475 (P < 0.001), indicating their potential utility in predicting PWPA. Seasonality components exhibited strong correlations with the original WQ (correlation coefficients ranging from 0.62 to 0.89), followed by residuals (from 0.37 to 0.61) and trend components (from 0.25 to 0.53). The geographic layout of WQ monitoring stations along river lines resembled a graph network structure, suggesting that watershed WQ prediction can be classified as a spatial-temporal prediction task. The STGCN model achieved R2 values ranging from 0.607 to 0.844 for each pollutant on the test datasets, surpassing traditional models such as RNN, LSTM, and GRU in predictive accuracy. PWPA occurrences were predominantly identified in the southwestern regions as well as within the middle and lower reaches of the YZR. These results validated that the developed framework is capable of forecasting PWPA in large-scale watersheds while supporting effective watershed management.
Read full abstract