Abstract Drought prediction is crucial for mitigating risks and designing measures to alleviate its impact. Machine learning models have been widely applied in the field of drought prediction in recent years. This study concentrated on predicting meteorological droughts in southwest China, a region prone to frequent and severe droughts, particularly in areas with sparse meteorological station coverage. The long short-term memory (LSTM) predictive model, which is a deep learning model, was constructed by calculating standardized precipitation evapotranspiration index (SPEI) values based on 144 weather station observations from 1980 to 2020. The 5-fold cross-validation method was used for the hyperparameter optimization of the model. The LSTM model underwent comprehensive assessment and validation through multiple methods. This included the use of several accuracy assessment indicators and a comparison of results. The comparison covered different drought characteristics among the LSTM predictive model, the benchmark random forest (RF) predictive model, the historical drought situations, and the calculated SPEI values based on observations from 144 weather stations. The results showed that the training results of the LSTM predictive model basically agreed with the SPEI values calculated from weather station observations. The model-predicted variation trend of SPEI values for 2020 was similar to the variation in SPEI values calculated based on weather station observations. On the test set, the coefficient of determination (R 2), the root mean square error, the explained variance score, the Nash–Sutcliffe efficiency, and the Kling–Gupta efficiency were 0.757, 0.210, 0.802, 0.761, and 0.212, respectively. The total consistency rate of the drought grade was 59.26%. The spatial correlation distribution of SPEI values between LSTM model prediction and calculation from meteorological stations in 2020 was more than 0.5 for most regions. The correlation coefficients exceeded 0.6 in western Tibet and Chengdu Plains. Compared to the RF model, the LSTM model excelled in all five performance evaluation metrics and demonstrated a higher overall consistency rate for drought categories. The Kruskal–Wallis test for both the LSTM and RF models all indicated no significant difference in the distributions between the predicted and observed data. Scatter plots revealed that the prediction accuracy for both models in 2020 was suboptimal, with the SPEI showing a comparatively narrow range of values. Nonetheless, the LSTM model significantly outperformed the RF model in terms of prediction accuracy. In summary, the LSTM model demonstrated good overall performance, accuracy, and applicability. It has the potential to enhance dynamic drought prediction in regions with complex terrain, diverse climatic factors, and sparse weather station networks.
Read full abstract