Accurate prediction of natural gas purchase volumes is crucial for both the economy and the environment. It not only facilitates the rational allocation of resources for companies but also helps to reduce operational costs. Although existing prediction methods have achieved some success in addressing the nonlinear relationships in natural gas purchases, there remains potential for further improvement. To address this issue, a stacking ensemble learning model was developed to enhance the ability to handle complex nonlinear problems. This model integrates diverse algorithms and incorporates weather factors, while regionalizing characteristics of natural gas usage, thereby achieving accurate forecasts of natural gas purchase volumes. We selected three distinctly different base models—Informer, multiple linear regression (MLR), and support vector regression (SVR)—for our research. By conducting four different feature combination experiments for each base model, including weather, time, regional, and usage features, we constructed 12 foundational models. Subsequently, we integrated these base models using a meta-learner to form the final stacking ensemble model. The experimental results indicate that the stacking ensemble model outperforms individual models across key metrics, including R2, MRE, and RMSE. Notably, the R2 values improved by 4–15% compared to the 12 base models. The model was subsequently applied to predict natural gas purchase volumes in Pi County, Chengdu, China. In November 2024, a side-by-side comparison of the predicted and actual data revealed a maximum error of just 5.39%. This exceptional accuracy effectively meets forecasting requirements, underscoring the model’s predictive strength in the energy sector.
Read full abstract