Abstract. The time series prediction problem has a very wide range of applications in many fields. Most scholars use the LSTM class of algorithms for prediction. However, this method consumes a significant amount of computational power and presents several challenges. To address this issue, this paper proposes a time series forecasting method based on feature fusion and XGBoost. Specifically, we first utilize holiday information and the K-Means algorithm for feature extraction to expand the feature dimensions of the dataset, and then employ XGBoost as a model for training and prediction. Experiments demonstrate that the method proposed in this paper significantly reduces error compared to other traditional machine learning and deep learning methods, while the training time is much shorter than these methods. For example, compared with LSTM, the MSLE of this model decreases by 1.42%, while the training time is only 0.15% of that of LSTM. This greatly saves on training costs and computational power consumption. This confirms the effectiveness of using machine learning and clustering algorithms in time series prediction and provides new methods and practical application directions for future time series prediction models.
Read full abstract