Urban rail transit features much strength, such as large capacity, safety and environment-friendliness, and it becomes a preferred choice for most passengers. It also plays a prominent part in solving urban traffic problems. In order to improve the operation efficiency of the urban rail transit system and achieve the goal of smart operations, this paper applies machine learning algorithms and completes the feature engineering of urban rail transit passenger flow data in terms of time, space and external factors. Based on passenger flow characteristics as collected, we build the short-term passenger flow forecast models, which include light gradient boosting machine (LightGBM) model, long short-term memory (LSTM) model, and LightGBM-LSTM fusion model. Besides, we construct the autoregressive integrated moving average (ARIMA) model and extreme gradient boosting (XGBoost) model for experiment comparison. Finally, we conduct the passenger flow forecasting experiments on the Hangzhou subway dataset based on the five prediction-models mentioned above. Then, five types of subway stations are selected (residential type, occupation type, residential-occupation type, business type, and transportation hubs type) and three accuracy evaluation indicators are chosen (mean absolute error, root mean square error and mean absolute percentage error) to evaluate the prediction accuracy of the five prediction models. The experimental results show that, the multi-feature machine learning model can effectively forecast urban rail transit short-term passenger flow which is difficult for the traditional time series model. However, the single model has poor adaptability to different types of subway stations. Compared with a single model, LightGBM-LSTM model equipped with merits of multiple models, fulfills a better function in forecasting and a better adaptability to different types of urban rail transit stations.
Read full abstract