Abstract

Bus operation scheduling is closely related to passenger flow. Accurate bus passenger flow prediction can help improve urban bus planning and service quality and reduce the cost of bus operation. Using machine learning algorithms to find the rules of urban bus passenger flow has become one of the research hotspots in the field of public transportation, especially with the rise of big data technology. Bus IC card data are an important data resource and are more valuable to passenger flow prediction in comparison with manual survey data. Aiming at the balance between efficiency and accuracy of passenger flow prediction for multiple lines, we propose a novel passenger flow prediction model based on the point-of-interest (POI) data and extreme gradient boosting (XGBoost), called PFP-XPOI. Firstly, we collected POI data around bus stops based on the Amap Web service application interface. Secondly, three dimensions were considered for building the model. Finally, the XGBoost algorithm was chosen to train the model for each bus line. Results show that the model has higher prediction accuracy through comparison with other models, and thus this method can be used for short-term passenger flow forecasting using bus IC cards. It plays a very important role in providing decision basis for more refined bus operation management.

Highlights

  • Bus transport is a critical component of the transportation system

  • To balance the efficiency and accuracy of prediction, we propose a novel passenger flow prediction model based on extreme gradient boosting (XGBoost) and the point-ofinterest (POI) data, referred to as PFP-XPOI

  • The results demonstrate that the PFP-XPOI model performs better in prediction and improves the prediction accuracy due to the addition of new features

Read more

Summary

Introduction

Bus transport is a critical component of the transportation system. With the significant progress of urbanization, buses are becoming the leading force in public transportation.For example, Beijing has one of the most crowded bus networks at present. According to the statistics of Beijing Public Transport Corporation, in 2020, there were 1207 bus lines (including suburban lines) with a total length of 28,400 km. Availability of smart card data has offered more opportunities for the prediction work [5]. The prediction results can help the bus operators optimize resource scheduling and save operating costs as well as assist passengers in making better decisions by adjusting their travel paths and departure time. This approach is useful for the government to assess risk and guarantee public safety

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call