Abstract

Short-term demand predictions, typically defined as less than an hour into the future, are essential for implementing dynamic control strategies and providing useful customer information in transit applications. Knowing the expected demand enables transit operators to deploy real-time control strategies in advance of the demand surge, and minimize the impact of abnormalities on the service quality and passenger experience. One of the most useful applications of demand prediction models in transit is in predicting the congestion on station platforms and crowding on vehicles. These require information about the origin-destination (OD) demand, providing a detailed profile of how and when passengers enter and exit the service. However, existing work in the literature is limited and overwhelmingly focuses on forecasting passenger <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">arrivals</i> at stations. This information, while useful, is incomplete for many practical applications. We address this gap by developing a scalable methodology for real-time, short-term OD demand prediction in transit systems. Our proposed model consists of three modules: multi-resolution spatial feature extraction module for capturing the local spatial dependencies with a channel-wise attention block, auxiliary information encoding module (AIE) for encoding the exogenous information, and a module for capturing the temporal evolution of demand. The OD demand at time <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$t$ </tex-math></inline-formula> , represented as a <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$N \times N$ </tex-math></inline-formula> matrix, is processed in two separate branches. In one branch we use the discrete wavelet transform (DWT) to decompose the demand into its different time and frequency variations, detecting patterns that are not visible in the raw data. In the other, three convolutional neural network (CNN) layers are utilized to learn the spatial dependencies from the OD demand directly. Instead of treating each channel of the resultant transformation equally, we use a <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">squeeze-and-excitation</i> layer to weight feature maps based on their contribution to the final prediction. A Convolutional Long Short-term Memory network (ConvLSTM) is then used to capture the temporal evolution of demand. The approach is demonstrated through a case study using 2 months of Automated Fare Collection (AFC) data from the Hong Kong Mass Transit Railway (MTR) system. The extensive evaluation of the model shows the superiority of our proposed model compared to the other compared methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.