Abstract

Traffic flow forecasting is a critical task for urban traffic control and dispatch in the field of transportation, which is characterized by the high nonlinearity and complexity. In this paper, we propose an end-to-end deep learning based dual path framework, i.e., Spatial-Temporal Graph Attention Network (STGAT), for traffic flow forecasting. Specifically, different from previous structure-based approaches, STGAT can be directly generalized to the graph with arbitrary structure. Furthermore, STGAT is capable of handling long temporal sequence by stacking gated temporal convolution layer. The dual path architectures is proposed for taking both potential and existing spatial dependencies into account. By capturing potential spatial dependencies will naturally catch more useful information for forecasting. We design a gated fusion mechanism to combine the outputs from each path. The proposed model can be directly applicable to inductive learning tasks by introducing a graph attention mechanism into spatial-temporal framework, which means our model can be generalized to completely unseen graphs. Moreover, experimental results on two public real-world traffic network datasets, METR-LA and PEMS-BAY, show that our STGAT outperforms the state-of-the-art baselines. Additionally, we demonstrate the proposed model is competent for efficient migration between graphs with different structures.

Highlights

  • With the rapid development of machine learning and the emergence of new data resources, examining and predicting traffic conditions in smart cities is becoming more and more accurate [1]

  • METHODOLOGY we proposed the framework of SpatialTemporal Graph Attention Networks

  • We design the same procedures as DCRNN [6], verifying Spatial-Temporal Graph Attention Network (STGAT) on two public real-world large-scale traffic network datasets: (1) METR-LA including traffic speed data with 207 sensors collected from Mar 1st, 2012 to Jun 30th, 2012 for 4 months on the highways of Los Angeles County [33]

Read more

Summary

INTRODUCTION

With the rapid development of machine learning and the emergence of new data resources, examining and predicting traffic conditions in smart cities is becoming more and more accurate [1]. To reach more effective forecasting for traffic flow in long-term and complex spatial conditions, spatial-temporal graph modeling has received increasing attention with the advance of graph neural networks [4]. Despite showing the effectiveness by introducing the graph structure of data into a model, these approaches still face some shortcomings Most of these studies are structure-based [3], [4], [7]–[9], which have good performance on transductive learning tasks. Current studies for spatial-temporal graph modeling are effective to learn short-term or mid-term temporal dependencies, long-term temporal forecasting is still a problem. CNN-based approaches take the advantages of parallel computing, stable gradients, and low memory requirement These methods need to use many layers to capture long-term temporal dependency, which will inevitably loss of some critical information. The source codes of STGAT are publicly available from github

RELATED WORK
METHODOLOGY
FUSION
DATASETS
BASELINES
Findings
DISCUSSION AND CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call