Abstract

Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the accuracy of the prediction results. Therefore, this research attempts to solve this problem through three contributions, firstly, proposing the use the Pauta criterion from statistics as the standard for defining the anomaly criteria of expressway traffic flows. Through comparison with the common local outlier factor (LOF) method, the rationality and advantages of the Pauta criterion were expounded. Secondly, adding week attributes to data, and splitting the data based on the similarity characteristics of traffic flow time series in order to improve the accuracy and efficiency of data input. Thirdly, by introducing empirical mode decomposition (EMD) to decompose the signal before autoregressive integrated moving average (ARIMA) model training is carried out. The first two contributions are for efficiency, the third is to deal with the volatility and nonlinearity of the abnormal peak training data. Finally, the model is analyzed, based on the expressway toll data of the Jiangsu Province. The results show that the EMD-ARIMA model has more advantages than the ARIMA model when dealing with fluctuating data.

Highlights

  • Considering the advantages of empirical mode decomposition (EMD)-autoregressive integrated moving average (ARIMA) in abnormal peak prediction, this paper explores the potential of this hybrid model in traffic volume prediction

  • In comparison with the local outlier factor (LOF) algorithm which is based on density, the Pauta criterion is proved to be better in the 10,000–20,000 pcu/D, and

  • This paper demonstrates the rationality of the application of the Pauta criterion in the field of expressway traffic flow

Read more

Summary

Motivation

The Ministry of Transport of the People0 s Republic of China issued an early warning in 2019: the short-term surge in highway traffic is the main factor leading to a substantial increase in traffic congestion and crash risks, especially causing frequent fatal crashes in the afternoon and night. Recent studies have attempted to capture the spatial correlation between traffic variables on road networks by extending the time-series model to a multivariate form [10,11,12], it is proved that the obvious changes in the volatility of traffic data are predictable and may be caused by specific types of non-linear functions. Leveraging these unique attributes in different models may lead to more efficient and reliable predictions. EMD preprocessing was introduced into the hybrid model to deal with the volatility and nonlinearity of the abnormal peak traffic flow

Approaches to Traffic Predictions and Related Work
Autoregressive Integrated Moving Average Model
Local Outlier Factor
Data Set
Definition of Anomaly Criteria
Visual
Comparison of Traffic Flow Prediction Methods
Identification of Anomalous Peak Value of Traffic Volume
Findings
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call