Traffic Flow Prediction With Missing Data Imputed by Tensor Completion Methods

Qin Li,Linhui Ye,Huachun Tan,Yuankai Wu,Fan Ding

doi:10.1109/access.2020.2984588

Qin Li, Linhui Ye + Show 3 more

Open Access

PDF Available

https://doi.org/10.1109/access.2020.2984588

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Missing data is inevitable and ubiquitous in intelligent transportation systems (ITSs). A handful of completion methods have been proposed, among which the tensor-based models have been shown to be the most advantageous for missing traffic data imputation. Despite their superior imputation accuracies, the adoption of these imputed data is not uniform in modern ITSs applications. The primary goal of this paper is to explore how to use tensor completion methods to support ITSs. In particular, we study how to improve traffic flow prediction accuracy under different missing scenarios. Specifically, three common missing scenarios including element-wise random missing, time-structured missing, and space-structured missing are considered. Four classical tensor completion models including Smooth PARAFAC Decomposition based Completion (SPC), CP Decomposition-based (CP-WOPT) Completion, Tucker Decomposition-based Completion (TDI), and High-accuracy Low-rank Tensor Completion (HaLRTC) are used to impute the missing data. Four well-known prediction methods including Support Vector Regression (SVR), K-nearest Neighbor (KNN), Gradient Boost Regression Tree (GBRT), and Long Short-term Memory (LSTM) are tested. The simple mean value interpolation completed traffic data is regarded as the baseline data. The extensive experiments show that improvements of traffic flow prediction can be achieved by increasing missing traffic data imputation accuracy at most cases. Interestingly we find that prediction accuracy cannot be improved by an imputation model when the sparsely observed training datasets already provide sufficient training samples.

Highlights

The Smooth PARAFAC decomposition based tensor completion (SPC), which is combined with the total variation (TV norm) or quadratic variation (QV norm) proposed by Yokota et al [28] has been proved to perform the best especially when the missing ratio is over 95%
Support Vector Regression (SVR) is more appropriate to small training datasets [51], [52], it provides the smallest mean absolute error (MAE) and root mean squared error (RMSE) using the data completed by the simple mean value interpolation
WORK To the best of our knowledge, this is the first paper to analyze the detailed effects of missing data and its completion to traffic flow prediction which is a basic technology of the intelligent transportation systems (ITSs)

Summary

INTRODUCTION

Due to the fixation on the number of parameters, parametric methods [38], such as Autoregressive Integrated Moving Average (ARIMA) [39], failed to fit complex functions, are unable to remain robust prediction for heterogeneous traffic data To avoid this problem, various nonparametric methods have been proposed, including Artificial Neural Networks (ANNS) [40], K-nearest Neighbor (KNN) [41], Support Vector Regression (SVR) [42], ensemble methods like the Gradient Boosting Regression Tree (GBRT) [43] and so on. They demonstrated completing missing data can improve the accuracy of traffic flow prediction Their analysis is not comprehensive, which only considered two kinds of mixed random missing scenarios, one kind of matrix completion based imputation method (PPCA), and just under low missing rates (up to 50% missing). All the slices of ω along the location mode, i.e. ω(:, :, i3) for all i3 ∈ {1, 2, · · · I3}, are set to be a randomly missing matrix M2 RI1×I2 with random entries 0 and 1

TENSOR COMPLETION BASED MISSING DATA IMPUTATION

PREDICTION METHODS

EVALUATION CRITERIA FOR IMPUTATION AND PREDICTION PERFORMANCES

DISCUSSIONS

Findings

CONCLUSION AND FUTURE WORK

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 24	License type: CC BY 4.0

R Discovery Prime

Traffic Flow Prediction With Missing Data Imputed by Tensor Completion Methods

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Tensor Completion based Prediction in Wireless Edge Caching
Navneet Garg ... Tharmalingam Ratnarajah
-
Navneet Garg, et. al.Navneet Garg ... Tharmalingam Ratnarajah
01 Nov 2020
01 Nov 2020

Urban traffic flow prediction: a dynamic temporal graph network considering missing values
Peixiao Wang ... Tong Zhang
International Journal of Geographical Information Science | VOL. 37
Peixiao Wang, et. al.Peixiao Wang ... Tong Zhang
16 Nov 2022
International Journal of Geographical Information Science | VOL. 37

LSTM-based traffic flow prediction with missing data
Yan Tian ... Bailin Yang
Neurocomputing | VOL. 318
Yan Tian, et. al.Yan Tian ... Bailin Yang
31 Aug 2018
Neurocomputing | VOL. 318

Research and application of urban real-time traffic flow prediction based on STARIMA
Wenchang Duan ... Zhiqiang Gong
-
Wenchang Duan, et. al.Wenchang Duan ... Zhiqiang Gong
20 Aug 2022
20 Aug 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Traffic Flow Prediction With Missing Data Imputed by Tensor Completion Methods

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: IEEE Access