Comparison on PPCA, KPPCA and MPPCA Based Missing Data Imputing for Traffic Flow

Yuebiao Li,Zhiheng Li,Li Li,Maojing Jin,Yi Zhang

doi:10.1061/9780784413036.155

Yuebiao Li, Zhiheng Li + Show 3 more

https://doi.org/10.1061/9780784413036.155

Copy DOI

Export

Save

Cite

Publication Date: Jun 11, 2013

Citations: 16

Affiliation: Tsinghua University

Abstract
Full-Text
Similar Papers

Abstract

Listen

In recent studies, the Probabilistic Principal Component Analysis (PPCA) for imputing missing data was shown to be a good tool for traffic flow data processing. The PPCA method has two major benefits: it results in significantly smaller reconstruction errors and much less computation time, which make it outperform the conventional historical and regression imputing methods. In this paper, the possibility of applying more complex PPCA methods, e.g. Kernel Probabilistic Principal Component Analysis (KPPCA) or Mixed Probabilistic Principal Component Analysis (MPPCA), to impute missing data is explored. Considering reconstruction errors and computation time cost, test results show that the basic PPCA method is still our first choice in missing data imputing for traffic flow for online systems.

Full Text