A Two-Step Method for Missing Spatio-Temporal Data Reconstruction

Shifen Cheng,Feng Lu

doi:10.3390/ijgi6070187

Abstract

Missing data reconstruction is a critical step in the analysis and mining of spatio-temporal data; however, few studies comprehensively consider missing data patterns, sample selection and spatio-temporal relationships. As a result, traditional methods often fail to obtain satisfactory accuracy or address high levels of complexity. To combat these problems, this study developed an effective two-step method for spatio-temporal missing data reconstruction (ST-2SMR). This approach includes a coarse-grained interpolation method for considering missing patterns, which can successfully eliminate the influence of continuous missing data on the overall results. Based on the results of coarse-grained interpolation, a dynamic sliding window selection algorithm was implemented to determine the most relevant sample data for fine-grained interpolation, considering both spatial and temporal heterogeneity. Finally, spatio-temporal interpolation results were integrated by using a neural network model. We validated our approach using Beijing air quality data and found that the proposed method outperforms existing solutions in term of estimation accuracy and reconstruction rate.

Highlights

Following both the rapid development and popularization of geographic information and the enhancement of data collection, data with temporal and spatial attributes are quickly accumulated and form large numbers of spatio-temporal datasets [1]; missing data are extremely common; for example, missing data on air quality monitoring sensor readings, missing data on floating car track points or the absence of mobile phone signaling records
A large number of interpolation methods has been proposed to solve the problem of spatio-temporal missing data [4,5,6,7,8,9,10]. These methods can be roughly divided into three categories: spatial interpolation, temporal interpolation and spatio-temporal interpolation
Traditional methods (e.g., inverse distance weighting (IDW)) assume that the data distribution obeys the first law of geography, namely the closer data are in spatial distribution, the greater the contribution they make to missing data interpolation

Summary

Introduction

Following both the rapid development and popularization of geographic information and the enhancement of data collection, data with temporal and spatial attributes are quickly accumulated and form large numbers of spatio-temporal datasets [1]; missing data are extremely common; for example, missing data on air quality monitoring sensor readings, missing data on floating car track points or the absence of mobile phone signaling records. Due to the existence of spatial and temporal heterogeneity, the data distribution can show uneven characteristics and relationships according to different regions [15]; the accuracy of interpolation results obtained by existing methods remains unsatisfactory if data are not homogeneously distributed To solve this problem [16] considered spatial autocorrelation and heterogeneity in a study area and proposed a point estimation model of biased hospital-based area disease estimation (P-BSHADE). Using the correlation coefficient to determine the spatial and temporal weights, estimated values in spatial and temporal dimensions are integrated to obtain overall estimated values of missing data [2] This method requires the whole dataset to participate in computation, which leads to high computational complexity and a large volume of redundant data.

Method Framework

Coarse-Grained Interpolation

Sliding Window

Fine-Grained Spatial Dimension Interpolation

Fine-Grained Temporal Dimension Interpolation

Spatio-Temporal Integration

Datasets

Evaluation Criteria

Experimental Results

Overall Results

Effect of Coarse-Grained Interpolation

Effect of the Coarse-Grained Missing Data Rate

EEffect of Sliding Window

Performance Comparison for Different Datasets

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ISPRS International Journal of Geo-Information	Publication Date: Jun 23, 2017
Citations: 35	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Two-Step Method for Missing Spatio-Temporal Data Reconstruction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ISPRS International Journal of Geo-Information

Lead the way for us

Similar Papers

Bootstrap joint prediction regions for sequences of missing values in spatio-temporal datasets
Maria Lucia Parrella ... Cira Perna
Computational Statistics | VOL. 36
Maria Lucia Parrella, et. al.Maria Lucia Parrella ... Cira Perna
05 Apr 2021
Computational Statistics | VOL. 36

Lasagna Plots
Bruce J Swihart ... Naresh M Punjabi
Epidemiology | VOL. 21
Bruce J Swihart, et. al.Bruce J Swihart ... Naresh M Punjabi
01 Sep 2010
Epidemiology | VOL. 21

Examining solutions to missing data in longitudinal nursing research
Mary B Roberts ... Suzy B Winchester
Journal for Specialists in Pediatric Nursing | VOL. 22
Mary B Roberts, et. al.Mary B Roberts ... Suzy B Winchester
01 Apr 2017
Journal for Specialists in Pediatric Nursing | VOL. 22

Multiple imputation to deal with missing EQ-5D-3L data: Should we impute individual domains or the actual index?
Claire L Simons ... Judit Simon
Quality of Life Research | VOL. 24
Claire L Simons, et. al.Claire L Simons ... Judit Simon
04 Dec 2014
Quality of Life Research | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Two-Step Method for Missing Spatio-Temporal Data Reconstruction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ISPRS International Journal of Geo-Information