Abstract

Modeling spatiotemporal data can be a challenge due to the plethora of processes, both independent and interacting, which may or may not contribute to the measurements. Characterization can be considered a complement to modeling by helping guide assumptions about generative processes and their representation in the data. For high-D signals, Dimensionality Reduction (DR) is a frequently implemented type of characterization designed to mitigate the effects of the so-called “curse of dimensionality”. For decades, Principal Component (PC) and Empirical Orthogonal Function (EOF) analysis has been used as a linear, invertible approach to dimensionality reduction and spatiotemporal analysis. Recent years have seen the additional development of a suite of nonlinear DR algorithms, frequently categorized as “manifold learning”. Here, we explore the idea of joint characterization of spatiotemporal data manifolds using the PC/EOF approach alongside two nonlinear DR approaches: Laplacian Eigenmaps (LE) and t-distributed Stochastic Neighbor Embedding (t-SNE). Starting with a synthetic example and progressing to global, regional, and field scale spatiotemporal datasets spanning roughly 5 orders of spatial magnitude and 2 orders of temporal magnitude, we show these three DR approaches can yield complementary information about the topology of spatiotemporal data manifolds. Compared to the PC/EOF projections, the nonlinear DR approaches yield more compact manifolds with decreased ambiguity in temporal endmembers (LE) and/or in spatiotemporal clustering (t-SNE), compared to the relatively diffuse temporal feature space produced by the PC/EOF approach. However, these properties are compensated by the greater interpretability of PCs and EOFs than of the LE or t-SNE dimensions, as well as significantly lower computational demand and diminished sensitivity to spatial aliasing for PCs/EOFs than LE or t-SNE. Taken together, we find the joint characterization using the three complementary DR approaches capable of providing substantially greater insight about the generative processes represented in spatiotemporal datasets than is possible using any single approach alone. This parsimonious, complementary characterization of both local manifold structure and global variance can advance remote sensing time series analysis by providing important context to constrain and guide design of effective spatiotemporal models.

Highlights

  • From agriculture to coastal erosion, and from vehicle traffic to disease transmission, many phenomena on Earth’s surface are inherently spatiotemporal: that is, variable across both space and time

  • We extend the concept of the temporal feature space beyond the linear domain, illustrating the potential for meaningful signal to exist across multiple variance scales within the same spatiotemporal dataset, and for the ability of complementary Dimensionality Reduction (DR) approaches to jointly capture that signal

  • Through the lens of Principal Component (PC)/Empirical Orthogonal Function (EOF) analysis, the data are decomposed onto a set of mutually orthogonal eigenvectors which are sorted on the basis of variance

Read more

Summary

Introduction

From agriculture to coastal erosion, and from vehicle traffic to disease transmission, many phenomena on Earth’s surface are inherently spatiotemporal: that is, variable across both space and time. The data analysis landscape has fundamentally shifted: today, spatiotemporal observations abound, and scientists are in need of effective and efficient tools to analyze patterns, inform modeling, and discriminate between signal and noise. This asymmetry between the volume of observations and capacity of inference tools is imperfectly represented by a comparison of Google Ngram word usage for” big data” versus “spatiotemporal” (Figure 1). While a wide range of analytic approaches can be used to characterize the pairwise statistical similarity structure of a dataset, here we focus on two nonlinear manifold learning algorithms: t-SNE and LE These two algorithms leverage neighborhood connectivity information in fundamentally different ways

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.