Abstract
With the growing capabilities of Geographic Information Systems (GIS) and user-friendly software, statisticians today routinely encounter geographically referenced data containing observations from a large number of spatial locations and time points. Over the last decade, hierarchical spatiotemporal process models have become widely deployed statistical tools for researchers to better understand the complex nature of spatial and temporal variability. However, fitting hierarchical spatiotemporal models often involves expensive matrix computations whose complexity grows in cubic order with the number of spatial locations and temporal points, rendering such models infeasible for large data sets. This article offers a focused review of two methods for constructing well-defined, highly scalable spatiotemporal stochastic processes. Both of these processes can be used as "priors" for spatiotemporal random fields. The first approach constructs a low-rank process operating on a lower-dimensional subspace. The second approach constructs a Nearest-Neighbor Gaussian Process (NNGP) that ensures sparse precision matrices for its finite realizations. Both processes can be exploited as a scalable prior embedded within a rich hierarchical modeling framework to deliver full Bayesian inference. These approaches can be described as model-based solutions for big spatiotemporal datasets: the models ensure that the algorithmic complexity is ~n floating point operations (flops) per iteration, where n is the number of spatial locations. We compare these methods and provide some insight into their methodological underpinnings.
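The sparse-precision idea behind the NNGP can be illustrated with a minimal numerical sketch. This is not the paper's implementation; it is a Vecchia-style construction on hypothetical 1-D locations with an illustrative exponential covariance, in which each location conditions only on its m nearest previously ordered neighbors, yielding a precision matrix Q = (I − A)ᵀ D⁻¹ (I − A) with at most m nonzeros per row of A.

```python
import numpy as np

rng = np.random.default_rng(2)
n, m = 200, 10                        # n locations, m nearest neighbors (illustrative sizes)
s = np.sort(rng.uniform(0.0, 1.0, n))  # hypothetical ordered 1-D locations
D = np.abs(s[:, None] - s[None, :])    # pairwise distances
C = np.exp(-D)                         # exponential covariance, phi = 1 (assumed)

# NNGP/Vecchia factorization: each location i regresses on its m nearest
# neighbors among the previously ordered locations 0..i-1.
A = np.zeros((n, n))                   # sparse coefficient matrix (dense here for clarity)
d = np.empty(n)                        # conditional variances
d[0] = C[0, 0]
for i in range(1, n):
    N = np.argsort(D[i, :i])[:m]       # indices of nearest prior neighbors
    c = C[i, N]                        # cross-covariance with neighbors
    a = np.linalg.solve(C[np.ix_(N, N)], c)  # kriging weights given neighbors
    A[i, N] = a
    d[i] = C[i, i] - c @ a             # conditional (residual) variance

I_A = np.eye(n) - A
Q = I_A.T @ np.diag(1.0 / d) @ I_A     # NNGP precision: sparse when A is stored sparsely
```

Because A has at most m nonzeros per row, Q inherits sparsity, and the determinant and quadratic forms needed for the Gaussian likelihood are available in ~n flops rather than cubic order.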
Highlights
The increased availability of inexpensive, high-speed computing has enabled the collection of massive amounts of spatial and spatiotemporal datasets across many fields.
This has resulted in widespread deployment of sophisticated Geographic Information Systems (GIS) and related software, and the ability to investigate challenging inferential questions related to geographically referenced data.
Using the fact that P_B = P_{B_1} + P_{(I − P_{B_1})B_2}, which is a standard result in linear model theory, we find that the excess residual variability in the low-rank likelihood is summarized by y' P_{(I − P_{B_1})B_2} y, which can be substantial when r is much smaller than n.
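The projection identity above can be verified numerically. The sketch below uses hypothetical random design matrices B_1 (the rank-r part) and B_2, builds orthogonal projectors via pseudoinverses, and computes the quadratic form that summarizes the excess residual variability; the sizes and matrices are illustrative, not from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n, r, k = 50, 5, 10                     # n observations, rank-r part B1, k extra columns in B2

# Hypothetical design matrices: B = [B1  B2], with B1 the low-rank component.
B1 = rng.standard_normal((n, r))
B2 = rng.standard_normal((n, k))
B = np.hstack([B1, B2])

def proj(X):
    """Orthogonal projector onto the column space of X."""
    return X @ np.linalg.pinv(X)

P_B, P_B1 = proj(B), proj(B1)
P_rest = proj((np.eye(n) - P_B1) @ B2)  # projector onto (I - P_B1) B2

# Standard identity: P_B = P_B1 + P_{(I - P_B1) B2}.
# Excess residual variability missed by the rank-r fit:
y = rng.standard_normal(n)
excess = y @ P_rest @ y                 # nonnegative; large when r << n
```

The quadratic form `excess` is exactly the gap between the residual sums of squares of the full and the rank-r fits, which is why a low-rank likelihood can substantially overstate residual variance.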
Summary
The increased availability of inexpensive, high-speed computing has enabled the collection of massive amounts of spatial and spatiotemporal data across many fields. Model-based approaches for large spatial datasets proceed by exploiting either "low-rank" models or "sparsity". The former attempts to construct Gaussian processes on a lower-dimensional subspace (see, e.g., Wikle and Cressie, 1999; Higdon, 2002a; Kammann and Wand, 2003; Quiñonero-Candela and Rasmussen, 2005; Stein, 2007; Gramacy and Lee, 2008; Stein, 2008; Cressie and Johannesson, 2008; Banerjee et al., 2008; Crainiceanu et al., 2008; Sansó et al., 2008; Finley et al., 2009a; Lemos and Sansó, 2009; Cressie et al., 2010) in spatial, spatiotemporal, and more general Gaussian process regression settings.
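The low-rank idea can be sketched with a knot-based (predictive-process style) covariance approximation. The code below is a minimal illustration under assumed choices, a 1-D domain, an exponential covariance, and evenly spaced knots, showing how an n × n covariance is replaced by a rank-r surrogate built from r knots, so only r × r systems need to be solved.

```python
import numpy as np

def exp_cov(x1, x2, phi=1.0, sigma2=1.0):
    """Exponential covariance between two sets of 1-D locations (illustrative choice)."""
    d = np.abs(np.asarray(x1)[:, None] - np.asarray(x2)[None, :])
    return sigma2 * np.exp(-phi * d)

rng = np.random.default_rng(1)
n, r = 500, 25                           # n locations, r knots (illustrative sizes)
s = np.sort(rng.uniform(0.0, 10.0, n))   # hypothetical observed locations
knots = np.linspace(0.0, 10.0, r)        # knot locations spanning the domain

C_sk = exp_cov(s, knots)                 # n x r cross-covariance
C_kk = exp_cov(knots, knots)             # r x r knot covariance

# Low-rank surrogate: C_sk C_kk^{-1} C_ks has rank at most r, so likelihood
# evaluations reduce to r x r solves instead of cubic-order n x n factorizations.
C_lowrank = C_sk @ np.linalg.solve(C_kk, C_sk.T)
```

One known property visible in this sketch is that the low-rank surrogate never exceeds the true marginal variance at any location, which is one source of the excess residual variability discussed above.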