Covariance Tapering for Likelihood-Based Estimation in Large Spatial Data Sets

Cari G Kaufman,Mark J Schervish,Douglas W Nychka

doi:10.1198/016214508000000959

Abstract

Maximum likelihood is an attractive method of estimating covariance parameters in spatial models based on Gaussian processes. But calculating the likelihood can be computationally infeasible for large data sets, requiring O(n3) calculations for a data set with n observations. This article proposes the method of covariance tapering to approximate the likelihood in this setting. In this approach, covariance matrixes are “tapered,” or multiplied element wise by a sparse correlation matrix. The resulting matrixes can then be manipulated using efficient sparse matrix algorithms. We propose two approximations to the Gaussian likelihood using tapering. One of these approximations simply replaces the model covariance with a tapered version, whereas the other is motivated by the theory of unbiased estimating equations. Focusing on the particular case of the Matérn class of covariance functions, we give conditions under which estimators maximizing the tapering approximations are, like the maximum likelihood estimator, strongly consistent. Moreover, we show in a simulation study that the tapering estimators can have sampling densities quite similar to that of the maximum likelihood estimator, even when the degree of tapering is severe. We illustrate the accuracy and computational gains of the tapering methods in an analysis of yearly total precipitation anomalies at weather stations in the United States.

Highlights

Much recent work has focused on the problem of estimating the autocovariance functions of spatially correlated stochastic processes
Consider taking the direct product of the true covariance function K0(x; θ) and a “tapering function” Ktaper(x; γ), an isotropic correlation function which is identically zero outside a range described by γ
When taking an increasing number of observations from a spatial process, we either can assume that the domain of the sampling region S increases to infinity, called “increasing domain” asymptotics, or we can assume that S is bounded and that observations become increasingly dense within S, called “bounded domain” or “infill” asymptotics

Summary

Introduction

Much recent work has focused on the problem of estimating the autocovariance functions of spatially correlated stochastic processes. If numerical methods such as numerical maximization or Markov chain Monte Carlo are required, estimation will involve repeated evaluations of the likelihood Techniques for overcoming this computational hurdle have been developed mainly for datasets in which the sampling locations form a regular lattice, in which case spectral methods can be used (Whittle, 1954; Guyon, 1982; Stein, 1995; Dahlhaus, 2000). For analyzing large irregularly spaced spatial datasets, it is desirable to have a method which will reduce computational expense, while producing results comparable to those that would have been given by exact likelihood-based techniques To address this need, we propose using the method of covariance tapering, in which covariance matrices are multiplied element-wise by a compactly supported correlation matrix, giving matrices which can be be manipulated using more efficient sparse matrix algorithms. The approximations we have developed will be used to facilitate fitting the model

Current techniques for large spatial datasets

Covariance tapering

Likelihood approximation via covariance tapering

Performance of estimators using tapering methods

Matern covariance functions

Equivalence under tapered and untapered covariance functions

Almost sure convergence under the approximations

Simulation study

Joint estimation of σ2 and ρ

Estimating σ2 for fixed values of ρ

Estimating uncertainty in the tapering-based estimators

Climate modeling application

Proposed research

Comparing tapering functions

Extending methods and theoretical results for estimators

Estimating functions

Profile likelihoods

Proof of Theorem 1

Proof of Theorem 2

Proof of Theorem 3

Information matrices

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of the American Statistical Association	Publication Date: Dec 1, 2008
Citations: 481	License type: cc-by

R Discovery Prime

R Discovery Prime

Covariance Tapering for Likelihood-Based Estimation in Large Spatial Data Sets

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of the American Statistical Association

Lead the way for us

Similar Papers

Covariance tapering for prediction of large spatial data sets in transformed random fields
Toshihiro Hirano ... Yoshihiro Yajima
Annals of the Institute of Statistical Mathematics | VOL. 65
Toshihiro Hirano, et. al.Toshihiro Hirano ... Yoshihiro Yajima
29 Mar 2013
Annals of the Institute of Statistical Mathematics | VOL. 65

Fixed Rank Kriging for Very Large Spatial Data Sets
Noel Cressie ... Gardar Johannesson
Journal of the Royal Statistical Society. Series B, Statistical methodology | VOL. 70
Noel Cressie, et. al.Noel Cressie ... Gardar Johannesson
04 Jan 2008
Journal of the Royal Statistical Society. Series B, Statistical methodology | VOL. 70

Estimation of spatial autoregressive models with measurement error for large data sets
Thomas Suesse
Computational Statistics | VOL. 33
Thomas SuesseThomas Suesse
08 Nov 2017
Computational Statistics | VOL. 33

An improved HASM method for dealing with large spatial data sets
Na Zhao ... Chuanfa Chen
Science China Earth Sciences | VOL. 61
Na Zhao, et. al.Na Zhao ... Chuanfa Chen
17 May 2018
Science China Earth Sciences | VOL. 61

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Covariance Tapering for Likelihood-Based Estimation in Large Spatial Data Sets

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of the American Statistical Association