Abstract

Wood et al. (J Am Stat Assoc 112(519):1199–1210, 2017) developed methods for fitting penalized regression spline based generalized additive models, with of the order of $10^4$ coefficients, to up to $10^8$ data. The methods offered a two to three order of magnitude reduction in computational cost relative to the most efficient previous methods. Part of the gain resulted from the development of a set of methods for efficiently computing model matrix products when model covariates each take only a discrete set of values substantially smaller than the sample size [generalizing an idea first appearing in Lang et al. (Stat Comput 24(2):223–238, 2014)]. Covariates can always be rounded to achieve such discretization, and it should be noted that the covariate discretization is marginal. That is, we do not rely on discretizing covariates jointly, which would typically require the use of very coarse discretization. The most expensive computation in model estimation is the formation of the matrix cross product $\mathbf{X}^{\mathsf{T}}\mathbf{W}\mathbf{X}$, where $\mathbf{X}$ is a model matrix and $\mathbf{W}$ a diagonal or tri-diagonal matrix. The purpose of this paper is to present a simple, novel and substantially more efficient approach to the computation of this cross product. The new method offers, for example, a 30-fold reduction in cross product computation time for the Black Smoke model dataset motivating Wood et al. (2017). Given this reduction in computational cost, the subsequent Cholesky decomposition of $\mathbf{X}^{\mathsf{T}}\mathbf{W}\mathbf{X}$ and the follow-on computation of $(\mathbf{X}^{\mathsf{T}}\mathbf{W}\mathbf{X})^{-1}$ become a more significant part of the computational burden, and we also discuss the choice of methods for improving their speed.
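The core saving from marginal discretization can be sketched as follows. This is an illustrative toy version, not the paper's algorithm: if a covariate takes only $m \ll n$ distinct values, the rows of $\mathbf{X}$ repeat, so $\mathbf{X}^{\mathsf{T}}\mathbf{W}\mathbf{X}$ can be accumulated over the $m$ unique rows of a compact marginal matrix rather than over all $n$ data rows (the index array `k` and matrix `Xbar` are hypothetical names for this sketch):

```python
import numpy as np

# Toy sketch of the marginal-discretization idea (not the paper's code):
# row i of the full model matrix X is row k[i] of a compact matrix Xbar,
# because the covariate takes only m << n distinct values.
rng = np.random.default_rng(0)
n, m, p = 100_000, 50, 10

k = rng.integers(0, m, size=n)       # discretization index for each data row
Xbar = rng.standard_normal((m, p))   # compact marginal model matrix (m rows)
w = rng.random(n)                    # diagonal of W

# Direct computation: O(n p^2) work on the expanded matrix.
X = Xbar[k]
direct = X.T @ (w[:, None] * X)

# Discretized computation: sum the weights falling on each unique row,
# then form the cross product at O(m p^2) cost.
wbar = np.bincount(k, weights=w, minlength=m)
fast = Xbar.T @ (wbar[:, None] * Xbar)

assert np.allclose(direct, fast)
```

The weight accumulation is $O(n)$, so for large $n$ the cost is dominated by the $O(mp^2)$ compact cross product rather than the $O(np^2)$ direct one.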

Highlights

  • A rate limiting step in computations involving large scale regression models is often the computation of weighted cross products, $X^{\mathsf{T}}WX$, of the model matrix, $X$, where $W$ is diagonal.

  • This paper provides new algorithms for computing $X^{\mathsf{T}}WX$ from discretized covariates that are more efficient than previous algorithms, thereby substantially reducing the computational burden of fitting large generalized additive models (GAMs) to large data sets.

  • The original Wood et al. (2017) method implemented a parallel version of the block Cholesky method of Lucas (2004), followed by a parallel formation of $(X^{\mathsf{T}}WX + S_\lambda)^{-1}$: the implementations scaled well and had good performance relative to LAPACK's Cholesky routines based on the reference BLAS, but were poor compared to LAPACK using a tuned BLAS, such as OpenBLAS (Xianyi et al. 2014).
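The role of the Cholesky step mentioned above can be illustrated with a small sketch. This is not the paper's parallel block implementation, just a minimal single-threaded version, and the identity penalty `S` stands in for the real penalty matrix $S_\lambda$:

```python
import numpy as np
from scipy.linalg import cho_factor, cho_solve

# Illustrative sketch only: once A = X^T W X + S_lambda has been formed,
# a Cholesky factorization yields both linear solves and, where the
# explicit inverse is needed, (X^T W X + S_lambda)^{-1}.
rng = np.random.default_rng(2)
n, p = 1000, 8
X = rng.standard_normal((n, p))
w = rng.random(n)
S = np.eye(p)                        # stand-in for the penalty matrix S_lambda
A = X.T @ (w[:, None] * X) + S       # symmetric positive definite

c, low = cho_factor(A)               # Cholesky factorization of A
A_inv = cho_solve((c, low), np.eye(p))  # inverse via p triangular solves

assert np.allclose(A @ A_inv, np.eye(p), atol=1e-8)
```

In a tuned-BLAS setting the `cho_factor` call maps onto LAPACK's Cholesky routines, which is why the choice of BLAS dominates the performance of this stage once the cross product itself is cheap.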


Summary

Introduction

A rate limiting step in computations involving large scale regression models is often the computation of weighted cross products, $X^{\mathsf{T}}WX$, of the model matrix, $X$, where $W$ is diagonal (or, in this paper, sometimes tri-diagonal). In its most basic form a GAM is a generalized linear model in which the linear predictor depends on unknown smooth functions, $f_j$, of covariates $x_j$ (possibly vector valued). The alternative is to find ways to exploit discretization when covariates are discretized individually (marginally), and Wood et al. (2017) provide a set of algorithms to do this. These latter methods include the important case of model interaction terms. The columns of $X$ relating to an interaction are given by a row-Kronecker product of a set of marginal model matrices, one for each marginal covariate of the interaction. These marginal covariates and their marginal model matrices are discretized separately.
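The row-Kronecker construction for interaction terms can be sketched as follows (a minimal illustration, not the discretized algorithm of the paper; the function name `row_kronecker` is our own): row $i$ of the interaction's model matrix is the Kronecker product of row $i$ of each marginal model matrix.

```python
import numpy as np

# Sketch of the row-Kronecker (row-tensor) product: for marginal model
# matrices X1 (n x p1) and X2 (n x p2), row i of the interaction's model
# matrix is kron(X1[i], X2[i]), giving an n x (p1*p2) matrix.
def row_kronecker(X1, X2):
    n, p1 = X1.shape
    _, p2 = X2.shape
    return (X1[:, :, None] * X2[:, None, :]).reshape(n, p1 * p2)

rng = np.random.default_rng(1)
X1 = rng.standard_normal((5, 2))
X2 = rng.standard_normal((5, 3))
Z = row_kronecker(X1, X2)

assert Z.shape == (5, 6)
assert np.allclose(Z[0], np.kron(X1[0], X2[0]))
```

Because each marginal matrix is discretized separately, the full $n \times p_1 p_2$ interaction matrix never needs to be formed explicitly when computing cross products.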

The basic discrete cross product algorithms
Proof of algorithm correctness
Discrete cross product algorithms for interaction terms
Parallelization and other numerically costly operations
Example
Findings
Conclusions

