Abstract

In order to develop a general approach for analysis of non-normal multivariate data, it would be desirable to obtain a simple-minded framework that can accommodate a wide variety of different types of data, much like generalized linear models do in the univariate case. There is no shortage of multivariate distributions available, but the main stumbling block so far has been the lack of a suitable multivariate form of exponential dispersion model. In the univariate case, an exponential dispersion model ED(μ, σ) is a twoparameter family parametrized by the mean μ and dispersion parameter σ, with variance σV (μ), where V denotes the unit variance function. The generalized linear models paradigm is based on combining a link function with a suitable linear model. Estimation uses quasi-likelihood for the regression parameters, and the Pearson statistic for estimating the dispersion parameter. We consider a new k-variate exponential dispersion model EDk(μ,Σ) aimed at providing a fully flexible covariance structure corresponding to a mean vector μ and a positive-definite dispersion matrix Σ. The covariance matrix is of the form Cov(Y ) = Σ⊙V (μ), where ⊙ denotes the Hadamard (elementwise) product between two matrices, and V (μ) denotes the (matrix) unit variance function. We consider a multivariate generalized linear model for independent response vectors Y i ∼ EDk(μi,Σ) defined by g(μ ⊤ i ) = xiB, where the link function g is applied coordinatewise to μ i , xi is an m-vector of covariates, and B is an m × k matrix of regression coefficients. We estimate the regression matrix B using a quasi-score function, and we estimate the dispersion matrix Σ using a multivariate Pearson statistic defined as a weighted sum of squares and cross-products matrix of residuals. This model specializes to the classical multivariate multiple regression model when g is the identity function and EDk(μ,Σ) is the multivariate normal distribution. The construction of the multivariate exponential dispersion model EDk(μ,Σ) is based on an extended convolution method, which makes the marginal distributions follow a given univariate exponential dispersion model. We illustrate the method by considering multivariate versions of the Poisson and gamma distributions, and discuss some of the challenges faced in the implementation of the method.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call