Abstract

Compositional data (CoDa) often appear in various fields such as biology, medicine, geology, chemistry, economics, ecology and sociology. Although existing Dirichlet and related models are frequently employed in CoDa analysis, sometimes they may provide unsatisfactory performances in modelling CoDa as shown in our first real data example. First, this paper develops a multivariate compositional inverse Gaussian (CIG) model as a new tool for analysing CoDa. By incorporating the stochastic representation (SR), the expectation–maximization (EM) algorithm (aided by a one-step gradient descent algorithm) can be established to solve the parameter estimation for the proposed distribution (model). Next, zero observations may be often encountered in the real CoDa analysis. Therefore, the second aim of this paper is to propose a new model (called as ZCIG model) through a novel mixture SR based on both the CIG random vector and a so-called zero-truncated product Bernoulli random vector to model CoDa with zeros. Corresponding statistical inference methods are also developed for both cases without/with covariates. Two real data sets are analysed to illustrate the proposed statistical methods by comparing the proposed CIG and ZCIG models with existing Dirichlet and logistic-normal models.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call