A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data

Wesley Bertoli,Francisco Louzada,Katiane S Conceição,Marinho G Andrade

doi:10.3390/e23060646

Wesley Bertoli, Francisco Louzada + Show 2 more

Open Access

https://doi.org/10.3390/e23060646

Copy DOI

Abstract

Count datasets are traditionally analyzed using the ordinary Poisson distribution. However, said model has its applicability limited, as it can be somewhat restrictive to handling specific data structures. In this case, the need arises for obtaining alternative models that accommodate, for example, overdispersion and zero modification (inflation/deflation at the frequency of zeros). In practical terms, these are the most prevalent structures ruling the nature of discrete phenomena nowadays. Hence, this paper’s primary goal was to jointly address these issues by deriving a fixed-effects regression model based on the hurdle version of the Poisson–Sujatha distribution. In this framework, the zero modification is incorporated by considering that a binary probability model determines which outcomes are zero-valued, and a zero-truncated process is responsible for generating positive observations. Posterior inferences for the model parameters were obtained from a fully Bayesian approach based on the g-prior method. Intensive Monte Carlo simulation studies were performed to assess the Bayesian estimators’ empirical properties, and the obtained results have been discussed. The proposed model was considered for analyzing a real dataset, and its competitiveness regarding some well-established fixed-effects models for count data was evaluated. A sensitivity analysis to detect observations that may impact parameter estimates was performed based on standard divergence measures. The Bayesian p-value and the randomized quantile residuals were considered for the task of model validation.

Highlights

The ordinary Poisson (P ) distribution is often adopted for the analysis of count data, mainly due to its simplicity and having computational implementations available for most of the standard statistical packages
The formal concept behind the information matrix prior is closely related to the unit information prior [54], whose main idea is that the amount of information provided by a prior distribution must be the same as the amount of information contained in a single observation
Intensive Monte Carlo simulation studies were performed, and the obtained results have allowed us to assess the empirical properties of the Bayesian estimators and conclude about the suitability of the adopted methodology to the predefined scenarios

Summary

Introduction

The ordinary Poisson (P ) distribution is often adopted for the analysis of count data, mainly due to its simplicity and having computational implementations available for most of the standard statistical packages. A Bayesian approach for the zero-inflated Poisson (Z IP ) distribution was considered by [33], and by [34] in a regression framework with fixed-effects. This paper aims to extend the works of [42,43] in the sense of developing a new fixed-effects regression model for count data based on the zero-modified. P S distribution accounts for different levels of overdispersion, its zero-modified version is naturally a robust alternative, as it may accommodate discrepant points that would significantly impact the parameter estimates of the Z MP model. Local influence measures based on some well-known divergences were considered for the task of detecting influential points Model validation metrics such as the Bayesian p-value and the randomized quantile residuals are presented.

The ZMPS Regression Model

Inference

Prior Distributions

Posterior Distributions and Estimation

Posterior Predictive Distribution

Simulation Study

Chromosomal Aberration Data Analysis

Findings

Concluding Remarks

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Entropy	Publication Date: May 21, 2021
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Entropy

Lead the way for us

Similar Papers

A new mixed-effects regression model for the analysis of zero-modified hierarchical count data.
Wesley Bertoli ... Katiane S Conceição
Biometrical Journal | VOL. 63
Wesley Bertoli, et. al.Wesley Bertoli ... Katiane S Conceição
19 Oct 2020
Biometrical Journal | VOL. 63

Models for Overdispersion Count Data with Generalized Distribution: An Application to Parasites Intensity
Öznur İşçi̇ Güneri̇ ... Burcu Durmuş
Journal of New Theory | VOL. -
Öznur İşçi̇ Güneri̇, et. al.Öznur İşçi̇ Güneri̇ ... Burcu Durmuş
30 Jun 2021
Journal of New Theory | VOL. -

Models of Count Data
...
-
, et. al. ...
09 May 2005
09 May 2005

Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer?
Hong Zhou ... Paul Z Siegel
Preventing Chronic Disease | VOL. 11
Hong Zhou, et. al.Hong Zhou ... Paul Z Siegel
27 Mar 2014
Preventing Chronic Disease | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Entropy