Overdispersed Count Data Research Articles

In pre-clinical and medical quality control, it is of interest to assess the stability of the process under monitoring or to validate a current observation using historical control data. Classically, this is done by the application of historical control limits (HCL) graphically displayed in control charts. In many applications, HCL are applied to count data, for example, the number of revertant colonies (Ames assay) or the number of relapses per multiple sclerosis patient. Count data may be overdispersed, can be heavily right-skewed and clusters may differ in cluster size or other baseline quantities (e.g., number of petri dishes per control group or different length of monitoring times per patient). Based on the quasi-Poisson assumption or the negative-binomial distribution, we propose prediction intervals for overdispersed count data to be used as HCL. Variable baseline quantities are accounted for by offsets. Furthermore, we provide a bootstrap calibration algorithm that accounts for the skewed distribution and achieves equal tail probabilities. Comprehensive Monte-Carlo simulations assessing the coverage probabilities of eight different methods for HCL calculation reveal, that the bootstrap calibrated prediction intervals control the type-1-error best. Heuristics traditionally used in control charts (e.g., the limits in Shewhart c- or u-charts or the mean ± 2 SD) fail to control a pre-specified coverage probability. The application of HCL is demonstrated based on data from the Ames assay and for numbers of relapses of multiple sclerosis patients. The proposed prediction intervals and the algorithm for bootstrap calibration are publicly available via the R package predint.

Read full abstract

In repeated measurements, regression to the mean (RTM) is a tendency of subjects with observed extreme values to move closer to the mean when measured a second time. Not accounting for RTM could lead to incorrect decisions such as when observed natural variation is incorrectly attributed to the effect of a treatment/intervention. A strategy for addressing RTM is to decompose the total effect, the expected difference in paired random variables conditional on the first being in the tail of its distribution, into regression to the mean and unbiased treatment effects. The unbiased treatment effect can then be estimated by subtraction. Formulae are available in the literature to quantify RTM for Poisson distributed data which are constrained by mean–variance equivalence, although there are many real life examples of overdispersed count data that are not well approximated by the Poisson. The negative binomial can be considered an explicit overdispersed Poisson process where the Poisson intensity is chosen from a gamma distribution. In this study, the truncated bivariate negative binomial distribution is used to decompose the total effect formulae into RTM and treatment effects. Maximum likelihood estimators (MLE) and method of moments estimators are developed for the total, RTM, and treatment effects. A simulation study is carried out to investigate the properties of the estimators and compare them with those developed under the assumption of the Poisson process. Data on the incidence of dengue cases reported from 2007 to 2017 are used to estimate the total, RTM, and treatment effects.

Read full abstract

Overdispersed Count Data Research Articles

Related Topics

Articles published on Overdispersed Count Data

Efficient Analysis of Overdispersed Data Using an Accurate Computation of the Dirichlet Multinomial Distribution.

Prediction Intervals for Overdispersed Poisson Data and Their Application in Medical and Pre-Clinical Quality Control.

Estimation of mean using under-reported and overdispersed count data

Modified ridge estimator in the Bell regression model

Regression to the mean for overdispersed count data

Semi-parametric approach for modelling overdispersed count data with application to Industry 4.0

Modified Bivariate Poisson-Lindley Model: Properties and Applications in Soccer

One-misrecorded Poisson INAR(1) model via two random operators with application to crime and economics data

New ridge parameter estimators for the quasi-Poisson ridge regression model

Poisson-New Quadratic-Exponential Distribution

Bootstrapping generalized linear models to accommodate overdispersed count data

High-dimensional covariate-augmented overdispersed poisson factor model.

A new over-dispersed count model based on Poisson-Geometric convolution

Premium Linear-Exponential Mixture of Poisson Distribution

Multilevel modeling in single-case studies with zero-inflated and overdispersed count data.

A variable clustering approach for overdispersed high-dimensional count data using a copula-based mixture model

Poisson XRani Distribution: An Alternative Discrete Distribution for Overdispersed Count Data

A New Effective Jackknifing Estimator in the Negative Binomial Regression Model

A truncated mean-parameterized Conway-Maxwell-Poisson model for the analysis of Test match bowlers

A Family of Finite Mixture Distributions for Modelling Dispersion in Count Data

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Overdispersed Count Data Research Articles

Related Topics

Articles published on Overdispersed Count Data

Efficient Analysis of Overdispersed Data Using an Accurate Computation of the Dirichlet Multinomial Distribution.

Prediction Intervals for Overdispersed Poisson Data and Their Application in Medical and Pre-Clinical Quality Control.

Estimation of mean using under-reported and overdispersed count data

Modified ridge estimator in the Bell regression model

Regression to the mean for overdispersed count data

Semi-parametric approach for modelling overdispersed count data with application to Industry 4.0

Modified Bivariate Poisson-Lindley Model: Properties and Applications in Soccer

One-misrecorded Poisson INAR(1) model via two random operators with application to crime and economics data

New ridge parameter estimators for the quasi-Poisson ridge regression model

Poisson-New Quadratic-Exponential Distribution

Bootstrapping generalized linear models to accommodate overdispersed count data

High-dimensional covariate-augmented overdispersed poisson factor model.

A new over-dispersed count model based on Poisson-Geometric convolution

Premium Linear-Exponential Mixture of Poisson Distribution

Multilevel modeling in single-case studies with zero-inflated and overdispersed count data.

A variable clustering approach for overdispersed high-dimensional count data using a copula-based mixture model

Poisson XRani Distribution: An Alternative Discrete Distribution for Overdispersed Count Data

A New Effective Jackknifing Estimator in the Negative Binomial Regression Model

A truncated mean-parameterized Conway-Maxwell-Poisson model for the analysis of Test match bowlers

A Family of Finite Mixture Distributions for Modelling Dispersion in Count Data