Abstract

Stata has several procedures that can be used in analyzing count-data regression models and, more specifically, in studying the behavior of the dependent variable, conditional on explanatory variables. Identifying overdispersion in countdata models is one of the most important procedures that allow researchers to correctly choose estimations such as Poisson or negative binomial, given the distribution of the dependent variable. The main purpose of this paper is to present a new command for the identification of overdispersion in the data as an alternative to the procedure presented by Cameron and Trivedi [5], since it directly identifies overdispersion in the data, without the need to previously estimate a specific type of count-data model. When estimating Poisson or negative binomial regression models in which the dependent variable is quantitative, with discrete and non-negative values, the new Stata package overdisp helps researchers to directly propose more consistent and adequate models. As a second contribution, we also present a simulation to show the consistency of the overdispersion test using the overdisp command. Findings show that, if the test indicates equidispersion in the data, there are consistent evidence that the distribution of the dependent variable is, in fact, Poisson. If, on the other hand, the test indicates overdispersion in the data, researchers should investigate more deeply whether the dependent variable actually exhibits better adherence to the Poisson-Gamma distribution or not.

Highlights

  • Many situations have as an outcome of interest a nonnegative integer, or a count, denoted by y, y ∈ N0 = 0, 1, 2

  • Following the test implemented in Stata through a sequence of four commands proposed by Cameron and Trivedi [5], we present the new package overdisp to directly identify overdispersion in Stata

  • We illustrate the application of the overdisp command using the two examples below, in which we show the use of the new command and the interpretation of its output in different settings

Read more

Summary

Introduction

The benchmark model for the analysis of integer count-data is the Poisson regression model, which restricts the variance of the data to be equal to the mean, conditional on explanatory variables [5, 6, 7]. Failures of this restriction can allow researchers to estimate parameters considering more general distributions, such as the negative binomial. Following the test implemented in Stata through a sequence of four commands proposed by Cameron and Trivedi [5], we present the new package overdisp to directly identify overdispersion in Stata.

Poisson regression models
Negative binomial regression models
Overdispersion test
The overdisp Stata command
Examples
Example 1
Example 2
Simulation study
Findings
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.