Abstract
Count data arising in various fields of applications are often under-reported. Ignoring undercount naturally leads to biased estimators and inaccurate confidence intervals. Further, overdispersion in count data may arise due to inherent heterogeneity of the data. Negative Binomial distribution is a viable candidate for modeling overdispersed count data. However, in presence of undercount the negative binomial model needs to be adjusted. In this paper, we shall develop likelihood-based methodologies for estimation of mean using validation data after accounting for underreporting and overdispersion. The impact of ignoring undercount on the coverage and length of the confidence intervals is investigated using extensive numerical studies. The study is supplemented with a real-life data analysis.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have