Model selection criteria for dual-inflated data

Ting Hsiang Lin,Min-Hsiao Tsai

doi:10.1080/00949655.2015.1118102

Abstract

ABSTRACTInflated data are prevalent in many situations and a variety of inflated models with extensions have been derived to fit data with excessive counts of some particular responses. The family of information criteria (IC) has been used to compare the fit of models for selection purposes. Yet despite the common use in statistical applications, there are not too many studies evaluating the performance of IC in inflated models. In this study, we studied the performance of IC for data with dual-inflated data. The new zero- and K-inflated Poisson (ZKIP) regression model and conventional inflated models including Poisson regression and zero-inflated Poisson (ZIP) regression were fitted for dual-inflated data and the performance of IC were compared. The effect of sample sizes and the proportions of inflated observations towards selection performance were also examined. The results suggest that the Bayesian information criterion (BIC) and consistent Akaike information criterion (CAIC) are more accurate than the Akaike information criterion (AIC) in terms of model selection when the true model is simple (i.e. Poisson regression (POI)). For more complex models, such as ZIP and ZKIP, the AIC was consistently better than the BIC and CAIC, although it did not reach high levels of accuracy when sample size and the proportion of zero observations were small. The AIC tended to over-fit the data for the POI, whereas the BIC and CAIC tended to under-parameterize the data for ZIP and ZKIP. Therefore, it is desirable to study other model selection criteria for dual-inflated data with small sample size.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Model selection criteria for dual-inflated data

Abstract

Talk to us

Similar Papers

More From: Journal of Statistical Computation and Simulation

Lead the way for us

Journal: Journal of Statistical Computation and Simulation	Publication Date: Nov 30, 2015
Citations: 3

Similar Papers

Modelling Service Times Using Some Beta-Based Compound Distribution
John, O T ... Singla, S
African Journal of Mathematics and Statistics Studies | VOL. 7
John, O T, et. al.John, O T ... Singla, S
28 Oct 2024
African Journal of Mathematics and Statistics Studies | VOL. 7

Model Selection for Multilevel Mixture Rasch Models
Sedat Sen ... Seock-Ho Kim
Applied Psychological Measurement | VOL. 43
Sedat Sen, et. al.Sedat Sen ... Seock-Ho Kim
07 Jun 2018
Applied Psychological Measurement | VOL. 43

An Evaluation of Information Criteria Use for Correct Cross-Classified Random Effects Model Selection
S Natasha Beretvas ... Daniel L Murphy
The Journal of Experimental Education | VOL. 81
S Natasha Beretvas, et. al.S Natasha Beretvas ... Daniel L Murphy
02 Oct 2013
The Journal of Experimental Education | VOL. 81

The Effect of Sample Size on the Efficiency of Count Data Models: Application to Marriage Data
Ntebogang Dinah Moroke ... Volition Tlhalitshi Montshiwa
Journal of Economics and Behavioral Studies | VOL. 9
Ntebogang Dinah Moroke, et. al.Ntebogang Dinah Moroke ... Volition Tlhalitshi Montshiwa
20 Jul 2017
Journal of Economics and Behavioral Studies | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Model selection criteria for dual-inflated data

Abstract

Talk to us

Similar Papers

More From: Journal of Statistical Computation and Simulation