Abstract

Multiple counts may occur in each cell of an a × b two-way layout (balanced or unbalanced) of two fixed factors A and B. Standard log-linear model analysis based on a Poisson distribution assumption of the cell counts is not applicable here, because of the unbalanced nature of the table or because the Poisson distribution assumption is not valid. We develop C(α) tests for interaction and main effects assuming data to be Poisson distributed and also assuming that data within the cells have extra (over/under) dispersion beyond that explained by a Poisson distribution. For this we consider an extended negative binominal distribution and a semiparametric model using the quasi-likelihood. We show that in all situations the C(α) tests for interaction are of very simple forms. For C(α) tests for the main effect in presence of no interaction, such simplification is possible only under certain conditions. A score test for detecting extra dispersion in presence of interaction is also obtained and is of simple form. This test is to be used before testing for interaction. If evidence of extra dispersion is found, then the score tests for testing for interaction developed under the assumption of extra-dispersed distributions is used; otherwise, the score test developed under the assumption of a Poisson distribution is used. Further, maximum likelihood (ML) and quasi-likelihood (QL) estimates under different relevant null hypotheses are obtained. For testing for interaction under the assumption of Poisson-distributed data, we show that to obtain ML estimates of a + b - 1 parameters, we need to solve only min(a - 1, b - 1) equations simultaneously. For example, for a 2 × 10 table we need to solve only one equation iteratively. When data are extra dispersed, we need to solve a + b equations simultaneously to obtain ML or QL estimates of the a + b parameters. For testing for a main effect in the absence of interaction, closed-form estimates exist when data are Poisson distributed. However, when data follow a negative binomial or a semiparametric model based on the knowledge of only the first two moments (quasi-likelihood), only one equation involving the dispersion parameter is to be solved iteratively. Some simulations are performed, and two real datasets are analyzed for illustrative purposes.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.