An Extended GFfit Statistic Defined on Orthogonal Components of Pearson's Chi-Square.

Mark Reiser, Junfei Zhu, Silvia Cagnone

doi:10.1007/s11336-022-09866-6

Abstract

The Pearson and likelihood ratio statistics are commonly used to test goodness of fit for models applied to data from a multinomial distribution. The goodness-of-fit test based on Pearson's Chi-squared statistic is sometimes considered to be a global test that gives little guidance to the source of poor fit when the null hypothesis is rejected, and it has also been recognized that the global test can often be outperformed in terms of power by focused or directional tests. For the cross-classification of a large number of manifest variables, the GFfit statistic focused on second-order marginals for variable pairs i,j has been proposed as a diagnostic to aid in finding the source of lack of fit after the model has been rejected based on a more global test. When data are from a table formed by the cross-classification of a large number of variables, the common global statistics may also have low power and inaccurate Type I error level due to sparseness in the cells of the table. The sparseness problem is rarely encountered with the GFfit statistic because it is focused on the lower-order marginals. In this paper, a new and extended version of the GFfit statistic is proposed by decomposing the Pearson statistic from the full table into orthogonal components defined on marginal distributions and then defining the new version, [Formula: see text], as a partial sum of these orthogonal components. While the emphasis is on lower-order marginals, the new version of [Formula: see text] is also extended to higher-order tables so that the [Formula: see text] statistics sum to the Pearson statistic. 
As orthogonal components of the Pearson [Formula: see text] statistic, [Formula: see text] statistics have advantages over other lack-of-fit diagnostics that are currently available for cross-classified tables. As simulation results will show, they generally have higher power to detect lack of fit while maintaining good Type I error control even when the joint frequencies are very sparse. Theoretical results will establish that [Formula: see text] statistics have known degrees of freedom and are asymptotically independent with known joint distribution, a property that facilitates less conservative control of the false discovery rate (FDR) or familywise error rate (FWER) in a high-dimensional table, which would produce a large number of bivariate lack-of-fit diagnostics. The [Formula: see text] statistics are also computationally stable. The extended [Formula: see text] statistic can be applied to a variety of models for cross-classified tables. An application of the new GFfit statistic as a diagnostic for a latent variable model is presented.
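The decomposition idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a simple null hypothesis with fully specified cell probabilities, three binary variables, and made-up counts, and it does not reproduce the paper's treatment of models with estimated parameters. The names `H`, `gamma`, and `orthogonal_components` are hypothetical. Marginal residuals are orthogonalized sequentially through a Cholesky factor, so the squared components sum to Pearson's chi-square and can be grouped into partial sums by marginal order.

```python
import itertools
import numpy as np

def orthogonal_components(counts, pi, H):
    """Sequential orthogonal components of Pearson's X^2 (simple null only)."""
    counts = np.asarray(counts, dtype=float)
    n = counts.sum()
    p_hat = counts / n
    # Covariance of sqrt(n) * (p_hat - pi) under the multinomial null.
    omega = np.diag(pi) - np.outer(pi, pi)
    # Residuals on the marginals selected by the columns of H.
    e = H.T @ (p_hat - pi)
    sigma = H.T @ omega @ H
    # The Cholesky factor orthogonalizes the residuals in column order.
    L = np.linalg.cholesky(sigma)
    return np.sqrt(n) * np.linalg.solve(L, e)

# All 8 cells of a 2 x 2 x 2 table, one row per response pattern.
cells = np.array(list(itertools.product([0, 1], repeat=3)))

# Columns of H: indicators for first-, second-, and third-order marginals.
columns, orders = [], []
for order in (1, 2, 3):
    for subset in itertools.combinations(range(3), order):
        columns.append(cells[:, list(subset)].prod(axis=1))
        orders.append(order)
H = np.column_stack(columns).astype(float)
orders = np.array(orders)

pi = np.full(8, 1 / 8)                              # assumed null probabilities
counts = np.array([20, 9, 11, 14, 8, 13, 10, 15])   # made-up illustrative counts

gamma = orthogonal_components(counts, pi, H)
n = counts.sum()
pearson = ((counts - n * pi) ** 2 / (n * pi)).sum()

# Partial sums of squared components, grouped by marginal order.
partial = {k: float((gamma[orders == k] ** 2).sum()) for k in (1, 2, 3)}
print(partial, sum(partial.values()), pearson)
```

In this sketch the individual components depend on the order of the columns of `H`, while their total does not. The partial sum over the second-order columns plays the role of a statistic focused on bivariate marginals.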

Journal: Psychometrika
Publication Date: Jun 3, 2022
Citations: 2