Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test.

Giovanni Nattino,Michael L Pennell,Stanley Lemeshow

doi:10.1111/biom.13249

Abstract

Evaluating the goodness of fit of logistic regression models is crucial to ensure the accuracy of the estimated probabilities. Unfortunately, such evaluation is problematic in large samples. Because the power of traditional goodness of fit tests increases with the sample size, practically irrelevant discrepancies between estimated and true probabilities are increasingly likely to cause the rejection of the hypothesis of perfect fit in larger and larger samples. This phenomenon has been widely documented for popular goodness of fit tests, such as the Hosmer-Lemeshow test. To address this limitation, we propose a modification of the Hosmer-Lemeshow approach. By standardizing the noncentrality parameter that characterizes the alternative distribution of the Hosmer-Lemeshow statistic, we introduce a parameter that measures the goodness of fit of a model but does not depend on the sample size. We provide the methodology to estimate this parameter and construct confidence intervals for it. Finally, we propose a formal statistical test to rigorously assess whether the fit of a model, albeit not perfect, is acceptable for practical purposes. The proposed method is compared in a simulation study with a competing modification of the Hosmer-Lemeshow test, based on repeated subsampling. We provide a step-by-step illustration of our method using a model for postneonatal mortality developed in a large cohort of more than 300000 observations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test.

Abstract

Talk to us

Similar Papers

More From: Biometrics

Lead the way for us

Journal: Biometrics	Publication Date: Apr 6, 2020
Citations: 102

Similar Papers

Rejoinder to "Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test".
Giovanni Nattino ... Stanley Lemeshow
Biometrics | VOL. 76
Giovanni Nattino, et. al.Giovanni Nattino ... Stanley Lemeshow
06 Apr 2020
Biometrics | VOL. 76

A simple test procedure in standardizing the power of Hosmer–Lemeshow test in large data sets
Xin Lai ... Liu Liu
Journal of Statistical Computation and Simulation | VOL. 88
Xin Lai, et. al.Xin Lai ... Liu Liu
26 Apr 2018
Journal of Statistical Computation and Simulation | VOL. 88

A cautionary note about assessing the fit of logistic regression models
Joseph G Pigeon ... Joseph F Heyse
Journal of Applied Statistics | VOL. 26
Joseph G Pigeon, et. al.Joseph G Pigeon ... Joseph F Heyse
01 Sep 1999
Journal of Applied Statistics | VOL. 26

A modified Hosmer–Lemeshow test for large data sets
Wei Yu ... Lixing Zhu
Communications in Statistics - Theory and Methods | VOL. 46
Wei Yu, et. al.Wei Yu ... Lixing Zhu
24 Aug 2017
Communications in Statistics - Theory and Methods | VOL. 46

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test.

Abstract

Talk to us

Similar Papers

More From: Biometrics