A Default Prior Distribution for Logistic and Other Regression Models

Andrew Gelman,Aleks Jakulin,Maria Grazia Pittau,Yu-Sung Su

doi:10.2139/ssrn.1010421

Abstract

We propose a new prior distribution for classical (non-hierarchical) logistic regression models, constructed by first scaling all nonbinary variables to have mean 0 and standard deviation 0.5, and then placing independent Student-t prior distributions on the coefficients. As a default choice, we recommend the Cauchy distribution with center 0 and scale 2.5, which in the simplest setting is a longer-tailed version of the distribution attained by assuming one-half additional success and one-half additional failure in a logistic regression. We implement a procedure to fit generalized linear models in R with this prior distribution by incorporating an approximate EM algorithm into the usual iteratively weighted least squares. We illustrate with several examples, including a series of logistic regressions predicting voting preferences, an imputation model for a public health data set, and a hierarchical logistic regression in epidemiology. We recommend this default prior distribution for routine applied use. It has the advantage of always giving answers, even when there is complete separation in logistic regression (a common problem, even when the sample size is large and the number of predictors is small) and also automatically applying more shrinkage to higher-order interactions. This can be useful in routine data analysis as well as in automated procedures such as chained equations for missing-data imputation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Default Prior Distribution for Logistic and Other Regression Models

Abstract

Talk to us

Similar Papers

More From: SSRN Electronic Journal

Lead the way for us

Journal: SSRN Electronic Journal	Publication Date: Sep 11, 2007
Citations: 49

Similar Papers

A weakly informative default prior distribution for logistic and other regression models
Andrew Gelman ... Yu-Sung Su
The Annals of Applied Statistics | VOL. 2
Andrew Gelman, et. al.Andrew Gelman ... Yu-Sung Su
01 Dec 2008
The Annals of Applied Statistics | VOL. 2

Separation Issues and Possible Solutions: Part I – Systematic Literature Review on Logistic Models ‐ Part II – Comparison of different methods for separation in logistic regression
C Ensoy ... Tw Rakhmawati
EFSA Supporting Publications | VOL. 12
C Ensoy, et. al.C Ensoy ... Tw Rakhmawati
01 Sep 2015
EFSA Supporting Publications | VOL. 12

A statistical note on analyzing and interpreting individual-level epidemiological data.
Daisuke Yoneoka ... Eiko Saito
Journal of epidemiology | VOL. 25
Daisuke Yoneoka, et. al.Daisuke Yoneoka ... Eiko Saito
01 Jan 2015
Journal of epidemiology | VOL. 25

Logistic regression modeling and the number of events per variable: selection bias dominates
Ewout W Steyerberg ... Frank E Harrell
Journal of Clinical Epidemiology | VOL. 64
Ewout W Steyerberg, et. al.Ewout W Steyerberg ... Frank E Harrell
25 Oct 2011
Journal of Clinical Epidemiology | VOL. 64

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Default Prior Distribution for Logistic and Other Regression Models

Abstract

Talk to us

Similar Papers

More From: SSRN Electronic Journal