Bayesian model selection in the M-open setting: Approximate posterior inference and subsampling for efficient large-scale leave-one-out cross-validation via the difference estimator

Riko Kelter

doi:10.1016/j.jmp.2020.102474

Abstract

Comparison of competing statistical models is an essential part of psychological research. From a Bayesian perspective, various approaches to model comparison and selection have been proposed in the literature. However, the applicability of these approaches depends on the assumptions made about the model space M. Moreover, traditional methods such as leave-one-out cross-validation (LOO-CV), which estimate the expected log predictive density (ELPD) of a model to investigate how well it generalises out-of-sample, quickly become computationally inefficient as the sample size grows. Here, a tutorial on Pareto-smoothed importance sampling leave-one-out cross-validation (PSIS-LOO-CV), which is computationally more efficient, is provided. It is shown how Bayesian model selection can be scaled efficiently to big data via PSIS-LOO-CV in combination with approximate posterior inference and probability-proportional-to-size subsampling. First, several model views and the Bayesian model comparison methods available in each are discussed. The Bayesian logistic regression model is then used as a running example to show how to apply the method in practice and to demonstrate that it provides ELPD estimates as accurate as those from LOO-CV or information criteria. Subsequently, the power and exponential law models relating reaction times to practice are used to demonstrate the approach with more complex models. Guidance is provided on how to compare competing models based on the ELPD estimates and on how to conduct posterior predictive checks to safeguard against overconfidence in one of the models under consideration. The intended audience is researchers who practise mathematical modelling and model comparison, possibly with large datasets, and who are well acquainted with Bayesian statistics.
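The two ideas named in the abstract, importance-sampling LOO and the difference estimator, can be illustrated with a minimal Python sketch. It is not the paper's implementation and rests on several assumptions made only for illustration: a toy normal model with known unit variance, plain (unsmoothed) importance sampling in place of Pareto smoothing, the log-likelihood at the posterior mean as the cheap approximation, and simple random subsampling without replacement rather than probability-proportional-to-size sampling. All function and variable names are hypothetical.

```python
import numpy as np


def is_loo_pointwise(log_lik):
    """Plain importance-sampling LOO estimate of each pointwise ELPD term.

    log_lik has shape (S, n): log p(y_i | theta_s) for S posterior draws.
    Uses elpd_i ~= -log(mean_s exp(-log_lik[s, i])); no Pareto smoothing.
    """
    neg = -np.asarray(log_lik, dtype=float)
    mx = neg.max(axis=0)  # subtract the column maximum for numerical stability
    log_mean_exp = mx + np.log(np.mean(np.exp(neg - mx), axis=0))
    return -log_mean_exp


def difference_estimate(approx_all, exact_sub, sub_idx):
    """Difference estimator of the total ELPD under simple random sampling.

    approx_all: cheap approximation of every pointwise term (length n).
    exact_sub:  accurate pointwise terms for the subsample only (length m).
    sub_idx:    indices of the subsample within 0..n-1.
    """
    approx_all = np.asarray(approx_all, dtype=float)
    n, m = approx_all.size, len(sub_idx)
    correction = np.asarray(exact_sub, dtype=float) - approx_all[sub_idx]
    return approx_all.sum() + n / m * correction.sum()


# Toy data (assumption): y_i ~ Normal(mu, 1), flat prior on mu.
rng = np.random.default_rng(0)
n, S, m = 2000, 400, 200
y = rng.normal(0.5, 1.0, size=n)
mu_draws = rng.normal(y.mean(), 1.0 / np.sqrt(n), size=S)  # posterior draws


def normal_log_lik(y, mu):
    """Log density of Normal(mu, 1) evaluated at y."""
    return -0.5 * np.log(2.0 * np.pi) - 0.5 * (y - mu) ** 2


# Cheap approximation for all n points: log-likelihood at the posterior mean.
approx = normal_log_lik(y, mu_draws.mean())

# Accurate LOO terms are computed only for a random subsample of size m.
sub_idx = rng.choice(n, size=m, replace=False)
log_lik_sub = normal_log_lik(y[None, sub_idx], mu_draws[:, None])
elpd_sub = difference_estimate(approx, is_loo_pointwise(log_lik_sub), sub_idx)

# Reference value using all n points (affordable only in this toy example).
log_lik_full = normal_log_lik(y[None, :], mu_draws[:, None])
elpd_full = is_loo_pointwise(log_lik_full).sum()

print("subsampled ELPD estimate:", elpd_sub)
print("full-data ELPD estimate: ", elpd_full)
```

The design point is that the expensive per-observation computation runs for only m of the n observations, while the cheap approximation carries most of the total and the subsample corrects its error. In this sketch the estimator reproduces the full-data sum exactly when the subsample contains every observation.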

Journal: Journal of Mathematical Psychology
Publication Date: Dec 11, 2020
Citations: 12
