Influential Observations in Bayesian Regression Tree Models

M T Pratola,E I George,R E Mcculloch

doi:10.1080/10618600.2023.2210180

Abstract

Bayesian Classification and Regression Trees (BCART) and Bayesian Additive Regression Trees (BART) are popular Bayesian regression models widely applicable in modern regression problems. Their popularity is intimately tied to the ability to flexibly model complex responses depending on high-dimensional inputs while simultaneously being able to quantify uncertainties. This ability to quantify uncertainties is key, as it allows researchers to perform appropriate inferential analyses in settings that have generally been too difficult to handle using the Bayesian approach. However, surprisingly little work has been done to evaluate the sensitivity of these modern regression models to violations of modeling assumptions. In particular, we will consider influential observations, which one reasonably would imagine to be common—or at least a concern—in the big-data setting. In this article, we consider both the problem of detecting influential observations and adjusting predictions to not be unduly affected by such potentially problematic data. We consider three detection diagnostics for Bayesian tree models, one an analogue of Cook’s distance and the others taking the form of a divergence measure and a conditional predictive density metric, and then propose an importance sampling algorithm to re-weight previously sampled posterior draws so as to remove the effects of influential data in a computationally efficient manner. Finally, our methods are demonstrated on real-world data where blind application of the models can lead to poor predictions and inference. Supplementary materials for this article are available online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Influential Observations in Bayesian Regression Tree Models

Abstract

Talk to us

Similar Papers

More From: Journal of Computational and Graphical Statistics

Lead the way for us

Journal: Journal of Computational and Graphical Statistics	Publication Date: Jun 17, 2023
Citations: 1

Similar Papers

Detection of Left Ventricular Hypertrophy Using Bayesian Additive Regression Trees: The MESA (Multi‐Ethnic Study of Atherosclerosis)
-
Journal of the American Heart Association | VOL. 8
--
04 May 2019
Journal of the American Heart Association | VOL. 8

Estimation of causal effects of multiple treatments in observational studies with a binary outcome.
Liangyuan Hu ... Michael Lopez
Statistical Methods in Medical Research | VOL. 29
Liangyuan Hu, et. al.Liangyuan Hu ... Michael Lopez
25 May 2020
Statistical Methods in Medical Research | VOL. 29

Bayesian Additive Regression Tree Calibration of Complex High-Dimensional Computer Models
M T Pratola ... D M Higdon
Technometrics | VOL. 58
M T Pratola, et. al.M T Pratola ... D M Higdon
02 Apr 2016
Technometrics | VOL. 58

Decision making and uncertainty quantification for individualized treatments using Bayesian Additive Regression Trees.
Brent R Logan ... Robert E Mcculloch
Statistical Methods in Medical Research | VOL. 28
Brent R Logan, et. al.Brent R Logan ... Robert E Mcculloch
18 Dec 2017
Statistical Methods in Medical Research | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Influential Observations in Bayesian Regression Tree Models

Abstract

Talk to us

Similar Papers

More From: Journal of Computational and Graphical Statistics