Predictive Distribution Modeling Using Transformation Forests

Torsten Hothorn,Achim Zeileis

doi:10.1080/10618600.2021.1872581

Abstract

Regression models for supervised learning problems with a continuous response are commonly understood as models for the conditional mean of the response given predictors. This notion is simple and therefore appealing for interpretation and visualization. Information about the whole underlying conditional distribution is, however, not available from these models. A more general understanding of regression models as models for conditional distributions allows much broader inference, for example, the computation of prediction intervals or probabilistic predictions for exceeding certain thresholds. Several random forest-type algorithms aim at estimating conditional distributions, most prominently quantile regression forests. We propose a novel approach based on a parametric family of distributions characterized by their transformation function. A dedicated novel “transformation tree” algorithm able to detect distributional changes is developed. Based on these transformation trees, we introduce “transformation forests” as an adaptive local likelihood estimator of conditional distribution functions. The resulting predictive distributions are fully parametric yet very general and allow inference procedures, such as likelihood-based variable importances, to be applied in a straightforward way. Supplemental files for this article are available online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Computational and Graphical Statistics	Publication Date: Mar 8, 2021
Citations: 20	License type: open-access

R Discovery Prime

R Discovery Prime

Predictive Distribution Modeling Using Transformation Forests

Abstract

Talk to us

Similar Papers

More From: Journal of Computational and Graphical Statistics

Lead the way for us

Similar Papers

Counterfactual distributions of wages via quantile regression with endogeneity
Elena Martinez-Sanchis ... Ilker Kandemir
Computational Statistics & Data Analysis | VOL. 56
Elena Martinez-Sanchis, et. al.Elena Martinez-Sanchis ... Ilker Kandemir
06 Mar 2012
Computational Statistics & Data Analysis | VOL. 56

Inference on Counterfactual Distributions
Victor Chernozhukov ... Blaise Melly
SSRN Electronic Journal | VOL. -
Victor Chernozhukov, et. al.Victor Chernozhukov ... Blaise Melly
01 Jan 2009
SSRN Electronic Journal | VOL. -

Adapting a classification rule to local and global shift when only unlabelled data are available
Vera Hofer
European Journal of Operational Research | VOL. 243
Vera HoferVera Hofer
21 Nov 2014
European Journal of Operational Research | VOL. 243

Asymptotically Minimax Adaptive Estimation. I: Upper Bounds. Optimally Adaptive Estimates
O V Lepskii
Theory of Probability & Its Applications | VOL. 36
O V LepskiiO V Lepskii
01 Jan 1992
Theory of Probability & Its Applications | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Predictive Distribution Modeling Using Transformation Forests

Abstract

Talk to us

Similar Papers

More From: Journal of Computational and Graphical Statistics