Fast linear model trees by PILOT

Jakob Raymaekers, Peter J Rousseeuw, Tim Verdonck, Ruicong Yao

doi:10.1007/s10994-024-06590-3

Abstract

Linear model trees are regression trees that incorporate linear models in the leaf nodes. This preserves the intuitive interpretation of decision trees and at the same time enables them to better capture linear relationships, which is hard for standard decision trees. However, most existing methods for fitting linear model trees are time-consuming and therefore do not scale to large data sets. In addition, they are more prone to overfitting and extrapolation issues than standard regression trees. In this paper we introduce PILOT, a new algorithm for linear model trees that is fast, regularized, stable and interpretable. PILOT trains in a greedy fashion like classic regression trees, but incorporates an L2 boosting approach and a model selection rule for fitting linear models in the nodes. The abbreviation PILOT stands for PIecewise Linear Organic Tree, where ‘organic’ refers to the fact that no pruning is carried out. PILOT has the same low time and space complexity as CART without its pruning. An empirical study indicates that PILOT tends to outperform standard decision trees and other linear model trees on a variety of data sets. Moreover, we prove its consistency in an additive model setting under weak assumptions. When the data are generated by a linear model, the convergence rate is polynomial.
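
To make the notion of a linear model tree concrete, the sketch below fits a tree with a single greedy split on one feature and compares constant leaves (as in a classic regression tree) with linear leaves. It is a generic illustration only and is not the PILOT algorithm: the L2 boosting step and the model selection rule mentioned in the abstract are not shown. The function names (`fit_leaf`, `fit_stump`, `predict`), the `min_leaf` parameter, and the synthetic data are all assumptions made for this example.

```python
import numpy as np

def fit_leaf(x, y, linear):
    """Least-squares fit of y on an intercept (and on x if linear=True).

    Returns (intercept, slope, sum of squared errors); slope is 0 for a constant leaf.
    """
    cols = [np.ones(len(x))]
    if linear:
        cols.append(np.asarray(x, dtype=float))
    A = np.column_stack(cols)
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    slope = float(coef[1]) if linear else 0.0
    return float(coef[0]), slope, float(resid @ resid)

def fit_stump(x, y, linear=True, min_leaf=5):
    """Greedy search for the single split point that minimizes the total SSE."""
    order = np.argsort(x)
    xs, ys = x[order], y[order]
    best = None
    for i in range(min_leaf, len(xs) - min_leaf + 1):
        if xs[i - 1] == xs[i]:
            continue  # cannot split between identical feature values
        left = fit_leaf(xs[:i], ys[:i], linear)
        right = fit_leaf(xs[i:], ys[i:], linear)
        sse = left[2] + right[2]
        if best is None or sse < best["sse"]:
            best = {
                "threshold": (xs[i - 1] + xs[i]) / 2,
                "left": left[:2],
                "right": right[:2],
                "sse": sse,
            }
    return best

def predict(stump, x):
    """Evaluate the leaf model on whichever side of the threshold each x falls."""
    a_l, b_l = stump["left"]
    a_r, b_r = stump["right"]
    return np.where(x <= stump["threshold"], a_l + b_l * x, a_r + b_r * x)

# Synthetic, noise-free piecewise linear data with a kink at x = 0 (assumed for illustration).
x = np.linspace(-1.0, 1.0, 41)
y = np.where(x <= 0, 2 * x, -x)

linear_stump = fit_stump(x, y, linear=True)
constant_stump = fit_stump(x, y, linear=False)
print("linear leaves   SSE:", linear_stump["sse"])
print("constant leaves SSE:", constant_stump["sse"])
```

On data like this, one split with linear leaves can reproduce the target, whereas constant leaves would need many more splits to approximate each slope. That gap is the motivation the abstract gives for placing linear models in the leaf nodes.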

Journal: Machine Learning
Publication Date: Jul 8, 2024
License type: CC BY 4.0

