Robust boosting for regression problems

Xiaomeng Ju,Matías Salibián-Barrera

doi:10.1016/j.csda.2020.107065

Abstract

Gradient boosting algorithms construct a regression predictor using a linear combination of “base learners”. Boosting also offers an approach to obtaining robust non-parametric regression estimators that are scalable to applications with many explanatory variables. The robust boosting algorithm is based on a two-stage approach, similar to what is done for robust linear regression: it first minimizes a robust residual scale estimator, and then improves it by optimizing a bounded loss function. Unlike previous robust boosting proposals this approach does not require computing an ad hoc residual scale estimator in each boosting iteration. Since the loss functions involved in this robust boosting algorithm are typically non-convex, a reliable initialization step is required, such as an L1 regression tree, which is also fast to compute. A robust variable importance measure can also be calculated via a permutation procedure. Thorough simulation studies and several data analyses show that, when no atypical observations are present, the robust boosting approach works as well as the standard gradient boosting with a squared loss. Furthermore, when the data contain outliers, the robust boosting estimator outperforms the alternatives in terms of prediction error and variable selection accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust boosting for regression problems

Abstract

Talk to us

Similar Papers

More From: Computational Statistics & Data Analysis

Lead the way for us

Journal: Computational Statistics & Data Analysis	Publication Date: Aug 18, 2020
Citations: 15

Similar Papers

Editor's evaluation: Robust and Efficient Assessment of Potency (REAP) as a quantitative tool for dose-response curve estimation
Philip Boonstra
-
Philip BoonstraPhilip Boonstra
09 May 2022
09 May 2022

KNN robustification equivariant nonparametric regression estimators for functional ergodic data
Guenani Somi̇a ... Fetitah Omar
Hacettepe Journal of Mathematics and Statistics | VOL. 52
Guenani Somi̇a, et. al.Guenani Somi̇a ... Fetitah Omar
31 Mar 2023
Hacettepe Journal of Mathematics and Statistics | VOL. 52

General Bayesian Loss Function Selection and the use of Improper Models
Jack Jewson ... David Rossell
Journal of the Royal Statistical Society Series B: Statistical Methodology | VOL. 84
Jack Jewson, et. al.Jack Jewson ... David Rossell
25 Oct 2022
Journal of the Royal Statistical Society Series B: Statistical Methodology | VOL. 84

Robust ridge regression for estimating the effects of correlated gene expressions on phenotypic traits
Hirofumi Michimae ... Takeshi Emura
Environmental and Ecological Statistics | VOL. 27
Hirofumi Michimae, et. al.Hirofumi Michimae ... Takeshi Emura
26 Dec 2019
Environmental and Ecological Statistics | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust boosting for regression problems

Abstract

Talk to us

Similar Papers

More From: Computational Statistics &amp; Data Analysis

More From: Computational Statistics & Data Analysis