A regression tree approach using mathematical programming

Lingjian Yang, Songsong Liu, Sophia Tsoka, Lazaros G Papageorgiou

doi:10.1016/j.eswa.2017.02.013

Abstract

Regression analysis is a machine learning approach that aims to accurately predict the value of continuous output variables from certain independent input variables, via automatic estimation of their latent relationship from data. Tree-based regression models are popular in the literature due to their flexibility in modelling higher-order non-linearity and their interpretability. Conventionally, regression tree models are trained in a two-stage procedure: recursive binary partitioning is employed to produce a tree structure, followed by a pruning process that removes insignificant leaves, with the possibility of assigning multivariate functions to terminal leaves to improve generalisation. This work introduces a novel methodology of node partitioning which, in a single optimisation model, simultaneously performs the two tasks of identifying the break-point of a binary split and assigning multivariate functions to both leaves, thus leading to an efficient regression tree model. Using six real-world benchmark problems, we demonstrate that the proposed method consistently outperforms a number of state-of-the-art regression tree models and methods based on other techniques, with an average improvement of 7–60% in the mean absolute error (MAE) of the predictions.
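The node-partitioning idea described above (choose a break-point and fit a function to each side of it) can be illustrated with a small sketch. This is not the authors' optimisation model: it swaps in exhaustive enumeration of candidate break-points on a single input feature, with an ordinary least-squares line in each leaf, and scores each candidate by MAE. The helper names `fit_line`, `abs_error` and `best_split`, and the `min_leaf` parameter, are hypothetical and chosen only for illustration.

```python
from statistics import mean


def fit_line(xs, ys):
    """Least-squares fit of y = a*x + b on one feature (hypothetical helper)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0:  # all x identical: fall back to a constant prediction
        return 0.0, my
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return a, my - a * mx


def abs_error(xs, ys, model):
    """Sum of absolute residuals of a fitted line on the given samples."""
    a, b = model
    return sum(abs(y - (a * x + b)) for x, y in zip(xs, ys))


def best_split(xs, ys, min_leaf=2):
    """Try every break-point; return (break_point, left_model, right_model, mae).

    Assumption: brute-force enumeration stands in for the single
    optimisation model described in the paper.
    """
    pairs = sorted(zip(xs, ys))
    sx = [p[0] for p in pairs]
    sy = [p[1] for p in pairs]
    best = None
    for i in range(min_leaf, len(sx) - min_leaf + 1):
        if sx[i - 1] == sx[i]:
            continue  # no threshold can separate equal x values
        left = fit_line(sx[:i], sy[:i])
        right = fit_line(sx[i:], sy[i:])
        total = abs_error(sx[:i], sy[:i], left) + abs_error(sx[i:], sy[i:], right)
        mae = total / len(sx)
        if best is None or mae < best[3]:
            best = ((sx[i - 1] + sx[i]) / 2, left, right, mae)
    return best


# Piecewise-linear toy data: y = 2x for x < 5, y = 20 - x otherwise.
xs = [float(x) for x in range(10)]
ys = [2 * x if x < 5 else 20 - x for x in xs]
break_point, left_model, right_model, mae = best_split(xs, ys)
```

On this toy data the enumeration should recover a break-point between 4 and 5 with a separate line on each side. Enumeration scales poorly with many features and samples, which is one reason a dedicated optimisation formulation may be preferred.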

Highlights

Regression analysis seeks to estimate the relationships between output variables and a set of independent input variables by automatically learning from a number of curated samples (Sen & Srivastava, 2012).
We have proposed a novel regression tree learning algorithm, named Mathematical Programming Tree (MPTree).
An optimisation model, OPLRA, recently published in the literature has been adopted to optimise the binary node splitting.

Summary

Introduction

Regression analysis seeks to estimate the relationships between output variables and a set of independent input variables by automatically learning from a number of curated samples (Sen & Srivastava, 2012). The primary goal of applying a regression analysis is usually to obtain precise predictions of the output variables for new samples. One would often also like to gain useful insights into the underlying relationship between the input and output variables, in which case the interpretability of a regression method is of great interest. Regression trees are a type of machine learning tool that combines good prediction accuracy with easy interpretation, and they have received extensive attention in the literature. A regression model is fitted to each terminal node to obtain the predicted values of the output variables for new samples.
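As a rough illustration of how a tree with a regression model in each terminal node produces a prediction, the sketch below routes a sample through binary splits and then applies the linear model stored at the leaf it reaches. The `Node` class, its field names, the `predict` function and the example tree are all hypothetical; they are not taken from the paper.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Node:
    # Internal node: index of the split feature and its break-point.
    feature: Optional[int] = None
    break_point: Optional[float] = None
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    # Terminal node: coefficients and intercept of a linear model.
    coef: Optional[Sequence[float]] = None
    intercept: float = 0.0


def predict(node, x):
    """Descend to a terminal node, then evaluate its linear model on x."""
    while node.coef is None:  # still at an internal node
        if x[node.feature] <= node.break_point:
            node = node.left
        else:
            node = node.right
    return node.intercept + sum(c * v for c, v in zip(node.coef, x))


# Hypothetical two-leaf tree: split on feature 0 at 4.5.
tree = Node(
    feature=0,
    break_point=4.5,
    left=Node(coef=[2.0, 0.0], intercept=0.0),
    right=Node(coef=[-1.0, 0.5], intercept=20.0),
)

left_prediction = predict(tree, [3.0, 1.0])   # uses the left leaf's model
right_prediction = predict(tree, [6.0, 2.0])  # uses the right leaf's model
```

Because each prediction is traced through a handful of explicit threshold tests followed by one linear formula, the path to any prediction can be read off directly, which is the interpretability property the paragraph above refers to.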


Journal: Expert Systems with Applications	Publication Date: Feb 9, 2017
Citations: 93	License type: cc-by

Similar Papers

Forecasting the power generation at renewable power plants in Sri Lanka using regression trees
Jeevani Jayasinghe ... Upaka Rathnayake
Results in Engineering | VOL. 22
08 Apr 2024

Towards Operational Automatic Flood Detection Using EOS/MODIS Data
Donglian Sun ... Rui Zhang
Photogrammetric Engineering & Remote Sensing | VOL. 78
01 Jun 2012

Important variable assessment and electricity price forecasting based on regression tree models: classification and regression trees, Bagging and Random Forests
Camino González ... José Mira‐Mcwilliams
IET Generation, Transmission & Distribution | VOL. 9
01 Aug 2015

Machine learning approaches for estimation of compressive strength of concrete
Marijana Hadzima-Nyarko ... Senlin Zhu
The European Physical Journal Plus | VOL. 135
01 Aug 2020
