Decision Tree Regression with Residual Outlier Detection

Swee Chuan Tan

doi:10.47852/bonviewjdsis42023861

Abstract

This paper introduces a framework for identifying outliers in the predictions of regression tree models. Existing robust regression approaches tend to focus on the construction stage, building regression models that are less sensitive to outliers. In contrast, our approach identifies outliers during the prediction stage. It begins by building a regression tree from a training dataset. Predictions that deviate significantly from the mean within each terminal node are automatically labeled as outliers. We show how the labeled data can be explored to better understand the characteristics of the outliers, and we identify the situations in which this exploration may not work well. Further, we use the outlier labels and the training data to construct an anomaly detector. Our results show that the proposed method can effectively detect outliers in datasets, and that removing them improves data quality. We also discuss insights into the method's effectiveness and its potential caveats.
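
The labeling step described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the paper's implementation: the abstract does not say how "significantly deviating" is measured, so the sketch assumes a cutoff of k standard deviations from the terminal-node mean, and it reads the deviation as applying to the target values that fall in each leaf. The toy single-feature tree, the function names (`best_split`, `leaf_ids`, `flag_outliers`), the parameters (`depth`, `min_leaf`, `k`), and the data are all hypothetical.

```python
from statistics import mean, pstdev


def best_split(xs, ys, min_leaf):
    """Return the threshold on x that minimises total squared error, or None."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    best_sse, best_thr = None, None
    for k in range(min_leaf, len(xs) - min_leaf + 1):
        lo, hi = order[:k], order[k:]
        if xs[lo[-1]] == xs[hi[0]]:
            continue  # cannot split between identical feature values
        sse = 0.0
        for side in (lo, hi):
            m = mean(ys[i] for i in side)
            sse += sum((ys[i] - m) ** 2 for i in side)
        if best_sse is None or sse < best_sse:
            best_sse, best_thr = sse, (xs[lo[-1]] + xs[hi[0]]) / 2
    return best_thr


def leaf_ids(xs, ys, depth, min_leaf, idx=None, path="root"):
    """Grow a toy regression tree and map each row index to its terminal node."""
    idx = list(range(len(xs))) if idx is None else idx
    thr = None
    if depth > 0:
        thr = best_split([xs[i] for i in idx], [ys[i] for i in idx], min_leaf)
    if thr is None:
        return {i: path for i in idx}
    left = [i for i in idx if xs[i] <= thr]
    right = [i for i in idx if xs[i] > thr]
    out = leaf_ids(xs, ys, depth - 1, min_leaf, left, path + "/L")
    out.update(leaf_ids(xs, ys, depth - 1, min_leaf, right, path + "/R"))
    return out


def flag_outliers(ys, leaves, k=2.0):
    """Flag rows whose target lies more than k std devs from their leaf mean.

    The k-sigma rule is an assumption made for this sketch.
    """
    groups = {}
    for i, leaf in leaves.items():
        groups.setdefault(leaf, []).append(i)
    flags = [False] * len(ys)
    for members in groups.values():
        vals = [ys[i] for i in members]
        m, s = mean(vals), pstdev(vals)
        for i in members:
            flags[i] = s > 0 and abs(ys[i] - m) > k * s
    return flags


# Hypothetical data: two regimes of y, with one unusual value at x = 5.
xs = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12]
ys = [10, 11, 9, 10, 30, 10, 50, 51, 49, 50, 52, 50]

leaves = leaf_ids(xs, ys, depth=1, min_leaf=3)
flags = flag_outliers(ys, leaves, k=2.0)
print([i for i, f in enumerate(flags) if f])
```

The resulting boolean flags play the role of the outlier labels mentioned above; in the framework described, such labels together with the training data would then be used to train a separate anomaly detector, a stage this sketch does not cover.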

Similar Papers

Designing of a 5G Multiband Antenna Using Decision Tree and Random Forest Regression Models
Shilpa Pavithran ... Asha J
26 Aug 2021

Factors related to self-rated health of older adults in rural China: A study based on decision tree and logistic regression model
Min Zhang ... Song Liu
Frontiers in Public Health | VOL. 10
30 Nov 2022

Simulation of Subgouge Sand Deformations Using Robust Machine Learning Algorithms
Hodjat Shiri ... Hamed Azimi
25 Apr 2022

Predicting the Effect of Violent Gameplaying to Violent Behavior Intention among Females using Tree Regression and AdaBoost Tree Regression
Maniah Maniah ... Bachtiar Saleh Abbas
Journal of Games, Game Art, and Gamification | VOL. 4
19 Oct 2021
