Meta-Tree Random Forest: Probabilistic Data-Generative Model and Bayes Optimal Prediction.

Nao Dobashi, Toshiyasu Matsushima, Shota Saito, Yuta Nakahara

doi:10.3390/e23060768

Open Access

https://doi.org/10.3390/e23060768

Journal: Entropy (Basel, Switzerland)
Publication Date: Jun 18, 2021
Citations: 6
License type: CC BY 4.0

Affiliation: Waseda University, Gunma University

Abstract

This paper addresses the problem of predicting a new targeting variable corresponding to a new explanatory variable, given a training dataset. To predict the targeting variable, we consider a model tree, which represents the conditional probabilistic structure of the targeting variable given the explanatory variable, and discuss the statistical optimality of prediction based on the Bayes decision theory. The optimal prediction is obtained by weighting all the model trees in the model tree candidate set, a set of model trees that is assumed to include the true model tree. Because the number of model trees in the candidate set increases exponentially with the maximum depth of the model trees, so does the computational complexity of weighting them. To solve this issue, we introduce the notion of a meta-tree and propose an algorithm called MTRF (Meta-Tree Random Forest) that uses multiple meta-trees. Theoretical and experimental analyses show the superiority of MTRF over previous decision tree-based algorithms.
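To make the growth of the candidate set concrete, the following is a minimal counting sketch, not taken from the paper. It assumes binary splits and one fixed feature per node, so that every model tree is a pruned subtree of a single complete binary tree; the function name `count_model_trees` is hypothetical.

```python
def count_model_trees(max_depth):
    """Count pruned subtrees of a complete binary tree with the given maximum depth.

    Assumption (illustrative only): each node is either a leaf, or an inner node
    whose two children each root any pruned subtree one level shallower.
    """
    n = 1  # depth 0: only the single-leaf tree
    for _ in range(max_depth):
        n = n * n + 1  # leaf (1 way) or inner node (n * n ways)
    return n


for d in range(6):
    print(d, count_model_trees(d))
```

Under these assumptions the count roughly squares with each additional level (1, 2, 5, 26, 677, ...), which is why weighting every model tree explicitly quickly becomes impractical.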

Highlights

Various studies in pattern recognition deal with the problem of predicting a targeting variable $y_{n+1}$ corresponding to an explanatory variable $x_{n+1}$, given pairs of explanatory and targeting variables $\{(x_i, y_i)\}_{i=1}^{n}$.
Under the assumption that the true model tree is in the restricted model tree candidate set represented by a meta-tree, Reference [13] proposed the optimal prediction based on the Bayes decision theory.
If the true model tree is included in the model tree candidate set, the optimal prediction based on the Bayes decision theory can be calculated.

Summary

Introduction

Various studies in pattern recognition deal with the problem of predicting a targeting variable $y_{n+1}$ corresponding to an explanatory variable $x_{n+1}$, given pairs of explanatory and targeting variables $\{(x_i, y_i)\}_{i=1}^{n}$. If the true model tree is in a model tree candidate set represented by a meta-tree, the statistically optimal prediction (the optimal prediction based on the Bayes decision theory) can be calculated. Under this assumption, with the candidate set restricted to the one represented by a meta-tree, Reference [13] proposed such an optimal prediction. By using instead a model tree candidate set represented by multiple meta-trees, we predict $y_{n+1}$ based on the Bayes decision theory. We call this proposed algorithm MTRF (Meta-Tree Random Forest).
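The idea of weighting candidate model trees by their posterior probability can be illustrated with a brute-force sketch. This is not the MTRF algorithm and not the meta-tree computation of Reference [13]: it simply enumerates three tiny candidate trees (a single leaf, a split on feature 0, a split on feature 1), and it assumes binary features, binary labels, a uniform prior over the trees, and a uniform Beta(1, 1) prior on each leaf's Bernoulli parameter. All function names are hypothetical.

```python
from math import factorial


def leaf_marginal(ones, zeros):
    # Marginal likelihood of a Bernoulli leaf under a uniform Beta(1, 1) prior.
    return factorial(ones) * factorial(zeros) / factorial(ones + zeros + 1)


def leaf_predictive(ones, zeros):
    # Posterior predictive probability that the next label in this leaf is 1.
    return (ones + 1) / (ones + zeros + 2)


def counts(data, split, branch):
    # Label counts among points routed to `branch`; split=None is the single-leaf tree.
    ys = [y for x, y in data if split is None or x[split] == branch]
    return sum(ys), len(ys) - sum(ys)


def bayes_predict(data, x_new, splits=(None, 0, 1)):
    """Return P(y_new = 1) averaged over candidate trees by posterior weight."""
    weights, preds = [], []
    for split in splits:
        branches = [None] if split is None else [0, 1]
        evidence = 1.0
        for b in branches:
            evidence *= leaf_marginal(*counts(data, split, b))
        weights.append(evidence)  # the uniform prior over trees cancels on normalization
        b_new = None if split is None else x_new[split]
        preds.append(leaf_predictive(*counts(data, split, b_new)))
    total = sum(weights)
    return sum(w * p for w, p in zip(weights, preds)) / total


# Toy data in which the label equals feature 0.
data = [((0, 0), 0), ((0, 1), 0), ((1, 0), 1), ((1, 1), 1)]
print(bayes_predict(data, (1, 0)))
```

On this toy data the tree that splits on feature 0 receives the largest posterior weight, so the averaged prediction for `(1, 0)` leans toward label 1. Explicit enumeration like this only works for a handful of trees; the meta-tree representation described in the text is what makes the weighting tractable for deeper trees.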

Model Tree

Problem Setup

Optimal Prediction Based on the Bayes Decision Theory

Previous Study

Expectation over the Parameters

Summation over All Model Trees Represented by a Meta-Tree

Proposed Method

CART and Random Forest Revisited

Comparison of Random Forest with MTRF

Experiments

Experiment 1

Experiment 2

Conclusions and Future Work
