Tree Induction Algorithm Research Articles

• The three types of metaheuristic and the hyperheuristic-based methods to build decision tree induction algorithms are included, not just evolutionary-algorithm-based approaches for decision tree induction. • Three types of implementation strategies are introduced, according to the place where a metaheuristict is used in the decision tree induction process. • The differences in the solution representation are highlighted, as they impact the metaheuristic-based implementation. • The principal components of each method are detailed, such as fitness measures and variation operators. • An analysis of the experimental studies performed in these methods is conducted. The induction of decision trees is a widely-used approach to build classification models that guarantee high performance and expressiveness. Since a recursive-partitioning strategy guided for some splitting criterion is commonly used to induce these classifiers, overfitting, attribute selection bias, and instability to small training set changes are well-known problems in them. Other approaches, such as incremental induction, classifier ensembles, and the global search in the decision-tree-space, have been implemented to overcome these problems. In particular, metaheuristics such as simulated annealing, genetic algorithms, genetic programming, and ant colony optimization have been used to induce compact and accurate decision trees. This paper presents a state-of-the-art review of the use of single-solution-based metaheuristics and swarm and evolutionary computation algorithms to build decision trees as classification models. We outline the decision-tree-induction process components and detail the existing literature studies on metaheuristic-based approaches to building these classifiers. Several timelines showing the chronological order in which these approaches were introduced in the literature are included. A summary analysis of these studies is also conducted, focusing on their internal components and experimental studies. This work provides a useful reference point for future research in this field.

Read full abstract

Decision trees are machine learning models commonly used in various application scenarios. In the era of big data, traditional decision tree induction algorithms are not suitable for learning large-scale datasets due to their stringent data storage requirement. Online decision tree learning algorithms have been devised to tackle this problem by concurrently training with incoming samples and providing inference results. However, even the most up-to-date online tree learning algorithms still suffer from either high memory usage or high computational intensity with dependency and long latency, making them challenging to implement in hardware. To overcome these difficulties, we introduce a new quantile-based algorithm to improve the induction of the Hoeffding tree, one of the state-of-the-art online learning models. The proposed algorithm is lightweight in terms of both memory and computational demand, while still maintaining high generalization ability. A series of optimization techniques dedicated to the proposed algorithm have been investigated from the hardware perspective, including coarse-grained and fine-grained parallelism, dynamic and memory-based resource sharing, pipelining with data forwarding. Following this, we present Hard-ODT, a high-performance, hardware-efficient and scalable online decision tree learning system on a field-programmable gate array (FPGA) with system-level optimization techniques. Performance and resource utilization are modeled for the complete learning system for early and fast analysis of the tradeoff between various design metrics. Finally, we propose a design flow in which the proposed learning system is applied to FPGA run-time power monitoring as a case study. Experimental results show that our proposed algorithm outperforms the state-of-the-art Hoeffding tree learning method, leading to 0.05% to 12.3% improvement in inference accuracy. Real implementation of the complete learning system on the FPGA demonstrates a <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$384\times $ </tex-math></inline-formula> to <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$1581\times $ </tex-math></inline-formula> speedup in execution time over the state-of-the-art design. The power modeling strategy with Hard-ODT achieves an average power prediction error within 4.93% of a commercial gate-level power estimation tool.

Read full abstract

Tree Induction Algorithm Research Articles

Related Topics

Articles published on Tree Induction Algorithm

Increasing trust in AI through privacy preservation and model explainability: Federated Learning of Fuzzy Regression Trees

Enhancing Groundwater Quality Evaluation Using Associative Rule Mining Technique with Random Forest Split Gini Indexing Algorithm for Nitrate Concentration Analysis

Generative Model for Decision Trees

Better trees: an empirical study on hyperparameter tuning of classification decision tree induction algorithms

Big data decision tree for continuous-valued attributes based on unbalanced cut points

Multi-dimensional Bayesian network classifiers for partial label ranking

A pivot-based simulated annealing algorithm to determine oblique splits for decision tree induction

Regularized impurity reduction: accurate decision trees with complexity guarantees

Towards a Better Diagnosis of Prostate Cancer: Application of Machine Learning Algorithms

Deep preference learning for multiple criteria decision analysis

Induction of decision trees as classification models through metaheuristics

An integrated knowledge-based system for early detection of eye refractive error using data mining

Predicting and interpreting oxide glass properties by machine learning using large datasets

HYPER HEURISTIC EVOLUTIONARY APPROACH FOR CONSTRUCTING DECISION TREE CLASSIFIERS

GEOBIA, TREE DECISION AND HIERARCHICAL CLASSIFICATION FOR MAPPING GULLY EROSION

Hard-ODT: Hardware-Friendly Online Decision Tree Learning Algorithm and System

Spam Detection in Social Media Networking Sites using Ensemble Methodology with Cross Validation

Decision tree classifier: a detailed survey

Hardware Acceleration of Sparse Oblique Decision Trees for Edge Computing

Inquiry of Personal Intelligence Of Adolescence and Early Adulthood College Students of Tamilnadu using Id3 Algorithm

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Tree Induction Algorithm Research Articles

Related Topics

Articles published on Tree Induction Algorithm

Increasing trust in AI through privacy preservation and model explainability: Federated Learning of Fuzzy Regression Trees

Enhancing Groundwater Quality Evaluation Using Associative Rule Mining Technique with Random Forest Split Gini Indexing Algorithm for Nitrate Concentration Analysis

Generative Model for Decision Trees

Better trees: an empirical study on hyperparameter tuning of classification decision tree induction algorithms

Big data decision tree for continuous-valued attributes based on unbalanced cut points

Multi-dimensional Bayesian network classifiers for partial label ranking

A pivot-based simulated annealing algorithm to determine oblique splits for decision tree induction

Regularized impurity reduction: accurate decision trees with complexity guarantees

Towards a Better Diagnosis of Prostate Cancer: Application of Machine Learning Algorithms

Deep preference learning for multiple criteria decision analysis

Induction of decision trees as classification models through metaheuristics

An integrated knowledge-based system for early detection of eye refractive error using data mining

Predicting and interpreting oxide glass properties by machine learning using large datasets

HYPER HEURISTIC EVOLUTIONARY APPROACH FOR CONSTRUCTING DECISION TREE CLASSIFIERS

GEOBIA, TREE DECISION AND HIERARCHICAL CLASSIFICATION FOR MAPPING GULLY EROSION

Hard-ODT: Hardware-Friendly Online Decision Tree Learning Algorithm and System

Spam Detection in Social Media Networking Sites using Ensemble Methodology with Cross Validation

Decision tree classifier: a detailed survey

Hardware Acceleration of Sparse Oblique Decision Trees for Edge Computing

Inquiry of Personal Intelligence Of Adolescence and Early Adulthood College Students of Tamilnadu using Id3 Algorithm