Pruning of genetic programming trees using permutation tests

Peter Rockett

doi:10.1007/s12065-020-00379-8

Abstract

We present a novel approach based on statistical permutation tests for pruning redundant subtrees from genetic programming (GP) trees that allows us to explore the extent of effective redundancy . We observe that over a range of regression problems, median tree sizes are reduced by around 20% largely independent of test function, and that while some large subtrees are removed, the median pruned subtree comprises just three nodes; most take the form of an exact algebraic simplification. Our statistically-based pruning technique has allowed us to explore the hypothesis that a given subtree can be replaced with a constant if this substitution results in no statistical change to the behavior of the parent tree—what we term approximate simplification. In the eventuality, we infer that more than 95% of the accepted pruning proposals are the result of algebraic simplifications, which provides some practical insight into the scope of removing redundancies in GP trees.

Highlights

It has long been accepted that genetic programming (GP) produces trees that contain substantial amounts of redundancy [2, 3, 10, 15, 30]
The typical distributions of tree node counts for the French curve test function for a range of hard node count limits is shown in Fig. 2; these have been accumulated over 1000 repetitions, each with independent initial populations. (All results have incorporated the Benjamini & Hochberg multiple comparison procedures.) the corresponding plots for all the test functions look remarkably similar
From the results presented in this paper it is apparent that the proposed permutation-based pruning procedure is effective in reducing median tree sizes by about 20% independent of test function and hard node count limit

Summary

Introduction

It has long been accepted that genetic programming (GP) produces trees that contain substantial amounts of redundancy [2, 3, 10, 15, 30]. Zhang and co-workers [30, 31] have explored the use of hashing to simplify trees both at the end of a run as well as during the evolutionary run These authors found, unsurprisingly, that simplification reduced tree sizes the effect on test performance was not examined with any formal statistical procedure and does appear to have been resolved. Song et al [25] pruned trees by comparing the output of a binary node with its two inputs and replacing that node with either child if it gave the same value as the binary output within a threshold The drawbacks with both these contributions are that: (i) they involve local operations that ignore the effect of an edit higher in the tree, and (ii) both rely on setting user-defined thresholds for which there appears to be no principled method other than trial-anderror. Evolutionary Intelligence (2020) 13:649–661 authors only considered the propagated influence of a proposed simplification one or two levels up the tree

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Evolutionary Intelligence	Publication Date: Apr 20, 2020
Citations: 9	License type: open-access

R Discovery Prime

R Discovery Prime

Pruning of genetic programming trees using permutation tests

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Evolutionary Intelligence

Lead the way for us

Similar Papers

Function approximations by coupling neural networks and genetic programming trees with oblique decision trees
Y.-S Yeun ... Y.-S Yang
Artificial Intelligence in Engineering | VOL. 13
Y.-S Yeun, et. al.Y.-S Yeun ... Y.-S Yang
01 Jul 1999
Artificial Intelligence in Engineering | VOL. 13

CoInGP
Domagoj Jakobovic ... Mauro Castelli
-
Domagoj Jakobovic, et. al.Domagoj Jakobovic ... Mauro Castelli
26 Jun 2021
26 Jun 2021

The Evolution of Representations in Genetic Programming Trees
Douglas Kirkpatrick ... Arend Hintze
-
Douglas Kirkpatrick, et. al.Douglas Kirkpatrick ... Arend Hintze
01 Jan 2020
01 Jan 2020

Nonlinear model identification of an experimental ball-and-tube system using a genetic programming approach
Leandro Dos Santos Coelho ... Marcelo Wicthoff Pessôa
Mechanical Systems and Signal Processing | VOL. 23
Leandro Dos Santos Coelho, et. al.Leandro Dos Santos Coelho ... Marcelo Wicthoff Pessôa
20 Feb 2009
Mechanical Systems and Signal Processing | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Pruning of genetic programming trees using permutation tests

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Evolutionary Intelligence