Finding the best diversity generation procedures for mining contrast patterns

Milton García-Borroto,José Fco Martínez-Trinidad,Jesús Ariel Carrasco-Ochoa

doi:10.1016/j.eswa.2015.02.028

Abstract

Most understandable classifiers are based on contrast patterns, which can be accurately mined from decision trees. Nevertheless, tree diversity must be ensured to mine a representative pattern collection. In this paper, we performed an experimental comparison among different diversity generation procedures. We compare diversity generated by each procedure based on the amount of total, unique, and minimal patterns extracted from the induced tree for different minimal support thresholds. This comparison, together with an accuracy and abstention experiment, shows that Random Forest and Bagging generate the most diverse and accurate pattern collection. Additionally, we study the influence of data type in the results, finding that Random Forest is best for categorical data and Bagging for numerical data. Comparison includes most known diversity generation procedures and three new deterministic procedures introduced here. These deterministic procedures outperform existing deterministic method, but are still outperformed by random procedures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Finding the best diversity generation procedures for mining contrast patterns

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications

Lead the way for us

Journal: Expert Systems with Applications	Publication Date: Feb 26, 2015
Citations: 27

Similar Papers

Random subsequence forests
Zengyou He ... Quan Zou
Information Sciences | VOL. 667
Zengyou He, et. al.Zengyou He ... Quan Zou
19 Mar 2024
Information Sciences | VOL. 667

A novel approach to build accurate and diverse decision tree forest.
Archana R Panhalkar ... Dharmpal D Doye
Evolutionary Intelligence | VOL. 15
Archana R Panhalkar, et. al.Archana R Panhalkar ... Dharmpal D Doye
03 Jan 2021
Evolutionary Intelligence | VOL. 15

Double random forest
Sunwoo Han ... Hyunjoong Kim
Machine Learning | VOL. 109
Sunwoo Han, et. al.Sunwoo Han ... Hyunjoong Kim
02 Jul 2020
Machine Learning | VOL. 109

Seeing the Forest for the Trees: Random Forest Models for Predicting Survival in Kidney Transplant Recipients.
Ruth Sapir-Pichhadze ... Bruce Kaplan
Transplantation | VOL. 104
Ruth Sapir-Pichhadze, et. al.Ruth Sapir-Pichhadze ... Bruce Kaplan
01 May 2020
Transplantation | VOL. 104

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Finding the best diversity generation procedures for mining contrast patterns

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications