Abstract

Complexity and high dimensionality are inherent concerns of big data, and feature selection has gained prime importance as a means of coping with them by reducing the dimensionality of datasets. The trade-off between maximizing classification accuracy and minimizing the number of selected features remains an open problem. Recently, Monte Carlo Tree Search (MCTS)-based techniques have achieved great success in feature selection by constructing a binary feature selection tree and efficiently focusing on the most valuable features in the feature space. However, such approaches face a trade-off between the depth of the tree search and the number of simulations: with a limited number of simulations, the tree might not reach sufficient depth, biasing feature subset selection towards randomness. In this paper, a new feature selection algorithm is proposed in which multiple feature selection trees are built iteratively in a recursive fashion. The state space of every successor feature selection tree is smaller than that of its predecessor, which increases the impact of the tree search in selecting the best features while the number of MCTS simulations stays fixed. Experiments are performed on 16 benchmark datasets for validation, and performance is compared with state-of-the-art methods from the literature in terms of both classification accuracy and feature selection ratio.

Highlights

  • With the abundance of huge data around, more sophisticated methods are required to handle it. Among the various techniques, feature selection has gained much attention from researchers, mainly because of the high dimensionality of big datasets

  • We extend the idea of MOTiFS and propose a recursive framework to take full advantage of tree search for optimal feature selection

  • The idea is based on the intuition that the state space of every successor feature selection tree is smaller than that of its predecessor, increasing the impact of the tree search in selecting the best features while keeping the Monte Carlo Tree Search (MCTS) simulations fixed during each recursion



Introduction

With the abundance of huge data around, more sophisticated methods are required to handle it. In a limited number of MCTS simulations, the search tree might not reach sufficient depth, inducing a bias towards randomness in feature subset selection. This observation served as the catalyst for this study. The idea is based on the intuition that the state space of every successor feature selection tree is smaller than that of its predecessor, increasing the impact of the tree search in selecting the best features while the MCTS simulation budget stays fixed during each recursion. The algorithm starts with the full feature set F as the initial input and builds a series of feature selection trees, each producing the best feature subset (Fbest) as output after S MCTS simulations.
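The paper's full procedure builds a binary feature selection tree and searches it with MCTS (as in MOTiFS); the sketch below captures only the recursive outer loop, using a random-subset sampler as a stand-in for one MCTS pass. The names search_once and recursive_feature_selection, and the evaluate callable, are hypothetical illustrations rather than the authors' API. The intuition it demonstrates: a tree over n features spans 2^n candidate subsets, so each recursion that shrinks n lets the same budget of S simulations cover the remaining space far more densely.

    import random

    def search_once(features, simulations, evaluate):
        # Stand-in for one MCTS pass over a binary feature selection tree.
        # We sample `simulations` random subsets and keep the best one; the
        # paper instead uses MCTS to bias sampling towards promising branches.
        best_subset, best_reward = list(features), evaluate(features)
        for _ in range(simulations):
            subset = [f for f in features if random.random() < 0.5]
            if not subset:
                continue
            reward = evaluate(subset)
            if reward > best_reward:
                best_subset, best_reward = subset, reward
        return best_subset, best_reward

    def recursive_feature_selection(features, simulations, evaluate):
        # Recursive framework: each pass searches only the best subset found
        # by its predecessor, so the fixed simulation budget covers a
        # progressively smaller state space more thoroughly.
        current = list(features)
        best_subset, best_reward = search_once(current, simulations, evaluate)
        while len(best_subset) < len(current):
            current = best_subset
            subset, reward = search_once(current, simulations, evaluate)
            if reward <= best_reward:
                break  # no improvement on the smaller space; stop recursing
            best_subset, best_reward = subset, reward
        return best_subset

    # Toy usage with a synthetic relevance score (hypothetical data).
    relevance = {0: 0.9, 1: 0.1, 2: 0.8, 3: 0.05, 4: 0.7}
    score = lambda s: sum(relevance[f] for f in s) - 0.2 * len(s)
    print(recursive_feature_selection(list(relevance), 200, score))

In this toy scorer the per-feature penalty of 0.2 makes the weakly relevant features 1 and 3 net negative, so the recursion tends to settle on the subset {0, 2, 4}; a real reward would instead combine classifier accuracy with a subset-size term.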

Related Work
The Recursive Procedure
Feature Subset Generation
Reward Calculation and Backpropagation
Datasets
Experimental Setting
Comparison with MOTiFS and H-MOTiFS
Comparison with State-of-the-Art Methods
Non-Parametric Statistical Tests
Conclusions