Abstract

Many machine learning applications are now associated with data sets whose sizes would have been almost unimaginable just a short time ago. As a result, many current algorithms cannot handle, or do not scale to, today's extremely large volumes of data. Fortunately, not all features that make up a typical data set carry information that is relevant or useful for prediction, and identifying and removing such irrelevant features can significantly reduce the total data size. The unfortunate dilemma, however, is that some current data sets are so large that common feature selection algorithms, whose very goal is to reduce the dimensionality, cannot handle them, creating a vicious cycle. We describe a sequential learning framework for feature subset selection (SLSS) that can scale with both the number of features and the number of observations. The proposed framework uses multi-armed bandit algorithms to sequentially search subsets of variables and assign a level of importance to each feature. The novel contribution of SLSS is its ability to naturally scale to large data sets, evaluate such data in a very small amount of time, and operate independently of the optimization of any classifier, reducing unnecessary complexity. We demonstrate the capabilities of SLSS on synthetic and real-world data sets.
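
The abstract does not specify the exact SLSS algorithm, so the following is only a minimal sketch of the general idea it describes: treat each feature as a bandit arm, repeatedly sample small feature subsets with a UCB-style rule, score each subset with a cheap classifier, and credit that reward back to the sampled features as an importance estimate. The function name ucb_feature_selection, the choice of UCB1, the shallow-tree reward, and all parameters are illustrative assumptions, not the authors' method.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

def ucb_feature_selection(X, y, n_rounds=200, subset_size=5, seed=0):
    # Hypothetical sketch: each feature is an arm; the reward for a round
    # is the cross-validated accuracy of a cheap classifier trained on the
    # sampled feature subset, credited equally to every sampled feature.
    rng = np.random.default_rng(seed)
    n_features = X.shape[1]
    pulls = np.zeros(n_features)    # how often each feature was sampled
    rewards = np.zeros(n_features)  # cumulative reward per feature

    for t in range(1, n_rounds + 1):
        # UCB1 score: empirical mean reward plus an exploration bonus;
        # features never sampled get infinite priority.
        means = np.divide(rewards, pulls, out=np.zeros(n_features), where=pulls > 0)
        bonus = np.sqrt(2.0 * np.log(t) / np.maximum(pulls, 1e-12))
        ucb = np.where(pulls == 0, np.inf, means + bonus)

        # Select the top-scoring subset of features for this round.
        subset = np.argsort(-ucb)[:subset_size]

        # Reward: 3-fold CV accuracy of a shallow tree on the subset only.
        clf = DecisionTreeClassifier(max_depth=3, random_state=seed)
        reward = cross_val_score(clf, X[:, subset], y, cv=3).mean()

        pulls[subset] += 1
        rewards[subset] += reward

    # Importance estimate: average reward observed when the feature was used.
    return np.divide(rewards, pulls, out=np.zeros(n_features), where=pulls > 0)

if __name__ == "__main__":
    # Synthetic example: 30 features, only 5 of which are informative.
    X, y = make_classification(n_samples=500, n_features=30, n_informative=5,
                               random_state=0)
    importance = ucb_feature_selection(X, y)
    print("top features:", np.argsort(-importance)[:5])

Because the feature scores are accumulated independently of any particular downstream model, this kind of sketch reflects the abstract's claim that the importance estimates can be computed without tying the search to the optimization of a specific classifier.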
