Quick and robust feature selection: the strength of energy-efficient sparse training for autoencoders

Zahra Atashgahi,Ghada Sokar,Tim Van Der Lee,Elena Mocanu,Mykola Pechenizkiy,Decebal Constantin Mocanu,Raymond Veldhuis

doi:10.1007/s10994-021-06063-x

Abstract

Major complications arise from the recent increase in the amount of high-dimensional data, including high computational costs and memory requirements. Feature selection, which identifies the most relevant and informative attributes of a dataset, has been introduced as a solution to this problem. Most of the existing feature selection methods are computationally inefficient; inefficient algorithms lead to high energy consumption, which is not desirable for devices with limited computational and energy resources. In this paper, a novel and flexible method for unsupervised feature selection is proposed. This method, named QuickSelection (The code is available at: https://github.com/zahraatashgahi/QuickSelection), introduces the strength of the neuron in sparse neural networks as a criterion to measure the feature importance. This criterion, blended with sparsely connected denoising autoencoders trained with the sparse evolutionary training procedure, derives the importance of all input features simultaneously. We implement QuickSelection in a purely sparse manner as opposed to the typical approach of using a binary mask over connections to simulate sparsity. It results in a considerable speed increase and memory reduction. When tested on several benchmark datasets, including five low-dimensional and three high-dimensional datasets, the proposed method is able to achieve the best trade-off of classification and clustering accuracy, running time, and maximum memory usage, among widely used approaches for feature selection. Besides, our proposed method requires the least amount of energy among the state-of-the-art autoencoder-based feature selection methods.

Highlights

In the last few years, considerable attention has been paid to the problem of dimensionality reduction and many approaches have been proposed (Van Der Maaten et al, 2009)
We introduce for the first time sparse training in the world of denoising autoencoders, and we named the newly introduced model sparse denoising autoencoder
To derive clustering accuracy (Li et al, 2018), first, we perform K-means using the subset of the dataset corresponding to the selected features and get the cluster labels

Summary

Introduction

In the last few years, considerable attention has been paid to the problem of dimensionality reduction and many approaches have been proposed (Van Der Maaten et al, 2009). Feature extraction focuses on transforming the data into a lower-dimensional space. This transformation is done through a mapping which results in a new set of features (Liu and Motoda, 1998). Feature selection reduces the feature space by selecting a subset of the original attributes without generating new features (Chandrashekar & Sahin, 2014). Based on the availability of the labels, feature selection methods are divided into three categories: supervised (Ang et al, 2015; Chandrashekar & Sahin, 2014), semi-supervised (Sheikhpour et al, 2017; Zhao & Liu, 2007), and unsupervised (Dy and Brodley, 2004; Miao & Niu, 2016). Unsupervised feature selection is considered as a much harder problem (Dy & Brodley, 2004)

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Machine Learning	Publication Date: Oct 27, 2021
Citations: 8	License type: open-access

R Discovery Prime

R Discovery Prime

Quick and robust feature selection: the strength of energy-efficient sparse training for autoencoders

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Similar Papers

Orthogonally constrained matrix factorization for robust unsupervised feature selection with local preserving
Chuan Luo ... Xi Peng
Information Sciences | VOL. 586
Chuan Luo, et. al.Chuan Luo ... Xi Peng
11 Dec 2021
Information Sciences | VOL. 586

Filter unsupervised spectral feature selection method for mixed data based on a new feature correlation measure
Saúl Solorio-Fernández ... José Fco Martínez-Trinidad
Neurocomputing | VOL. 571
Saúl Solorio-Fernández, et. al.Saúl Solorio-Fernández ... José Fco Martínez-Trinidad
12 Dec 2023
Neurocomputing | VOL. 571

Robust unsupervised feature selection via matrix factorization
Shiqiang Du ... Yurun Ma
Neurocomputing | VOL. 241
Shiqiang Du, et. al.Shiqiang Du ... Yurun Ma
11 Feb 2017
Neurocomputing | VOL. 241

An efficient unsupervised feature selection procedure through feature clustering
Xuyang Yan ... Edward Tunstel
Pattern Recognition Letters | VOL. 131
Xuyang Yan, et. al.Xuyang Yan ... Edward Tunstel
03 Jan 2020
Pattern Recognition Letters | VOL. 131

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Quick and robust feature selection: the strength of energy-efficient sparse training for autoencoders

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Machine Learning