Abstract

The last decade has seen a considerable increase in the availability of data. Unfortunately, this increase has been accompanied by various technical difficulties that arise when analysing large data sets, including long processing times, large data-storage requirements, and other issues related to the analysis of high-dimensional data. Consequently, reducing the dimensionality of data sets with minimal information loss has become of interest to virtually every data scientist. Many feature selection algorithms have been introduced in the literature; however, they suffer from two main issues. First, the vast majority of such algorithms require labelled samples to learn from, and labelling a meaningful amount of data is often too expensive, particularly when dealing with large data sets. Second, these algorithms were not designed to handle the data volumes common today. This paper introduces a novel unsupervised feature selection algorithm designed specifically for large data sets. Our experiments demonstrate the superiority of our method over existing approaches.
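To make the unsupervised setting concrete, the sketch below shows a classic label-free filter, variance thresholding, which ranks features without any class labels and in a single pass over the data. This is a generic baseline for illustration only, not the algorithm proposed in the paper (which the abstract does not describe); the function name and parameters are hypothetical.

```python
import numpy as np

def variance_filter(X, k):
    """Keep the k highest-variance features of X (n_samples x n_features).

    A classic unsupervised filter: it needs no labels and only a single
    pass over the data, which keeps memory and compute costs modest on
    large data sets. Generic baseline, not the paper's method.
    """
    variances = X.var(axis=0)                # per-feature variance, no labels used
    top_k = np.sort(np.argsort(variances)[-k:])  # indices of the k most variable features
    return X[:, top_k], top_k

# Toy usage: 1,000 samples, 50 features, keep the 10 most variable.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 50)) * rng.uniform(0.1, 5.0, size=50)
X_reduced, kept = variance_filter(X, k=10)
print(X_reduced.shape)  # (1000, 10)
```

For data sets too large to fit in memory, the same per-feature variances can be accumulated incrementally (e.g. with Welford's online algorithm) over chunks of rows, which is one reason single-pass filters of this kind scale well.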

