Graph Representation Learning enhanced Semi-supervised Feature Selection

Jun Tan, Zhifeng Qiu, Ning Gui

doi:10.1145/3689428

Abstract

Feature selection is a key step in machine learning: by eliminating features that are unrelated to the modeling target, it helps create reliable and interpretable models. By exploring the potentially complex correlations among features of unlabeled data, recently introduced self-supervision-enhanced feature selection methods greatly reduce the reliance on labeled samples. However, these methods are generally based on autoencoders with sample-wise self-supervision, which can hardly exploit the relations among samples. To address this limitation, this paper proposes Graph representation learning enhanced Semi-supervised Feature Selection (G-FS), which translates unlabeled “plain” tabular data into a bipartite graph and performs feature selection by discovering and exploiting the non-Euclidean relations among features and samples. A self-supervised edge prediction task is designed to distill the rich information on the graph into low-dimensional embeddings, removing redundant features and noise. Guided by the condensed graph representation, we propose a batch-attention feature weight generation mechanism that generates more robust weights according to batch-based selection patterns rather than individual samples. The results show that G-FS achieves significant performance advantages on fourteen datasets compared with twelve state-of-the-art baselines, including two recent self-supervised baselines. The source code is publicly available at https://github.com/Icannotnamemyselff/G-FS_Graph_enhacned_feature_selection .
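
The abstract does not spell out how the tabular data becomes a bipartite graph or how edges are held out for the edge prediction task, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes each sample and each feature becomes a node, each observed cell becomes a weighted sample-feature edge, and a random fraction of edges is hidden as prediction targets. The names `table_to_bipartite_edges`, `split_edges`, and `mask_ratio` are hypothetical.

```python
import math
import random


def table_to_bipartite_edges(rows):
    """Convert a table (list of rows) into a bipartite edge list.

    Assumption: sample i is node i, feature j is node n_samples + j,
    and every observed cell (i, j) becomes one edge weighted by its value.
    Missing cells (None or NaN) produce no edge.
    """
    n_samples = len(rows)
    edges = []
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is None or (isinstance(value, float) and math.isnan(value)):
                continue
            edges.append((i, n_samples + j, float(value)))
    return edges


def split_edges(edges, mask_ratio=0.3, seed=0):
    """Hide a random fraction of edges as self-supervised prediction targets.

    Returns (visible_edges, hidden_edges). A model would see only the
    visible edges and be trained to predict the weights of the hidden ones.
    """
    rng = random.Random(seed)
    shuffled = edges[:]
    rng.shuffle(shuffled)
    n_hidden = int(round(mask_ratio * len(shuffled)))
    return shuffled[n_hidden:], shuffled[:n_hidden]


# Toy table: 2 samples, 3 features, one missing cell.
table = [[0.5, 1.0, None],
         [2.0, 0.0, 3.0]]
edges = table_to_bipartite_edges(table)
visible, hidden = split_edges(edges, mask_ratio=0.4)
```

In this toy table, samples are nodes 0 and 1 and features are nodes 2 to 4. The missing cell simply yields no edge, which is one reason a graph view can be convenient for incomplete tabular data.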

More From: ACM Transactions on Knowledge Discovery from Data

Similar Papers

Feature Selection Based on Graph Representation
Yassine Akhiat ... Mohamed Chahhou
01 Oct 2018

Feature evaluation and selection based on neighborhood soft margin
Qinghua Hu ... Daren Yu
Neurocomputing | VOL. 73
06 Mar 2010

Feature weighting and selection with a Pareto-optimal trade-off between relevancy and redundancy
Ayan Das ... Swagatam Das
Pattern Recognition Letters | VOL. 88
12 Jan 2017

Using cooperative game theory to optimize the feature selection problem
Xin Sun ... Huiling Chen
Neurocomputing | VOL. 97
29 May 2012
