Feature ranking is a kind of feature selection process which ranks the features based on their relevances and importance with respect to the problem. This topic has been well studied in supervised classification area. However, very few works are done for unsupervised clustering under the condition that labels of all instances are unknown beforehand. Thus, feature ranking for unsupervised clustering is a challenging task due to the absence of labels of instances for guiding the computations of the relevances of features. This paper explores the feature ranking approach within the unsupervised clustering area. We propose a novel consensus unsupervised feature ranking approach, termed as unsupervised feature ranking from multiple views (FRMV). The FRMV method firstly obtains multiple rankings of all features from different views of the same data set and then aggregates all the obtained feature rankings into a single consensus one. Experimental results on several real data sets demonstrate that FRMV is often able to identify a better feature ranking when compared with that obtained by a single feature ranking approach.
Read full abstract