Abstract

The k-means clustering algorithm is the oldest and most known method in cluster analysis. It has been widely studied with various extensions and applied in a variety of substantive areas. Since internet, social network, and big data grow rapidly, multi-view data become more important. For analyzing multi-view data, various multi-view k-means clustering algorithms have been studied. However, most of multi-view k-means clustering algorithms in the literature cannot give feature reduction during clustering procedures. In general, there often exist irrelevant feature components in multi-view data sets that may cause bad performance for these clustering algorithms. There also exists high feature dimension in multi-view data sets so it is necessary to consider reducing its dimension for clustering algorithms. In this paper, a learning mechanism for the multi-view k-means algorithm to automatically compute individual feature weight is constructed. It can reduce these irrelevant feature components in each view. A new multi-view k-means objective function is firstly proposed for constructing the learning mechanism for feature weights in multi-view clustering. A schema for eliminating irrelevant feature(s) with small weight(s) is then considered for feature reduction. Therefore, a new type of multi-view k-means, called a feature-reduction multi-view k-means (FRMVK), is proposed. The computational complexity of FRMVK is also analyzed. Numerical and real data sets are used to compare FRMVK with other feature-weighted multi-view k-means algorithms. Experimental results and comparisons actually demonstrate the effectiveness and usefulness of the proposed FRMVK clustering algorithm.

Highlights

  • Clustering is a useful tool for data analysis

  • EXPERIMENTAL RESULTS AND COMPARISONS three synthetic and four real data sets are used to illustrate the performance of the proposed feature-reduction multi-view k-means (FRMVK) algorithm

  • For the Image Segmentation (IS) data set, when we run simultaneous weighting on views and features (SWVF) with β = 10 under different α, we find that the clustering performance obtained unbalance Accuracy rate (AR) and Rand Index (RI)

Read more

Summary

INTRODUCTION

Clustering is a useful tool for data analysis. It is a method for clustering a data set into groups with the most similarity in the same cluster and the most dissimilarity between different clusters [1], [2]. P. Sinaga: Feature-Reduction Multi-View k-Means Clustering Algorithm learn a discriminative feature representation was proposed in [17]. A feature-reduction schema for multi-view k-means clustering algorithms becomes important. In this paper we propose a novel feature reduction mechanism for multi-view k-means using the idea of Yang and Yessica [20]. We propose the feature-reduction multi-view k-means (FRMVK) clustering algorithm that can automatically compute different feature weights and detects these unimportant (irrelevant) features in each view. Based on the feature-reduction mechanism in each view, the proposed FRMVK algorithm can solve the weakness in most multiview k-means algorithms for multi-view data.

RELATED WORKS
THE PROPOSED FEATURE-REDUCTION MULTI-VIEW
EXPERIMENTAL RESULTS AND COMPARISONS
Result
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call