Adaptive Regularized Semi-Supervised Clustering Ensemble

Rui Luo,Zhiwen Yu,Wenming Cao,Hau-San Wong,Cheng Liu,C L Philip Chen

doi:10.1109/access.2019.2963306

Abstract

Although semi-supervised clustering ensemble methods have achieved satisfactory performance, they fail to effectively utilize the constrained knowledge such as cannot-link and must-link when generating diverse ensemble members. In addition, they ignore negative effects brought about by redundancies and noisy data. To address the above shortcomings, in this paper we propose an approach to combine multiple semi-supervised clustering solutions via adaptively regularizing the weights of clustering ensemble members, which is referred to as ARSCE. First, we generate a series of feature subspaces by randomly selecting feature without replacement to avoid the scenario where there are two identical feature subspaces. Second, we conduct feature transformation on the above obtained feature subspaces while considering the pairwise constraints to find new clustering-friendly spaces, where clustering methods are exploited to generate various clustering solutions. Finally, we design a novel fusion strategy to integrate multiple clustering solutions into a unified clustering partition, where weights are designated for each clustering ensemble member. Extensive experiments are conducted on multiple real-world benchmarks, and experimental results demonstrate the effectiveness and superiority of our proposed method ARSCE over other counterparts.

Highlights

Clustering, as one of unsupervised learning methods, aims to split data into several disjoint groups, so that data in the same group are more similar than those from different groups
Inspired by ensemble supervised learning methods, recent years have witnessed the development of clustering ensemble, which is divided into two steps: the generations of clustering solutions and the fusion of clustering solutions
To achieve the above two goals, we propose an adaptative regularized semi-supervised clustering ensemble framework, which is referred to as adaptive Regularized semisupervised clustering ensemble method (ARSCE)

Summary

INTRODUCTION

Clustering, as one of unsupervised learning methods, aims to split data into several disjoint groups, so that data in the same group are more similar than those from different groups. Bai et al [7] propose a weighted consensus measure based on information entropy to evaluate the clustering quality These clustering ensemble methods have achieved satisfactory performance, they seldom consider the issues below: 1) how to fully exploit prior information provided by experts, denoted as must-link and cannot-link constraints, and 2) how to design a better fusion strategy to integrate all the clustering solutions into a more robust and stable solution, compared with each base clustering solution component. The contributions of this work are summarized as follows: 1) We propose a transformation working in random feature subspaces while considering pairwise constraints for finding a clustering-friendly space, where clustering solutions are generated via using traditional clustering methods.

RELATED WORK

EXPERIMENTS

EXPERIMENTAL ANALYSIS

CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Adaptive Regularized Semi-Supervised Clustering Ensemble

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Multi-objective clustering ensemble for high-dimensional data based on Strength Pareto Evolutionary Algorithm (SPEA-II)
Abdul Wahid ... Peter Andreae
-
Abdul Wahid, et. al.Abdul Wahid ... Peter Andreae
01 Oct 2015
01 Oct 2015

Projective clustering ensembles
Francesco Gullo ... Andrea Tagarelli
Data Mining and Knowledge Discovery | VOL. 26
Francesco Gullo, et. al.Francesco Gullo ... Andrea Tagarelli
03 May 2012
Data Mining and Knowledge Discovery | VOL. 26

A semi-supervised clustering ensemble approach integrated constraint-based and metric-based
Siting Wei ... Canlong Zhang
-
Siting Wei, et. al.Siting Wei ... Canlong Zhang
19 Aug 2015
19 Aug 2015

Multi-objective Clustering Ensemble for Varying Number of Clusters
Sujoy Chatterjee ... Anirban Mukhopadhyay
-
Sujoy Chatterjee, et. al.Sujoy Chatterjee ... Anirban Mukhopadhyay
01 Nov 2018
01 Nov 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Adaptive Regularized Semi-Supervised Clustering Ensemble

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access