A Link-Based Cluster Ensemble Approach for Categorical Data Clustering

Natthakan Iam-On, Chris Price, Simon Garrett, Tossapon Boongoen

doi:10.1109/tkde.2010.268

Abstract

Although attempts have been made to solve the problem of clustering categorical data via cluster ensembles, with results that are competitive with conventional algorithms, these techniques generate the final data partition from incomplete information. The underlying ensemble-information matrix records only the relations between clusters and data points, and many of its entries are left unknown. The paper presents an analysis suggesting that this problem degrades the quality of the clustering result, and it proposes a new link-based approach that improves the conventional matrix by discovering the unknown entries through the similarity between clusters in an ensemble. In particular, an efficient link-based algorithm is proposed for the underlying similarity assessment. To obtain the final clustering result, a graph partitioning technique is then applied to a weighted bipartite graph formulated from the refined matrix. Experimental results on multiple real data sets suggest that the proposed link-based method almost always outperforms both conventional clustering algorithms for categorical data and well-known cluster ensemble techniques.
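The matrix refinement described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not give the link-based similarity measure, so a simple shared-neighbor score over Jaccard-weighted links between clusters is swapped in as a stand-in. The function names, the `decay` parameter, and the rule of filling an unknown entry with the similarity to the point's own cluster in the same base clustering are all assumptions made for illustration.

```python
from itertools import combinations


def binary_association(labelings):
    """One column per cluster of every base clustering; entry is 1.0 if the point is a member."""
    columns = [(m, lab) for m, labels in enumerate(labelings) for lab in sorted(set(labels))]
    n_points = len(labelings[0])
    matrix = [[1.0 if labelings[m][i] == lab else 0.0 for (m, lab) in columns]
              for i in range(n_points)]
    return columns, matrix


def cluster_similarity(columns, matrix):
    """Stand-in similarity (assumption): two clusters are similar when they share
    neighbors to which both are linked by a high Jaccard member overlap."""
    k = len(columns)
    members = [{i for i, row in enumerate(matrix) if row[c] == 1.0} for c in range(k)]
    weight = [[0.0] * k for _ in range(k)]
    for a, b in combinations(range(k), 2):
        union = members[a] | members[b]
        w = len(members[a] & members[b]) / len(union) if union else 0.0
        weight[a][b] = weight[b][a] = w
    sim = [[0.0] * k for _ in range(k)]
    for a, b in combinations(range(k), 2):
        shared = sum(min(weight[a][c], weight[b][c]) for c in range(k) if c not in (a, b))
        sim[a][b] = sim[b][a] = shared
    top = max(max(row) for row in sim)
    if top > 0:
        sim = [[v / top for v in row] for row in sim]  # scale scores into [0, 1]
    return sim


def refine(labelings, decay=0.9):
    """Keep known memberships at 1.0 and fill each unknown entry with the (decayed)
    similarity between that cluster and the point's own cluster in the same clustering."""
    columns, matrix = binary_association(labelings)
    sim = cluster_similarity(columns, matrix)
    index = {col: c for c, col in enumerate(columns)}
    refined = []
    for i, row in enumerate(matrix):
        new_row = []
        for c, (m, _lab) in enumerate(columns):
            if row[c] == 1.0:
                new_row.append(1.0)
            else:
                own = index[(m, labelings[m][i])]
                new_row.append(decay * sim[own][c])
        refined.append(new_row)
    return columns, refined


# Hypothetical ensemble: two base clusterings of four data points.
ensemble = [[0, 0, 1, 1], [0, 0, 0, 1]]
cols, rm = refine(ensemble)
for row in rm:
    print([round(v, 2) for v in row])
```

In a full pipeline, the rows and columns of the refined matrix would become the two vertex sets of a weighted bipartite graph, with the matrix entries as edge weights, and a graph partitioning method would then produce the final clusters. That step is omitted here.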

Journal: IEEE Transactions on Knowledge and Data Engineering
Publication Date: Mar 1, 2012
Citations: 173

Similar Papers

A Bootstrap Aggregating Technique on Link-Based Cluster Ensemble Approach for Categorical Data Clustering
S Pavan Kumar Reddy ... U Sesadri
International Journal of Computers & Technology | Vol. 10
30 Aug 2013

Measurement of similarity using link based cluster approach for categorical data
M Pavithra ... D Chandrakala
01 Feb 2013

A cluster ensemble method for clustering categorical data
Zengyou He ... Shengchun Deng
Information Fusion | Vol. 6
09 Apr 2004

Partition-and-merge based fuzzy genetic clustering algorithm for categorical data
Thi Phuong Quyen Nguyen ... R.J Kuo
Applied Soft Computing | Vol. 75
19 Nov 2018
