CUDAP: A Novel Clustering Algorithm for Uncertain Data Based on Approximate Backbone

Ping Jin,Yu Zong,Shichao Qu,Xin Li

doi:10.4304/jsw.9.3.732-737

Abstract

Clustering for uncertain data is an interesting research topic in data mining. Researchers prefer to define uncertain data clustering problem by using combinatorial optimization model. Heuristic clustering algorithm is an efficient way to deal with this kind of clustering problem, but initialization sensitivity is one of inevitable drawbacks. In this paper, we propose a novel clustering algorithm named CUDAP (Clustering algorithm for Uncertain Data based on Approximate backbone). In CUDAP, we (1) make M times random sampling on the original uncertain data set D m to generate M sampled data sets DS= { Ds 1 ,Ds 2 ,…,Ds M }; (2) capture the M local optimal clustering results P ={ C 1 ,C 2 ,…,C M } from DS by running UK-Medoids algorithm on each sample data set Ds i , i=1,…M ; (3) design a greedy search algorithm to find out the approximate backbone( APB ) from P ; (4) run UK-Medoids again on the original uncertain data set D m guided by new initialization which was generated from APB . Experimental results on synthetic and real world data sets demonstrate the superiority of the proposed approach in terms of clustering quality measures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CUDAP: A Novel Clustering Algorithm for Uncertain Data Based on Approximate Backbone

Abstract

Talk to us

Similar Papers

More From: Journal of Software

Lead the way for us

Journal: Journal of Software	Publication Date: Jan 3, 2014
Citations: 7

Similar Papers

HC_AB: A new heuristic clustering algorithm based on Approximate Backbone
Yu Zong ... Enhong Chen
Information Processing Letters | VOL. 111
Yu Zong, et. al.Yu Zong ... Enhong Chen
03 Jun 2011
Information Processing Letters | VOL. 111

A Clustering Algorithm based on Local Accumulative Knowledge
Yu Zong ... Dongguan Xu
Journal of Computers | VOL. -
Yu Zong, et. al.Yu Zong ... Dongguan Xu
02 Jan 2013
Journal of Computers | VOL. -

Constraint Based Subspace Clustering for High Dimensional Uncertain Data
Xianchao Zhang ... Hong Yu
-
Xianchao Zhang, et. al.Xianchao Zhang ... Hong Yu
01 Jan 2015
01 Jan 2015

The Issue of Missing Values in Data Mining
Malcolm J Beynon
-
Malcolm J BeynonMalcolm J Beynon
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CUDAP: A Novel Clustering Algorithm for Uncertain Data Based on Approximate Backbone

Abstract

Talk to us

Similar Papers

More From: Journal of Software