Minimizing the variance of cluster mixture models for clustering uncertain objects

Francesco Gullo,Andrea Tagarelli,Giovanni Ponti

doi:10.1002/sam.11170

Abstract

AbstractIn recent years, there has been a growing interest in clustering uncertain objects. In contrast to traditional, ‘sharp’ data representation models, uncertain objects are modeled as probability distributions defined over uncertainty regions. In this context, a major issue is related to the poor efficiency of existing algorithms, which is mainly due to expensive computation of the distance between uncertain objects. In this work, we extend our earlier work in which a novel formulation to the problem of clustering uncertain objects is defined based on the minimization of the variance of the mixture models that represent the clusters being discovered. Analytical properties about the computation of variance for cluster mixture models are derived and exploited by a partitional clustering algorithm, called MMVar. This algorithm achieves high efficiency since it does not need to employ any distance measure between uncertain objects. Experiments have shown that MMVar is scalable and outperforms state‐of‐the‐art algorithms in terms of efficiency, while achieving better average performance in terms of accuracy. © 2012 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 6: 116–135, 2013

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Minimizing the variance of cluster mixture models for clustering uncertain objects

Abstract

Talk to us

Similar Papers

More From: Statistical Analysis and Data Mining: The ASA Data Science Journal

Lead the way for us

Journal: Statistical Analysis and Data Mining: The ASA Data Science Journal	Publication Date: Nov 19, 2012
Citations: 9

Similar Papers

Minimizing the Variance of Cluster Mixture Models for Clustering Uncertain Objects
Francesco Gullo ... Giovanni Ponti
-
Francesco Gullo, et. al.Francesco Gullo ... Giovanni Ponti
01 Dec 2010
01 Dec 2010

A Hierarchical Algorithm for Clustering Uncertain Data via an Information-Theoretic Approach
Francesco Gullo ... Giovanni Ponti
-
Francesco Gullo, et. al.Francesco Gullo ... Giovanni Ponti
01 Dec 2008
01 Dec 2008

Improved Bisector pruning for uncertain data mining
...
-
, et. al. ...
25 Jun 2012
25 Jun 2012

Statistical and Machine-Learning Data Mining, Third Edition

-

12 Jul 2017
12 Jul 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Minimizing the variance of cluster mixture models for clustering uncertain objects

Abstract

Talk to us

Similar Papers

More From: Statistical Analysis and Data Mining: The ASA Data Science Journal