An empirical evaluation of random transformations applied to ensemble clustering

Gabriel Damasceno Rodrigues,Marcelo Keese Albertini,Xiaomin Yang

doi:10.1007/s11042-020-08947-x

Gabriel Damasceno Rodrigues, Marcelo Keese Albertini + Show 1 more

https://doi.org/10.1007/s11042-020-08947-x

Copy DOI

Abstract

Ensemble clustering techniques have improved in recent years, offering better average performance between domains and data sets. Benefits range from finding novelty clustering which are unattainable by any single clustering algorithm to providing clustering stability, such that the quality is little affected by noise, outliers or sampling variations. The main clustering ensemble strategies are: to combine results of different clustering algorithms; to produce different results by resampling the data, such as in bagging and boosting techniques; and to execute a given algorithm multiple times with different parameters or initialization. Often ensemble techniques are developed for supervised settings and later adapted to the unsupervised setting. Recently, Blaser and Fryzlewicz proposed an ensemble technique to classification based on resampling and transforming input data. Specifically, they employed random rotations to improve significantly Random Forests performance. In this work, we have empirically studied the effects of random transformations based in rotation matrices, Mahalanobis distance and density proximity to improve ensemble clustering. Our experiments considered 12 data sets and 25 variations of random transformations, given a total of 5580 data sets applied to 8 algorithms and evaluated by 4 clustering measures. Statistical tests identified 17 random transformations that are viable to be applied to ensembles and standard clustering algorithms, which had positive effects on cluster quality. In our results, the best performing transforms were Mahalanobis-based transformations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An empirical evaluation of random transformations applied to ensemble clustering

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Journal: Multimedia Tools and Applications	Publication Date: Jul 28, 2020
Citations: 1

Similar Papers

Investigation the performance of ensemble clustering techniques in latest GPS velocity field of Turkey
Batuhan Kilic ... Yalçın Yılmaz
-
Batuhan Kilic, et. al.Batuhan Kilic ... Yalçın Yılmaz
15 May 2023
15 May 2023

Link-based cluster ensembles for heterogeneous biological data analysis
Natthakan Iam-On ... Tossapon Boongoen
-
Natthakan Iam-On, et. al.Natthakan Iam-On ... Tossapon Boongoen
01 Dec 2010
01 Dec 2010

A new link-based method to ensemble clustering and cancer microarray data analysis
... Nattawut Kongkotchawan
International Journal of Collaborative Intelligence | VOL. 1
, et. al. ... Nattawut Kongkotchawan
01 Jan 2014
International Journal of Collaborative Intelligence | VOL. 1

Fuzzy-Rough induced spectral ensemble clustering
Guanli Yue ... Yanpeng Qu
Journal of Intelligent & Fuzzy Systems | VOL. 45
Guanli Yue, et. al.Guanli Yue ... Yanpeng Qu
02 Jul 2023
Journal of Intelligent & Fuzzy Systems | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An empirical evaluation of random transformations applied to ensemble clustering

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications