A kernel density smoothing method for determining an optimal number of clusters in continuous data

J Bugrien,F Shuweihdi,K Mwitondi

doi:10.2495/risk140151

Abstract

While data clustering algorithms are becoming increasingly popular across scientific, industrial and social data mining applications, model complexity remains a major challenge. Most clustering algorithms do not incorporate a mechanism for finding an optimal scale parameter that corresponds to an appropriate number of clusters. We propose , a kernel-density smoothing-based approach to data clustering. Its main ideas derive from two unsupervised clustering approaches – kernel density estimation (KDE) and scale-spacing clustering (SSC). The novel method determines the optimal number of clusters by first finding dense regions in data before separating them based on data-dependent parameter estimates. The optimal number of clusters is determined from different levels of smoothing after the inherent number of arbitrary shape clusters has been detected without a priori information. We demonstrate the applicability of the proposed method under both nested and non-nested hierarchical clustering methodologies. Simulated and real data results are presented to validate the performance of the method, with repeated runs showing high accuracy and reliability.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A kernel density smoothing method for determining an optimal number of clusters in continuous data

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Similarity-Based Approaches for Determining the Number of Trace Clusters in Process Discovery
Pieter De Koninck ... Jochen De Weerdt
-
Pieter De Koninck, et. al.Pieter De Koninck ... Jochen De Weerdt
01 Jan 2017
01 Jan 2017

Improving the Dynamic Clustering of Hyperspectral Data Based on the Integration of Swarm Optimization and Decision Analysis
Amin Alizadeh Naeini ... Mohammad Saadatseresht
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 7
Amin Alizadeh Naeini, et. al.Amin Alizadeh Naeini ... Mohammad Saadatseresht
01 Jun 2014
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 7

Objectively Determining the Number of Similar Hydrographic Clusters with Unsupervised Machine Learning
Carola Trahms ... Arne Biastoch
-
Carola Trahms, et. al.Carola Trahms ... Arne Biastoch
15 May 2023
15 May 2023

Development and validation of consensus clustering-based framework for brain segmentation using resting fMRI.
Srikanth Ryali ... Weidong Cai
Journal of neuroscience methods | VOL. 240
Srikanth Ryali, et. al.Srikanth Ryali ... Weidong Cai
29 Nov 2014
Journal of neuroscience methods | VOL. 240

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A kernel density smoothing method for determining an optimal number of clusters in continuous data

Abstract

Talk to us

Similar Papers