A Non-parametric Method for Data Clustering with Optimal Variable Weighting

Ji-Won Chung,In-Chan Choi

doi:10.1007/11875581_97

Abstract

AbstractSince cluster analysis in data mining often deals with large-scale high-dimensional data with masking variables, it is important to remove non-contributing variables for accurate cluster recovery and also for proper interpretation of clustering results. Although the weights obtained by variable weighting methods can be used for the purpose of variable selection (or, elimination), they alone hardly provide a clear guide on selecting variables for subsequent analysis. In addition, variable selection and variable weighting are highly interrelated with the choice on the number of clusters. In this paper, we propose a non-parametric data clustering method, based on the W-k-means type clustering, for an automated and joint decision on selecting variables, determining variable weights, and deciding the number of clusters. Conclusions are drawn from computational experiments with random data and real-life data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Non-parametric Method for Data Clustering with Optimal Variable Weighting

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Technique of Cluster analysis in Data mining
Wenyang Yu ... Yubing Yang
-
Wenyang Yu, et. al.Wenyang Yu ... Yubing Yang
01 Jan 2015
01 Jan 2015

The Smart Urban Planning Based on the Cluster Analysis Algorithm
Luo Ping ... Rong Hucheng
INTERNATIONAL JOURNAL ON Advances in Information Sciences and Service Sciences | VOL. 5
Luo Ping , et. al.Luo Ping ... Rong Hucheng
15 Feb 2013
INTERNATIONAL JOURNAL ON Advances in Information Sciences and Service Sciences | VOL. 5

Clustering Algorithm and Its Application in Data Mining
Hailei Zou
Wireless Personal Communications | VOL. 110
Hailei ZouHailei Zou
21 Aug 2019
Wireless Personal Communications | VOL. 110

Hypothesis oriented cluster analysis in data mining by visualization
Ke-Bing Zhang ... Kang Zhang
-
Ke-Bing Zhang, et. al.Ke-Bing Zhang ... Kang Zhang
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Non-parametric Method for Data Clustering with Optimal Variable Weighting

Abstract

Talk to us

Similar Papers