Minimum spanning tree‐based cluster analysis: A new algorithm for determining inconsistent edges

Fadi Şaar,Ahmet E Topcu

doi:10.1002/cpe.6717

Abstract

AbstractIn recent years, graph‐based data clustering algorithms have become popular as they perform connectivity‐based rather than centroid‐based partitioning. Methods related to minimum spanning tree (MST)‐based data clustering are types of graph‐based algorithms that can recognize arbitrary shapes of clusters by eliminating inconsistent edges from MST graphs. In all MST‐based data clustering algorithms, definition of inconsistent edges is the main problem that needs to be addressed. The longest edges in MST graphs are considered as inconsistent edges under ideal conditions. Nevertheless, outliers often exist in real‐world tasks, which makes the longest edges inaccurate cluster separation indicators. In this paper, we propose a new data clustering algorithm using MST and a critical distance method. The proposed algorithm solves the main issue of MST‐based data clustering, namely identifying and removing inconsistent edges to obtain clusters even in the event that the dataset contains some outliers. It begins by constructing the MST over a given weighted graph based on Euclidean distance and then splits up the graph into clusters by eliminating inconsistent edges using critical distance as a threshold. Integration of the advantages of both MST and critical distance methodology to obtain optimal clusters is the main contribution of this work. The conducted experimental analysis and results using different datasets prove that our proposed clustering algorithm yields better overall performance compared with the most common data clustering algorithms. Taking the Liver and Tumor datasets as an example, the proposed algorithm outperforms all other clustering algorithms with clustering accuracy equal to 0.579 and 0.660, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Minimum spanning tree‐based cluster analysis: A new algorithm for determining inconsistent edges

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience

Lead the way for us

Journal: Concurrency and Computation: Practice and Experience	Publication Date: Nov 18, 2021
Citations: 10

Similar Papers

Functional grouping of similar genes using eigenanalysis on minimum spanning tree based neighborhood graph
R Jothi ... Aparajita Ojha
Computers in Biology and Medicine | VOL. 71
R Jothi, et. al.R Jothi ... Aparajita Ojha
21 Feb 2016
Computers in Biology and Medicine | VOL. 71

A study of causality structure and dynamics in industrial electricity consumption based on Granger network
Can-Zhong Yao ... Xu-Zhou Zheng
Physica A: Statistical Mechanics and its Applications | VOL. 462
Can-Zhong Yao, et. al.Can-Zhong Yao ... Xu-Zhou Zheng
21 Jun 2016
Physica A: Statistical Mechanics and its Applications | VOL. 462

A scaled-MST-based clustering algorithm and application on image segmentation
Jia Li ... Xiali Wang
Journal of Intelligent Information Systems | VOL. 54
Jia Li, et. al.Jia Li ... Xiali Wang
13 Aug 2019
Journal of Intelligent Information Systems | VOL. 54

Data Clustering
-
-
--
17 Aug 2022
17 Aug 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Minimum spanning tree‐based cluster analysis: A new algorithm for determining inconsistent edges

Abstract

Talk to us

Similar Papers

More From: Concurrency and Computation: Practice and Experience