Abstract

Most of the existing clustering algorithms are often based on Euclidean distance measure. However, only using Euclidean distance measure may not be sufficient enough to partition a dataset with different structures. Thus, it is necessary to combine multiple distance measures into clustering. However, the weights for different distance measures are hard to set. Accordingly, it appears natural to keep multiple distance measures separately and to optimize them simultaneously by applying a multiobjective optimization technique. Recently a new clustering algorithm called ‘multiobjective evolutionary clustering based on combining multiple distance measures’ (MOECDM) was proposed to integrate Euclidean and Path distance measures together for partitioning the dataset with different structures. However, it is time-consuming due to the large-sized genes. This paper proposes a fast multiobjective fuzzy clustering algorithm for partitioning the dataset with different structures. In this algorithm, a real encoding scheme is adopted to represent the individual. Two fuzzy clustering objective functions are designed based on Euclidean and Path distance measures, respectively, to evaluate the goodness of each individual. An improved evolutionary operator is also introduced accordingly to increase the convergence speed and the diversity of the population. In the final generation, a set of nondominated solutions can be obtained. The best solution and the best distance measure are selected by using a semisupervised method. Afterwards, an updated algorithm is also designed to detect the optimal cluster number automatically. The proposed algorithms are applied to many datasets with different structures, and the results of eight artificial and six real-life datasets are shown in experiments. Experimental results have shown that the proposed algorithms can not only successfully partition the dataset with different structures, but also reduce the computational cost.

Highlights

  • With the rapid development of informatization, there are large amounts of data generating in all walks of life every day; the quantity continues to grow rapidly

  • The first experiment is to show that F-MFCMDM can partition the dataset with different structures by combining Euclidean distance and Path distance

  • A multiobjective fuzzy clustering algorithm called FMFCMDM is proposed to partition the dateset with different structures and to reduce the computational cost

Read more

Summary

Introduction

With the rapid development of informatization, there are large amounts of data generating in all walks of life every day; the quantity continues to grow rapidly. It is necessary to design many powerful and intelligent data analysis tools to explore the valuable knowledge from these data. Clustering analysis is such a tool and it partitions a dataset into some subgroups such that objects within a group should be as similar as possible while objects in different groups should be as dissimilar as possible. The most popular clustering techniques can be divided into partitioning clustering [4], hierarchical clustering [5], and density clustering [6] Among these clustering algorithms, partitioning clustering is the most popular one and has attracted more and more attention in recent years

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call