Abstract
The performance of data clustering algorithms is mainly dependent on their ability to balance between the exploration and exploitation of the search process. Although some data clustering algorithms have achieved reasonable quality solutions for some datasets, their performance across real-life datasets could be improved. This paper proposes an adaptive memetic differential evolution optimisation algorithm (AMADE) for addressing data clustering problems. The memetic algorithm (MA) employs an adaptive differential evolution (DE) mutation strategy, which can offer superior mutation performance across many combinatorial and continuous problem domains. By hybridising an adaptive DE mutation operator with the MA, we propose that it can lead to faster convergence and better balance the exploration and exploitation of the search. We would also expect that the performance of AMADE to be better than MA and DE if executed separately. Our experimental results, based on several real-life benchmark datasets, shows that AMADE outperformed other compared clustering algorithms when compared using statistical analysis. We conclude that the hybridisation of MA and the adaptive DE is a suitable approach for addressing data clustering problems and can improve the balance between global exploration and local exploitation of the optimisation algorithm.
Highlights
Data clustering is widely used in different applications to understand the structure of the data, to focus on a specific set of clusters for further analysis, and to detect the characteristics of each cluster
The results show that the proposed algorithm has a smaller best, average, worst and standard deviation compared with the other algorithms
The results indicate that Adaptive memetic differential evolution algorithm (AMADE) has shown consistent performance and better result than IKHCA, Combined K-harmonic means algorithm (KHM) with improved Cuckoo Search (ICS) and Particle Swarm Optimizer (PSO) algorithms (ICMPKHM), Age-based particle swarm optimization algorithm (PSOAG), Hybrid of Krill herd algorithm (KH) with Harmony search algorithm (HS) algorithms (H-KHA) and black hole (BH) on almost all the datasets
Summary
Data clustering is widely used in different applications to understand the structure of the data, to focus on a specific set of clusters for further analysis, and to detect the characteristics of each cluster. This section discusses the fundamental aspects of clustering analysis problem, differential evolution (DE) and memetic algorithms, which have been used in the proposed data clustering algorithm. This section discusses the relevant population-based approaches in the data clustering. Data Clustering is a process of partitioning a set of n objects into some clusters K, based on a specific similarity measure. Each cluster should consist of at least one object: Ci 61⁄4 ; 8 i 2 f1; 2; . Different clusters should not have objects in common: Ci \ Cj 1⁄4 ; 8 i 61⁄4 j and i; j 2 f1; 2; . Every object must be attached to a cluster: Sk i1⁄41
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.