Abstract

Determining 'the initial seed for clustering' is an issue in k-means which has attracted considerable interest, especially in recent years. Despite its popularity among clustering algorithms, k-means still has many problems such as converging to the local optimum solutions, the results obtained are strongly depends upon the selection of initial seeds, number of clusters need to be known in advance etc. Various initialisation methods were proposed to improve the performance of k-means algorithm. In this paper, a novel approach, k-minimum-average-maximum (k-MAM), is proposed for finding the initial centroids by considering distance on extreme ends. The proposed algorithm is tested with UCI repository datasets and data collected from Facebook. We compared our proposed method with simple k-means and k-means++ initialisation method based on efficiency and effectiveness. The results show that the proposed algorithm converges very fast with better accuracy.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.