Abstract
One of the difficult tasks in data clustering is clustering the high dimensional data. Clustering high dimensional data has been a major concern owing to the intrinsic sparsity of the data points. Several recent research results signifies that in case of high dimensional data, even the notion of proximity or clustering possibly will not be significant. Fuzzy C-means (FCM) and possibilistic C-means (PCM) has the capability to handle the high dimensional data, whereas FCM is sensitive to noise and PCM requires appropriate initialisation to converge to nearly global minimum. Hence to overcome this issue a fuzzy possibilistic C-means (FPCM) with symmetry-based distance measure has been proposed which can find out the number of clusters that exist in a dataset. In addition with a good fuzzy partitioning of the data, a novel fuzzy cluster validity index called FSym-index is used which depends on the symmetry-based distance. Symmetry-based distance provides a measure of integrity of clustering on several fuzzy partitions of a dataset. If the value of FSym-index is larger, the accuracy also becomes high with less execution time.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Computational Intelligence Studies
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.