Abstract

Many clustering algorithms in the literature are robust against outliers. They are robust in the sense that they reduce the effect of outliers on the cluster centroid locations, but they do not produce efficient clusters because the outliers are still included in the final clusters. The limitation of these algorithms is that they do not identify outliers. In this paper, we propose an algorithm, density oriented fuzzy C-means (DOFCM), which identifies outliers based on the density of points in the dataset before creating clusters and results in ‘n + 1’ clusters: ‘n’ good clusters and one invalid cluster containing noise and outliers. The proposed technique is based on the concept that if these outliers are not required in clustering, then their memberships should not be involved during clustering. We nullify the effect of outliers by assigning them zero membership values during clustering. The algorithm is applied to various synthetic datasets and to Bensaid’s data, and is compared with well-known robust clustering techniques, namely PFCM, CFCM, and NC. The comparison indicates that, among these algorithms, DOFCM is the best method for recognising the original shape of clusters in noisy datasets.
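
The idea described in the abstract (flag low-density points first, then cluster with those points held at zero membership) can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not give the density definition, so the neighbourhood test (`radius`, `min_neighbors`), the farthest-point initialisation, and all function names below are assumptions chosen for illustration. The clustering step is the standard fuzzy C-means update with fuzzifier `m`.

```python
import math

def flag_outliers(points, radius, min_neighbors):
    """Assumed density test: a point is an outlier when fewer than
    min_neighbors other points lie within radius of it."""
    flags = []
    for i, p in enumerate(points):
        count = sum(1 for j, q in enumerate(points)
                    if j != i and math.dist(p, q) <= radius)
        flags.append(count < min_neighbors)
    return flags

def memberships(points, centers, m):
    """Standard FCM membership: u_ik = 1 / sum_j (d_ik / d_jk)^(2/(m-1))."""
    power = 2.0 / (m - 1.0)
    rows = []
    for p in points:
        d = [max(math.dist(p, c), 1e-12) for c in centers]  # avoid division by zero
        rows.append([1.0 / sum((di / dj) ** power for dj in d) for di in d])
    return rows

def fuzzy_c_means(points, n_clusters, m=2.0, n_iter=100):
    """Plain FCM with a deterministic farthest-point initialisation (an assumption)."""
    centers = [points[0]]
    while len(centers) < n_clusters:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    dims = range(len(points[0]))
    for _ in range(n_iter):
        u = memberships(points, centers, m)
        new_centers = []
        for i in range(n_clusters):
            w = [row[i] ** m for row in u]  # fuzzified weights for cluster i
            total = sum(w)
            new_centers.append(tuple(
                sum(wk * p[d] for wk, p in zip(w, points)) / total for d in dims))
        centers = new_centers
    return centers, memberships(points, centers, m)

def density_filtered_fcm(points, n_clusters, radius, min_neighbors, m=2.0):
    """Cluster only the dense points; flagged outliers keep zero membership
    in every good cluster and together form the extra 'invalid' cluster."""
    outlier = flag_outliers(points, radius, min_neighbors)
    inliers = [p for p, is_out in zip(points, outlier) if not is_out]
    centers, u_inliers = fuzzy_c_means(inliers, n_clusters, m=m)
    rows = iter(u_inliers)
    u = [[0.0] * n_clusters if is_out else next(rows) for is_out in outlier]
    return centers, u, outlier
```

Calling `density_filtered_fcm(points, n_clusters=2, radius=2.0, min_neighbors=2)` on a list of coordinate tuples returns the centroids of the good clusters, one membership row per input point, and the outlier flags. Because flagged points never enter the centroid update, they cannot pull the centroids, which is the property the abstract attributes to zero membership.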
